Orange is a wonderful training tool for new learners who are trying to work with data mining. I was fortunate enough to go through the tutorial videos. I found the following things interesting in Orange.
1. Lots of interesting data sets are available to learn from.
2. Lots of video tutorials, which start with a 2-step workflow and gradually move to specialized topics.
3. A few example templates are available for starters and can be created in one click.
4. Data visualizations such as scatter plots, hierarchical clustering dendrograms, linear regression charts, and k-means clustering charts are possible.
5. It also splits the data into training, cross-validation, and test sets, and it trains various models such as linear regression, classification, and clustering models.
6. Interactive selection: Selecting a few data points on a chart pipes them into the next workflow component, such as "Data Table" or "Scatter Plot". I did not expect that kind of interactive UI.
7. Interactive k-means, which shows the position of the centroids, was also possible.
8. Measures: Accuracy, confidence, Interstates measures, and the confusion matrix are there. Distance tools such as Euclidean distance and Jaccard distance are there too; almost all of them were available.
9. Add-ons: Education, Text, Image, and geospatial add-ons, plus a lot more domain-specific measures, are also available.
10. It is easy to do PCA in a flick.
11. Paint Data: It is such a wonderful feature. We can paint the spread of the data just like a kid would paint in any painting tool.
12. It is built on Python rather than R, and it can also be used as a Python library for scripting.
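To get a feel for what the cross-validation split in point 5 does, here is a minimal sketch in plain Python. It is not Orange's own code; the function name `k_fold_indices` and its parameters are made up for illustration, and it assumes the rows are simply shuffled and dealt into k folds.

```python
import random


def k_fold_indices(n_rows, k, seed=0):
    """Yield (train, validation) index lists for k-fold cross validation."""
    indices = list(range(n_rows))
    # Shuffle with a fixed seed so the split is repeatable.
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of nearly equal size.
    folds = [indices[i::k] for i in range(k)]
    for i, validation in enumerate(folds):
        # Every fold except the i-th one is used for training.
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, validation


if __name__ == "__main__":
    for train, validation in k_fold_indices(n_rows=10, k=5):
        print("validation rows:", sorted(validation), "| training rows:", len(train))
```

Each row lands in the validation set exactly once, so every model score is computed on data the model did not see during training.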
The tool made me realize that many measures are just a drag and drop away and that we can draw workflows and pipelines in a flick, but it is still important to understand the domain and the measures and to work on a variety of datasets.
If you are new like me, give Orange a try :)
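As a small exercise in understanding the measures from point 8, here is a rough sketch of what some of them compute, written in plain Python. The function names are my own for illustration and this is not how Orange implements them.

```python
import math
from collections import Counter


def accuracy(actual, predicted):
    """Fraction of predictions that match the true labels."""
    return sum(a == p for a, p in zip(actual, predicted)) / len(actual)


def confusion_matrix(actual, predicted):
    """Count how often each (actual, predicted) label pair occurs."""
    return Counter(zip(actual, predicted))


def euclidean_distance(x, y):
    """Straight-line distance between two numeric vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def jaccard_distance(a, b):
    """1 minus the share of items the two sets have in common."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0  # two empty sets are treated as identical
    return 1 - len(a & b) / len(a | b)


if __name__ == "__main__":
    actual = ["yes", "yes", "no", "no"]
    predicted = ["yes", "no", "no", "no"]
    print(accuracy(actual, predicted))          # 3 of 4 correct: 0.75
    print(confusion_matrix(actual, predicted))  # counts per (actual, predicted) pair
    print(euclidean_distance([0, 0], [3, 4]))   # 5.0
    print(jaccard_distance({"a", "b"}, {"b", "c"}))
```

Working through a toy example like this by hand makes the numbers in Orange's widgets much easier to interpret.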