![IBM Watson Projects](https://wfqqreader-1252317822.image.myqcloud.com/cover/334/36699334/b_36699334.jpg)
上QQ阅读APP看书,第一时间看更新
Refine
One of the more important IBM Watson tasks is Refine (which we mentioned earlier in this chapter). Here will walk through the basics, using some Watson sample data:
- From the Quick-start bar, click on Refine. From the Refine data set dialog, you then can scroll down and select Sample data; at this point you will see the Sample data dialog, which is displayed as follows:
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/7895c4e8-2689-4500-9024-0ad45d7a6cba.png?sign=1739352431-zwlB3XQfHBd4z50vU4Bi7SGVTkrHjWOF-0-fd7b43620b454f12630ce544d9d89d94)
- IBM Watson provides a nice list of sample data, each worth spending a bit of your time exploring and experimenting with. For now, let's pick Bike Sharing data set and then click on Upload:
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/fd55e780-04aa-498b-bd90-1f88f15c70ec.png?sign=1739352431-Fu3xyv7X3tVqX4Zejfxgu6L2yseuyeof-0-e0fb5a5598f3380791020ea279b428ff)
To access the sample data it needs to be uploaded. Once uploaded it will appear as an informational tile (shown as follows) that can be selected and used:
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/4088c37d-e3b9-4408-8695-4db61158c546.png?sign=1739352431-Gj4GZmGCv355nzLVAEcCK8LbZkr68iyv-0-33615f6aa2789922c3124e9dd5282a2a)
- Now that we have our data loaded and available, you can select it from the Refine data set list, which automatically loads it into the Refine page (which looks a lot like an Excel worksheet):
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/96bccdd2-acf4-41bb-bccf-244165702c5a.png?sign=1739352431-d2PXomgcqDzxZQo397z5yLiTm1eYu2Ri-0-765713ff3f860c81adad7ae048e1b67e)
There are many tasks you can perform using Refine, such as:
- General housekeeping: Such as renaming columns, changing data types, or creating a subset of the data by filtering out irrelevant records
- Summarization: By altering the default aggregations
- Enrichment: By adding calculated fields, hierarchies and groups
- Review the metrics of the data, such as a quality score by data field or column
For now, let's assume we've made some of the previously-mentioned refinements to our data and want to save it as a new file. To do that, you simply click on the SAVE icon (looks such as a tiny diskette in the upper left of the page), enter an appropriate name for the new file, and click Save (on the Save as popup shown as follows):
![](https://epubservercos.yuewen.com/014768/19470387508851706/epubprivate/OEBPS/Images/4bca9538-35e3-410a-aa32-ef69105a92a6.png?sign=1739352431-pBLcmRbdcozquFdy26NUnUWHQVSBBuRq-0-e215f84762acf871d92c38d009b8af17)
If you are working in a multi-user environment, your new (refined) dataset is saved by default in your personal folder. To share your refined dataset with others, move it to a shared folder.