Preparation and transformation of data with IBM Data Refinery is the third course in the learning path for professional data scientists that are working with the IBM Cloud Pak for Data platform. The course aims to familiarize data scientists with the data cleansing, and data shaping capabilities of the Data Refinery tool. Data Refinery saves preparation time by quickly transforming large amounts of raw data into consumable, high-quality information that's ready for analytics.
Learners follow the story of Sara (the data scientist), Muneiza (the data engineer), Liam (the data steward), and Tim (the data quality analyst) working in the Data Analytics department of a large health products company. The company plans a marketing campaign around coupons that are issued to customers and wants to better understand customer behavior. But they first need to access and prepare the relevant data for analytics. The team will mainly use IBM Data Refinery for this task.
Follow along with Sara, Muneiza, Liam, and Timâs story as learners create a suitable data set ready for analytics. Learners verify their acquired knowledge by completing several hands-on lab exercises in a remote classroom environment (provided to each learner during the course introduction).