LENGTH: 24 Hours (3 days)
This course will step you through the QualityStage data cleansing process. You will transform an unstructured data source into a format suitable for loading into an existing data target. You will cleanse the source data by building a customer rule set that you create and use that rule set to standardize the data. You will next build a reference match to relate the cleansed source data to the existing target data.
Please refer to the course overview
The intended audience for this course are: • QualityStage programmers • Data Analysts responsible for data quality using QualityStage • Data Quality Architects • Data Cleansing Developers • Data Quality Developers needing to customize QualityStage rule sets
Participants should have: • Compled the QualityStage Essentials course, or have equivalent experience • familiarity with Windows and a text editor • familiarity with elementary statistics and probability concepts (desirable but not essential)
After completing this course, you should be able to:• Modify rule sets• Build custom rule sets• Standardize data using the custom rule set• Perform a reference match using standardized data and a reference data set• Use advanced techniques to refine a Two-source match
Prior to enrolling, IBM Employees must follow their Division/Department processes to obtain approval to attend this public training class. Failure to follow Division/Department approval processes may result in the IBM Employee being personally responsible for the class charges.
GBS practitioners that use the EViTA system for requesting external training should use that same process for this course. Go to the EViTA site to start this process: http://w3.ibm.com/services/gbs/evita/BCSVTEnrl.nsf
Once you enroll in a GTP class, you will receive a confirmation letter that should show:
The current GTP list price
The 20% discounted price available to IBMers. This is the price you will be invoiced for the class.