LENGTH: 8 Hours (1 day)
Teaches data engineers how to run DataStage jobs in a Hadoop environment. You will run jobs in traditional and YARN mode, access HDFS files and Hive tables using different file formats and connector stages.
Please refer to course overview
This course is intended for Analysts and Contributors.
• Knowledge of OLAP fundamentals is recommended
• Introductions to BigData and BigQuality• YARN for dynamic DataStage job allocation• Tracing and debugging in YARN mode• Using log files• Understanding configuration parameters• Accessing Hadoop data using WebHDFS and HttpFS and using various connector stages
Prior to enrolling, IBM Employees must follow their Division/Department processes to obtain approval to attend this public training class. Failure to follow Division/Department approval processes may result in the IBM Employee being personally responsible for the class charges.
GBS practitioners that use the EViTA system for requesting external training should use that same process for this course. Go to the EViTA site to start this process:
Once you enroll in a GTP class, you will receive a confirmation letter that should show: