LENGTH: 8 Hours
Contains a PDF course guide as well as a lab environment where students can work through demonstrations and exercises at their own pace.
This course teaches data engineers how to run DataStage jobs in a Hadoop environment. You will run jobs in traditional and YARN modes, and access HDFS files and Hive tables using different file formats and connector stages.
If you are enrolling in a Self-Paced Virtual Classroom or Web-Based Training course, please review the Self-Paced Virtual Classes and Web-Based Training Classes on our Terms and Conditions page, as well as the system requirements, before you enroll to ensure that your system meets the minimum requirements for this course. http://www.ibm.com/training/terms
Please refer to the course overview.
This course is intended for Analysts and Contributors.
• Knowledge of OLAP fundamentals is recommended
• Introductions to BigData and BigQuality
• YARN for dynamic DataStage job allocation
• Tracing and debugging in YARN mode
• Using log files
• Understanding configuration parameters
• Accessing Hadoop data using WebHDFS and HttpFS and using various connector stages
Prior to enrolling, IBM Employees must follow their Division/Department processes to obtain approval to attend this public training class. Failure to follow Division/Department approval processes may result in the IBM Employee being personally responsible for the class charges.
GBS practitioners who use the EViTA system to request external training should use that same process for this course. Go to the EViTA site to start this process:
Once you enroll in a GTP class, you will receive a confirmation letter that should show:
02 Apr 2023
Self Paced Training