This course is designed to teach you about the Streams Processing Language (SPL). It begins with the basic concepts of InfoSphere Streams and the basic SPL operators used in a Streams program. You will learn how to access data from an external source using the Source-type operators and how to write an output stream using the Sink-type operators.
You will then learn how and when to use the various Streams operators, such as the Functor, Punctor, Aggregate, Sort, Join, Split, Barrier, Delay, and Switch operators. Lab exercises use the Eclipse-based InfoSphere Streams IDE as the development and testing environment, but you will also get the opportunity to compile a Streams program from the command line. In the labs you can choose to develop the applications using either the drag-and-drop SPL Graphical Editor, introduced in Version 3, or the original text-based SPL Editor.
The second half of the course shows how to control the placement of processing elements and how to use the debugging capabilities of the Streams Processing Language. You will learn about consistent regions and how to use them to guarantee at-least-once tuple processing.
You will be introduced to the various toolkits supplied with InfoSphere Streams and work with data mining and database toolkits in a lab.
Finally, you will be shown how to extend the Streams Processing Language through the development of user-defined functions and both generic and non-generic primitive operators. Both C++ and Java non-generic primitive operators are covered.