Chapter 4. Controlling the Flow of Data

In the previous chapter, you learned the basics of transforming data. Basically you read data from some file, did some transformation to the data, and sent the data back to a different output. This is the simplest scenario. Think of a different situation. Suppose you collect results from a survey. You receive several files with the data and those files have different formats. You have to merge those files somehow and generate a unified view of the information. You also want to put aside the rows of data whose content is irrelevant. Finally, based on the rows that interest you, you want to create another file with some statistics. This kind of requirement is very common. In this chapter you will learn how to implement it with PDI.

Splitting streams

Until now, you have been working with simple, straight flows of data. When you deal with real problems, those simple flows are not enough. Many times, the rows of your dataset have to take different paths. This situation is handled very easily, and you will learn how to do it in this section.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.117.187.62