Talend

Talend is an open source ETL development (graphical), monitoring and scheduling tool. In purview of this chapter, we are only taking the ETL capability of Talend into discussion but it has a suite of products having different capabilities ideal for big data. Talend is supported by a very large community and has a huge amount of connectors (800+, largest connector library), using which you will be able to do the integration work with a variety of tools and technologies with ease. Talend is a mature product and supports a variety of big data technologies.

Having a rich set of connectors, Talend can integrate and transfer data from a variety of database systems to Hadoop without much trouble and is a viable alternate to Sqoop. Talend also has a graphical user interface, using which the data pipelines can be authored and executed, making it very user-friendly to operate. It also has a Sqoop connector, using which Sqoop’s advantages can also be brought into your big data landscape.

A reference transformation graph taken from talendexpert.com is as shown in the following figure (Figure 11):

Figure 27: Talend graphical user interface for ETL development
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.16.79.147