Talend Tutorial
Talend Tutorial
Talend Tutorial
This tutorial helps you to learn all the fundamentals of Talend tool for data integration and
big data with examples.
Audience
This tutorial is for beginner's who are aspiring to become an ETL expert. It is also ideal for
Big Data professionals who are looking to use an ETL tool with Big Data ecosystem.
Prerequisites
Before proceeding with this tutorial, you should be familiar with basic Data warehousing
concepts as well as fundamentals of ETL (Extract, Transform, Load). If you are a beginner
to any of these concepts, we suggest you to go through tutorials based on these concepts
first to gain a solid understanding of Talend.
All the content and graphics published in this e-book are the property of Tutorials Point (I)
Pvt. Ltd. The user of this e-book is prohibited to reuse, retain, copy, distribute or republish
any contents or a part of contents of this e-book in any manner without written consent
of the publisher.
We strive to update the contents of our website and tutorials as timely and as precisely as
possible, however, the contents may contain inaccuracies or errors. Tutorials Point (I) Pvt.
Ltd. provides no guarantee regarding the accuracy, timeliness or completeness of our
website or its contents including this tutorial. If you discover any errors on our website or
in this tutorial, please notify us at contact@tutorialspoint.com
i
Talend
Table of Contents
About the Tutorial ................................................................................................................................ i
Audience............................................................................................................................................... i
Prerequisites ......................................................................................................................................... i
Table of Contents................................................................................................................................. ii
Benefits ............................................................................................................................................... 8
ii
Talend
Introduction....................................................................................................................................... 40
iii
Talend
iv
Talend
1. Talend – Introduction
Talend is a software integration platform which provides solutions for Data integration,
Data quality, Data management, Data Preparation and Big Data. The demand for ETL
professionals with knowledge on Talend is high. Also, it is the only ETL tool with all the
plugins to integrate with Big Data ecosystem easily.
According to Gartner, Talend falls in Leaders magic quadrant for Data Integration tools.
Talend also offers Open Studio, which is an open source free tool used widely for Data
Integration and Big Data.
1
Talend
2. Talend – System Requirements
The following are the system requirements to download and work on Talend Open Studio:
Microsoft Windows 10
Ubuntu 16.04 LTS
Apple macOS 10.13/High Sierra
Memory Requirement
Besides, you also need an up and running Hadoop cluster (preferably Cloudera.
2
Talend
3. Talend – Installation
To download Talend Open Studio for Big Data and Data Integration, please follow the steps
given below:
Step 2: After the download finishes, extract the contents of the zip file, it will create a
folder with all the Talend files in it.
Step 3: Open the Talend folder and double click the executable file: TOS_BD-win-
x86_64.exe. Accept the User License Agreement.
3
Talend
Step 5: Click Allow Access in case you get Windows Security Alert.
4
Talend
5
Talend
6
Talend
4. Talend — Talend Open Studio
Talend Open Studio is a free open source ETL tool for Data Integration and Big Data. It is
an Eclipse based developer tool and job designer. You just need to Drag and Drop
components and connect them to create and run ETL or ETL Jobs. The tool will create the
Java code for the job automatically and you need not write a single line of code.
There are multiple options to connect with Data Sources such as RDBMS, Excel, SaaS Big
Data ecosystem, as well as apps and technologies like SAP, CRM, Dropbox and many more.
Some important benefits which Talend Open Studio offers are as below:
Provides all features needed for data integration and synchronization with 900
components, built-in connectors, converting jobs to Java code automatically and
much more.
The tool is completely free, hence there are big cost savings.
In last 12 years, multiple giant organizations have adopted TOS for Data
integration, which shows very high trust factor in this tool.
Talend keeps on adding features to these tools and the documentations are well
structured and very easy to follow.
7
Talend
5. Talend – Data Integration
Most organizations get data from multiple places and are store it separately. Now if the
organization has to do decision making, it has to take data from different sources, put it
in a unified view and then analyze it to get a result. This process is called as Data
Integration.
Benefits
Data Integration offers many benefits as described below:
Saves time and eases data analysis, as the data is integrated effectively.
Automated data integration process synchronizes the data and eases real time and
periodic reporting, which otherwise is time consuming if done manually.
Data which is integrated from several sources matures and improves over time,
which eventually helps in better data quality.
Creating a Project
Double click on TOS Big Data executable file, the window shown below will open.
Select Create a new project option, mention the name of the project and click on Create.
8
Talend
Importing a Project
Double click on TOS Big Data executable file, you can see the window as shown below.
Select Import a demo project option and click Select.
9
Talend
You can choose from the options shown below. Here we are choosing Data Integration
Demos. Now, click Finish.
10
Talend
You can see your imported project under existing projects list.
11
Talend
Give Project Name and select the “Select root directory” option.
12
Talend
Browse your existing Talend project home directory and click Finish.
Opening a Project
Select a project from existing project and click Finish. This will open that Talend project.
13
Talend
14