bright idea

Goals


- Understand the data integration process

- Master the Talend Open Studio tool and the component library

Program

Data
integration issue Operational and Business Intelligence Integration
ETL (Extract Transform Load) tools available
Talend Open Studio: presentation, licenses, link with Eclipse
Support available: documentation, community, etc.

Workshop: Installation and presentation of the Talend Open Studio environment

Talend perspectives and views
Using the Business Modeler
Jobs: grid, palette of components
Main flows (main)
The notion of schema: types of columns
First components: input, output, flow generation

Practical workshop: creation and execution of a simple job, generation of content in a file

Input components (tFileInput, tMysqlInput,…)
Generation of content (tRowGenerator, tFixedFlowInput, …)
Outputs and display (tLogRow, tFileOutput, tMysqlOutput,…)
Logic and chaining of components
Step by step monitoring of execution traces of the Reject
management job
Simple transformations (filter, sort, replace, join)

Practical workshop: generation of content in a database, extraction and processing of data for display in console

Job versions and statuses
Metadata: properties and schemas.
Schema discovery for CSV files
Retrieving database schemas
Job updates from metadata

Practical workshop: inserting data sources and targets in the repository metadata

Use variables in a job (tSetGlobalVar)
Setting up a context local to the job
Context groups in the repository
Metadata as contexts

Practical workshop: use a variable in a job, contextualize the parameters of a metadata, create a test and production context

The ETL component par excellence
Managing inputs: main, secondary flows, naming of flows
Managing outputs: the integrated editor of tMap
Filtering data: input, output
Joint outputs, capture of output rejects, internal join
The expression editor
Performances

Practical workshop: multi-source, multi-destination extraction with a tMap, on a complete example

Parallel executions of sub-jobs
Talend triggers
Chaining of sub-jobs and components (OnSubjobOK, OnComponentOK)
Management of component errors, sub-job (OnComponentError, OnSubjobError)
Start and end component (tPreJob, tPostJob)
Catch and throw errors: tWarn, tDie, tLogCatcher

Practical workshop: assemble several jobs into one, factorize error management to generate a report file

Duration

3 days

Price

£ 1625

Audience

System administrators / BDD, Project managers, Developers

Prerequisites

Knowledge of Java

Reference

BUS875-F

Sessions

Contact us for more informations about session date