
Goals


- Understand the data integration process

- Master the Talend Open Studio tool and the component library

Program

Data integration issues
Operational and Business Intelligence integration
ETL (Extract Transform Load) tools available
Talend Open Studio: presentation, licenses, link with Eclipse
Support available: documentation, community, etc.

Workshop: Installation and presentation of the Talend Open Studio environment

Talend perspectives and views
Using the Business Modeler
Jobs: grid, palette of components
Main flows (main)
The notion of schema: types of columns
First components: input, output, flow generation

Practical workshop: creation and execution of a simple job, generation of content in a file

Input components (tFileInput, tMysqlInput,…)
Generation of content (tRowGenerator, tFixedFlowInput, …)
Outputs and display (tLogRow, tFileOutput, tMysqlOutput,…)
Logic and chaining of components
Step-by-step monitoring of the job's execution traces
Reject management
Simple transformations (filter, sort, replace, join)

Practical workshop: generation of content in a database, extraction and processing of data for display in console

Job versions and statuses
Metadata: properties and schemas.
Schema discovery for CSV files
Retrieving database schemas
Job updates from metadata

Practical workshop: inserting data sources and targets in the repository metadata

Use variables in a job (tSetGlobalVar)
Setting up a context local to the job
Context groups in the repository
Metadata as contexts

Practical workshop: use a variable in a job, contextualize the parameters of a metadata, create a test and production context

tMap: the ETL component par excellence
Managing inputs: main, secondary flows, naming of flows
Managing outputs: the integrated editor of tMap
Filtering data: input, output
Joined outputs, capturing output rejects, inner join
The expression editor
Performance

Practical workshop: multi-source, multi-destination extraction with a tMap, on a complete example

Parallel executions of sub-jobs
Talend triggers
Chaining of sub-jobs and components (OnSubjobOK, OnComponentOK)
Management of component errors, sub-job (OnComponentError, OnSubjobError)
Start and end component (tPreJob, tPostJob)
Catch and throw errors: tWarn, tDie, tLogCatcher

Practical workshop: assemble several jobs into one, factorize error management to generate a report file

Duration

5 days

Price

£ 2568

Audience

System / database administrators, Project managers, Developers

Prerequisites

Knowledge of Java

Reference

BUS305-F

Differentiate main flows from iterate flows
Loops and programming via tLoop, tForeach
Switch from a main flow to an iterate flow, and vice versa (tFlowToIterate, tIterateToFlow)
Job synchronization with tWaitForFile, tWaitForSocket, tWaitForSQL

Lab: parallelize sub-jobs, synchronize jobs with an external event

Use of advanced components: Java or regular expressions
The specifics of XML flows in Talend
Code reuse: routines

Workshop: Using the tJavaRow, tJavaFlex components, using the embedded Java editor, deploying a custom routine
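
To give an idea of what a custom routine looks like: in Talend Open Studio a routine is an ordinary Java class exposing public static methods, which can then be called from component expressions. The sketch below is only an illustration. The class name StringCleanup and both method names are hypothetical and do not come from the course material, and the package declaration that the Studio adds to routines is left out so the class can be compiled on its own.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical routine class (name and methods are illustrative assumptions).
// A Talend routine is a plain Java class with public static methods.
public class StringCleanup {

    // One or more whitespace characters in a row.
    private static final Pattern MULTI_SPACE = Pattern.compile("\\s+");

    // A standalone run of exactly five digits.
    private static final Pattern FIVE_DIGITS = Pattern.compile("\\b(\\d{5})\\b");

    // Trims the value and collapses inner whitespace runs to a single space.
    public static String normalizeSpaces(String value) {
        if (value == null) {
            return null;
        }
        return MULTI_SPACE.matcher(value.trim()).replaceAll(" ");
    }

    // Returns the first five-digit code found in the text, or null if none.
    public static String extractFiveDigitCode(String text) {
        if (text == null) {
            return null;
        }
        Matcher m = FIVE_DIGITS.matcher(text);
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        System.out.println(normalizeSpaces("  Talend   Open  Studio "));
        System.out.println(extractFiveDigitCode("10 rue de la Paix 75002 Paris"));
    }
}
```

Inside a job, such a method would typically be called from an expression, for example StringCleanup.normalizeSpaces(row1.name), where row1.name is an assumed column of an assumed input flow.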

Reminders on debugging traces
Using the Java Debug mode: Debug perspective, step-by-step mode
Visualization and hot modification of variables

Workshop: Basic / advanced task debugging

Reminder on the contexts
Export of jobs for execution on a server
Passing context between jobs at runtime
Use of a file to load the context (tContextLoad)

Workshop: Deployment of tasks and dynamic loading of the context
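
As a rough illustration of dynamic context loading: tContextLoad receives a flow of key/value pairs, usually read from a file, and assigns each value to the context variable of the same name. The standalone sketch below mimics that idea outside of Talend. It is not Talend's implementation; the class name ContextFileSketch, the semicolon separator and the sample keys are all assumptions made for the example.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-alone sketch of key/value context loading.
// The file layout (key;value per line, '#' for comments) is an assumption.
public class ContextFileSketch {

    // Parses "key;value" lines into an ordered map,
    // skipping blank lines, comment lines and lines without a separator.
    public static Map<String, String> parse(List<String> lines) {
        Map<String, String> context = new LinkedHashMap<>();
        for (String line : lines) {
            String trimmed = line.trim();
            if (trimmed.isEmpty() || trimmed.startsWith("#")) {
                continue;
            }
            int sep = trimmed.indexOf(';');
            if (sep < 0) {
                continue; // malformed line, ignored in this sketch
            }
            String key = trimmed.substring(0, sep).trim();
            String value = trimmed.substring(sep + 1).trim();
            context.put(key, value);
        }
        return context;
    }

    public static void main(String[] args) {
        // Sample content of an assumed production context file.
        List<String> lines = Arrays.asList(
                "# production context",
                "db_host;db.example.com",
                "db_port;3306");
        System.out.println(parse(lines));
    }
}
```

Keeping one such file per environment (test, production) lets the same exported job run unchanged against different targets.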

Use of advanced components: Java or regular expressions
Creation of specific logs
Management of XML flows
Reuse of code

Workshop: Use of the tJavaRow, tJavaFlex components – creation of logs with tCatch, tDie … – use of the embedded Java editor

Essential concepts for extending Talend Open Studio
Creation of the component mockup with tJavaFlex / graphical interface
Discover the Talend forge and its community components
Integration into Talend Open Studio of the plugins created or downloaded

Workshop: Creation of a simple component via tJavaFlex, integration of components from the Talend forge

Sessions

Contact us for more information about session dates.