Goals
- Understand the data integration process
- Master the Talend Open Studio tool and the component library
Program
Data integration issues
Operational and Business Intelligence integration
ETL (Extract Transform Load) tools available
Talend Open Studio: presentation, licenses, link with Eclipse
Support available: documentation, community, etc.
Workshop: Installation and presentation of the Talend Open Studio environment
Talend perspectives and views
Using the Business Modeler
Jobs: grid, palette of components
Main flows (main)
The notion of schema: types of columns
First components: input, output, flow generation
Practical workshop: creation and execution of a simple job, generation of content in a file
Input components (tFileInput, tMysqlInput,…)
Generation of content (tRowGenerator, tFixedFlowInput, …)
Outputs and display (tLogRow, tFileOutput, tMysqlOutput,…)
Logic and chaining of components
Step-by-step monitoring of job execution traces
Reject management
Simple transformations (filter, sort, replace, join)
Practical workshop: generation of content in a database, extraction and processing of data for display in console
Job versions and statuses
Metadata: properties and schemas.
Schema discovery for CSV files
Retrieving database schemas
Job updates from metadata
Practical workshop: inserting data sources and targets in the repository metadata
Use variables in a job (tSetGlobalVar)
Setting up a context local to the job
Context groups in the repository
Metadata as contexts
Practical workshop: use a variable in a job, contextualize the parameters of a metadata item, create a test and production context
tMap: the ETL component par excellence
Managing inputs: main, secondary flows, naming of flows
Managing outputs: the integrated editor of tMap
Filtering data: input, output
Joins, capturing output rejects and inner join rejects
The expression editor
Performance
Practical workshop: multi-source, multi-destination extraction with a tMap, on a complete example
Parallel executions of sub-jobs
Talend triggers
Chaining of sub-jobs and components (OnSubjobOK, OnComponentOK)
Handling component and sub-job errors (OnComponentError, OnSubjobError)
Start and end component (tPreJob, tPostJob)
Catch and throw errors: tWarn, tDie, tLogCatcher
Practical workshop: assemble several jobs into one, factor out error handling to generate a report file
Duration
3 days
Price
£ 1625
Audience
System / database administrators, Project managers, Developers
Prerequisites
Knowledge of Java
Reference
BUS875-F
Sessions
Contact us for more information about session dates