Goals
- Understand the data integration process
- Master the Talend Open Studio tool and the component library
Program
Data integration issues
Operational and Business Intelligence integration
ETL (Extract Transform Load) tools available
Talend Open Studio: presentation, licenses, link with Eclipse
Support available: documentation, community, etc.
Workshop: Installation and presentation of the Talend Open Studio environment
Talend perspectives and views
Using the Business Modeler
Jobs: grid, palette of components
Main flows (main)
The notion of schema: types of columns
First components: input, output, flow generation
Practical workshop: creation and execution of a simple job, generation of content in a file
Input components (tFileInput, tMysqlInput,…)
Generation of content (tRowGenerator, tFixedFlowInput, …)
Outputs and display (tLogRow, tFileOutput, tMysqlOutput,…)
Logic and chaining of components
Step-by-step monitoring of job execution traces
Reject management
Simple transformations (filter, sort, replace, join)
Practical workshop: generation of content in a database, extraction and processing of data for display in console
Job versions and statuses
Metadata: properties and schemas.
Schema discovery for CSV files
Retrieving database schemas
Job updates from metadata
Practical workshop: inserting data sources and targets in the repository metadata
Use variables in a job (tSetGlobalVar)
Setting up a context local to the job
Context groups in the repository
Metadata as contexts
Practical workshop: use a variable in a job, contextualize the parameters of a metadata, create a test and production context
tMap: the ETL component par excellence
Managing inputs: main, secondary flows, naming of flows
Managing outputs: the integrated editor of tMap
Filtering data: input, output
Joined outputs, capture of output rejects, inner join
The expression editor
Performance
Practical workshop: multi-source, multi-destination extraction with a tMap, on a complete example
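To illustrate the inner join with reject capture covered in this module, here is a minimal plain-Java sketch. It is not tMap's generated code: the class `InnerJoinSketch`, the row layout (name, country code) and the output format are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of an inner join between a main flow and a
// lookup flow, with unmatched main rows captured as rejects.
class InnerJoinSketch {

    // Holds the two outputs: matched rows and inner-join rejects.
    static final class Result {
        final List<String> joined = new ArrayList<>();
        final List<String> rejects = new ArrayList<>();
    }

    // mainRows: each row is {name, countryCode}; lookup: countryCode -> country name.
    static Result join(List<String[]> mainRows, Map<String, String> lookup) {
        Result result = new Result();
        for (String[] row : mainRows) {
            String country = lookup.get(row[1]);
            if (country != null) {
                result.joined.add(row[0] + ";" + country);   // match found: joined output
            } else {
                result.rejects.add(row[0] + ";" + row[1]);   // no match: reject output
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> lookup = new HashMap<>();
        lookup.put("FR", "France");
        List<String[]> rows = Arrays.asList(
                new String[] {"Alice", "FR"},
                new String[] {"Bob", "XX"});
        Result r = join(rows, lookup);
        System.out.println("joined:  " + r.joined);
        System.out.println("rejects: " + r.rejects);
    }
}
```

In this sketch the lookup is held in memory as a map, which is one reason lookup size matters when thinking about join performance.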
Parallel executions of sub-jobs
Talend triggers
Chaining of sub-jobs and components (OnSubjobOK, OnComponentOK)
Management of component errors, sub-job (OnComponentError, OnSubjobError)
Start and end component (tPreJob, tPostJob)
Catch and throw errors: tWarn, tDie, tLogCatcher
Practical workshop: assemble several jobs into one, factorize error management to generate a report file
Duration
5 days
Price
£ 2568
Audience
System / database administrators, Project managers, Developers
Prerequisites
Knowledge of Java
Reference
BUS305-F
Differentiate main flows from iterate flows
Loops and programming via tLoop, tForeach
Switch from a main flow to an iterate flow, and vice versa (tFlowToIterate, tIterateToFlow)
Job synchronization with tWaitForFile, tWaitForSocket, tWaitForSQL
Lab: parallelize subjobs, synchronize jobs with external event
Use of advanced components: Java or regular expressions
The specifics of XML flows in Talend
Code reuse: routines
Workshop: Using the tJavaRow, tJavaFlex components, using the embedded Java editor, deploying a custom routine
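As an illustration of what a custom routine can look like, here is a minimal sketch. A Talend routine is a Java class exposing public static methods; the class name `StringRoutines` and both methods are hypothetical, and the `package routines;` declaration used inside the Studio is omitted so that the example compiles on its own.

```java
import java.util.Locale;

// Hypothetical routine class. Inside Talend Open Studio it would sit in the
// "routines" package; the package line is omitted here to keep this standalone.
class StringRoutines {

    // Trims the input and capitalizes its first letter; null stays null.
    public static String capitalizeName(String raw) {
        if (raw == null) {
            return null;
        }
        String trimmed = raw.trim();
        if (trimmed.isEmpty()) {
            return trimmed;
        }
        return trimmed.substring(0, 1).toUpperCase(Locale.ROOT)
                + trimmed.substring(1).toLowerCase(Locale.ROOT);
    }

    // Simple regular-expression check: something@something.something.
    // Illustrative only, not a complete e-mail validator.
    public static boolean looksLikeEmail(String value) {
        return value != null && value.matches("[^@\\s]+@[^@\\s]+\\.[^@\\s]+");
    }

    public static void main(String[] args) {
        System.out.println(capitalizeName("  aLICE "));
        System.out.println(looksLikeEmail("alice@example.com"));
        System.out.println(looksLikeEmail("not-an-email"));
    }
}
```

In a tJavaRow, such a routine could be called with a line like `output_row.name = StringRoutines.capitalizeName(input_row.name);`, assuming both schemas define a `name` column.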
Reminders on debugging traces
Using the Java Debug mode: Debug perspective, step-by-step mode
Visualization and on-the-fly modification of variables
Workshop: Basic / advanced task debugging
Reminder on the contexts
Export of jobs for execution on a server
Passing context between jobs at runtime
Use of a file to load the context (tContextLoad)
Workshop: Deployment of tasks and dynamic loading of the context
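The idea behind loading a context from a file can be sketched in plain Java: default values are defined first, then key/value pairs read from a file override them. This is not how tContextLoad is implemented; the class `ContextLoadSketch`, the `key=value` file format and the variable names `db_host` and `db_port` are assumptions made for the example.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Hypothetical sketch of "defaults overridden by a context file".
class ContextLoadSketch {

    // Copies the defaults, then applies every key=value pair found in fileContent.
    static Properties loadContext(Properties defaults, String fileContent) {
        Properties context = new Properties();
        context.putAll(defaults);
        try {
            context.load(new StringReader(fileContent));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return context;
    }

    public static void main(String[] args) {
        Properties defaults = new Properties();
        defaults.setProperty("db_host", "localhost");   // test-environment default
        defaults.setProperty("db_port", "3306");

        // Content a production context file might hold (assumed values).
        String productionFile = "db_host=prod-db.example.com\n";

        Properties context = loadContext(defaults, productionFile);
        System.out.println("db_host = " + context.getProperty("db_host"));
        System.out.println("db_port = " + context.getProperty("db_port"));
    }
}
```

Keys absent from the file keep their default value, which is what allows the same exported job to run against test and production settings.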
Use of advanced components: Java or regular expressions
Creation of specific logs
Management of XML flows
Reuse of code
Workshop: Use of the tJavaRow, tJavaFlex components – creation of logs with tCatch, tDie … – use of the embedded Java editor
Essential concepts for extending Talend Open Studio
Creation of the component mockup with tJavaFlex / graphical interface
Discover the Talend forge and its community components
Integration into Talend Open Studio of the plugins created or downloaded
Workshop: Creation of a simple component via tJavaFlex, integration of components from the Talend forge
Sessions
Contact us for more information about session dates