Description
This course will teach you how to effectively use the Talend Open Studio for Data Quality tool to evaluate the level of data quality in an information system. You will implement analyses, verify business rules and define correction strategies for erroneous data.
Who is this training for ?
For whom ?Business analysts, data integrators, data managers.
Prerequisites
Training objectives
Training program
- The problem of data quality
- Evaluation of the data quality of an information system.
- Fundamental criteria: completeness, precision and integrity of data.
- Product positioning Talend Open Studio for Data Quality in the Talend suite.
- Practical work Product installation, configuration of preferences.
- The fundamental concepts of TOS for Data Quality
- Metadata: connections to databases, delimited files and Excel files.
- Presentation of the different types of analyses.
- Tools and indicators to help with carrying out analyses.
- The data explorer.
- Practical work Perform a first column analysis on data from a csv file, exploitation of the results obtained.
- Simple analyzes
- Duplicate search, respect of interval constraints, date format, email.
- Table metrics, functional dependencies between columns.
- Identification of value redundancies.
- Consistency checks between foreign and primary keys.
- Use indicators, models, rules and source files.
- Practical work Carry out an analysis of each type on a set of partially erroneous data.
- Advanced analytics
- Analysis of schema and table structure via the data explorer.
- Multi-table and multi-column analysis, compliance with business rules.
- Search and visualization of correlation between columns.
- Create your own indicators and source files.
- Manage analyses.
- Practical work Create a complex business rule involving several tables and associate it with a task.
- Publish the rule in the Talend forge.
- Advanced elements
- Use context variables.
- Create models based on regular expressions.
- Export/import analyzes and analyzed data.
- Correct erroneous data with Talend Data Integration.
- Practical work Configure metadata and analyzes using context variables, export analyzed data to correct them in Talend Data Integration