Description
Who should attend
Project administrators and ETL developers responsible for data extraction and transformation using DataStage.
Prerequisites
- Basic knowledge of Windows operating system
- Familiarity with database access techniques
Course Objectives
- Describe the uses of DataStage and the DataStage workflow
- Describe the Information Server architecture and how DataStage fits within it
- Describe the Information Server and DataStage deployment options
- Use the Information Server Web Console and the DataStage Administrator client to create DataStage users and to configure the DataStage environment
- Import and export DataStage objects to a file
- Import table definitions for sequential files and relational tables
- Design, compile, run, and monitor DataStage parallel jobs
- Design jobs that read and write to sequential files
- Describe the DataStage parallel processing architecture
- Design jobs that combine data using joins and lookups
- Design jobs that sort and aggregate data
- Implement complex business logic using the DataStage Transformer stage
- Debug DataStage jobs using the DataStage PX Debugger
Outline: IBM InfoSphere DataStage Essentials (v11.5) (KM204G)
1. Introduction to DataStage2. Deployment3. DataStage Administration4. Work with Metadata5. Create Parallel Jobs6. Access Sequential Data7. Partitioning and Collecting Algorithms8. Combine Data9. Group Processing Stages10. Transformer Stage11. Repository Functions12. Work with Relational Data13. Control Jobs