Get in Touch

Course Outline

Advanced Transformation Building Blocks

  • Handling complex data types
  • Managing fields, metadata, and dynamic structures
  • Utilizing reusable transformation patterns

Parameters, Variables, and Job-Oriented Design

  • Runtime variables and scoping mechanisms
  • Parameterizing transformations
  • Structuring parent-child jobs

Database Integration and Lookup Strategies

  • Utilizing advanced lookup steps
  • Implementing caching strategies
  • Designing efficient joins

Working with Files, APIs, and External Systems

  • Processing JSON and XML formats
  • Invoking REST and SOAP services
  • Executing streaming and batch loads

Error Handling and Data Quality Techniques

  • Capturing and routing errors
  • Applying data validation patterns
  • Conducting auditing and logging

Performance Tuning Essentials

  • Optimizing step design
  • Addressing memory and threading considerations
  • Identifying and resolving bottlenecks

Introduction to Repository-Based Development

  • Using the Pentaho repository
  • Managing versions
  • Practices for team collaboration

Deployment and Migration Practices

  • Promoting jobs across different environments
  • Configuration management
  • Operational best practices

Summary and Next Steps

Requirements

  • A foundational understanding of ETL principles
  • Practical experience with Pentaho Data Integration
  • Basic knowledge of data warehousing concepts

Target Audience

  • ETL developers
  • Data engineers
  • Technical professionals looking to expand their PDI expertise
 21 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories