Session

How Does Anything Happen? Orchestration with Apache Airflow

Thursday,  Mar 25 | 12:00PM - 12:45PM US ET

Level: Intermediate

An orchestrator is the central component of a modern data platform and is responsible for executing complex data operations in an efficient and reliable manner. Airflow is the leading data orchestration stack and we'll discuss how to use it effectively in production, including development, testing, operations and data quality. In this casual talk, we’ll:
  • discuss orchestration theory and best practices
  • explore tools commonly used in data pipelines (including dbt, Papermill and Great expectations)
  • hear stories of things gone wrong (and right!)