Favicon of Apache Airflow

Apache Airflow

Apache Airflow is an open-source platform for programmatically authoring, scheduling, and monitoring data pipelines and batch workflows.

Screenshot of Apache Airflow website

Apache Airflow is a scalable, extensible workflow orchestration platform that empowers teams to programmatically author, schedule, and monitor complex data pipelines. Created by the open-source community, Airflow is battle-tested at organizations of every size.

Key Features:

  • Pure Python — Define workflows as code using standard Python, no XML or CLI hacks
  • Robust Web UI — Monitor, schedule, and manage workflows with a modern dashboard
  • Extensible Operators — Plug-and-play integrations with AWS, GCP, Azure, and hundreds more
  • Dynamic Pipelines — Generate pipeline DAGs dynamically with Python loops and parameters
  • Scalable Architecture — Modular message queue design scales to orchestrate unlimited workers

Whether you're a data engineer, ML practitioner, or scraping team lead, Airflow provides the foundation to automate and monitor any pipeline.

Share:

  • Stars

    44K
  • Forks

    16.3K
  • Last commit

    2 months ago
  • License

    Apache-2.0
  • Language

    Python
View Repository

Similar to Apache Airflow

Favicon

 

  
  
Favicon

 

  
  
Favicon