Data Engineering Hub

Apache Airflow

![[Assets/apache-airflow-logo.png|100]]

Apache Airflow is an open-source platform for programmatically developing, scheduling, and orchestrating [[Batch Data Processing|batch workflows]]. It is most commonly used to schedule Python and SQL scripts, but it is flexible enough to run any other type of script your data pipeline needs. Several companies also offer Airflow as a managed service if you don’t want to run the infrastructure yourself.
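To make "orchestrating" concrete: Airflow models a workflow as a directed acyclic graph (DAG) of tasks and runs each task only after the tasks it depends on have finished. The sketch below illustrates that idea with Python's standard library alone. It is not Airflow's API, and the task names (`extract`, `transform`, `load`) and the `run_pipeline` helper are hypothetical, chosen only for illustration.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


# Hypothetical pipeline steps. In a real pipeline each one would run a
# Python or SQL script; here they only record that they were called.
def extract(log):
    log.append("extract")


def transform(log):
    log.append("transform")


def load(log):
    log.append("load")


TASKS = {"extract": extract, "transform": transform, "load": load}

# Map each task name to the set of tasks that must finish before it.
DEPENDENCIES = {
    "extract": set(),
    "transform": {"extract"},
    "load": {"transform"},
}


def run_pipeline():
    """Run every task once, in an order that respects the dependencies."""
    log = []
    for name in TopologicalSorter(DEPENDENCIES).static_order():
        TASKS[name](log)
    return log


if __name__ == "__main__":
    print(run_pipeline())
```

An orchestrator such as Airflow adds what this sketch leaves out, including running the graph on a schedule, retrying failed tasks, and tracking the state of each run. See the official documentation for the actual DAG authoring API.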

Official Documentation

https://airflow.apache.org/docs/

Learning Resources

https://airflow.apache.org/ecosystem/#learning-resources