Using Apache Airflow to create dynamic, extensible, elegant, scalable, data workflows on Google Cloud at SoulCycle
Join SoulCycle, Caserta and more than 150 fellow data nerds for pizza, drinks, mingling and presentations. Data architects, data engineers and data scientists alike will enjoy the contents of the talks.
Caserta Consultants will share, in technical detail, how to define and configure Airflow pipelines to create workflows that are maintainable, versionable, testable, and collaborative.
In this meetup you will learn best best practices to create Airflow directed acyclic graphs (DAGs) of tasks, enforcing relationships and dependencies. The talk covers constructing Airflow pipelines with Spark based ETL, a Google Storage Data Lake, and a Big Query Data Warehouse.
The stage will be shared with Dallas S. Simpson, Director of Data Engineering at SoulCycle Inc. Dallas will co-present with Caserta to share his experience and lessons learned using Airflow on their internal GCP Data Analytics Platform project.
>> Can’t make it? Register for our live webinar version on May 10th at 2pm ET.
Caserta, Founder & President
Dallas S. Simpson
SoulCycle, Director, Data Engineering
Caserta, Solutions Architect