  • This event has passed.

Meetup: Using Apache Airflow to Create Dynamic, Extensible Data Workflows on GCP

May 8, 2018 @ 6:30 pm - 9:00 pm

SoulCycle Data Transformation

Using Apache Airflow to create dynamic, extensible, elegant, scalable data workflows on Google Cloud at SoulCycle

Join SoulCycle, Caserta and more than 150 fellow data nerds for pizza, drinks, mingling and presentations. Data architects, data engineers and data scientists alike will enjoy the contents of the talks.

Caserta Consultants will share, in technical detail, how to define and configure Airflow pipelines to create workflows that are maintainable, versionable, testable, and collaborative.

In this meetup you will learn best practices for creating Airflow directed acyclic graphs (DAGs) of tasks and for enforcing the relationships and dependencies between them. The talk covers constructing Airflow pipelines with Spark-based ETL, a Google Cloud Storage data lake, and a BigQuery data warehouse.
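As a rough illustration of what a DAG of tasks with enforced dependencies looks like, the sketch below models a small pipeline using only the Python standard library. It is not the Airflow API and not the speakers' code; the task names are hypothetical, chosen to mirror the Spark, Cloud Storage, and BigQuery stages mentioned above.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical task names (assumptions for illustration only).
# Each key maps to the set of tasks that must finish before it can run.
dependencies = {
    "extract_to_gcs": set(),
    "spark_transform": {"extract_to_gcs"},
    "load_to_bigquery": {"spark_transform"},
    "data_quality_check": {"load_to_bigquery"},
}


def execution_order(deps):
    """Return one valid run order for the tasks.

    Raises graphlib.CycleError if the dependencies contain a cycle,
    which is what makes the graph "acyclic" in practice.
    """
    return list(TopologicalSorter(deps).static_order())


order = execution_order(dependencies)
print(order)
```

In Airflow itself, the same kind of ordering is declared between operators, commonly with the `>>` operator, and the scheduler takes care of running each task only after its upstream tasks succeed.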

The stage will be shared with Dallas S. Simpson, Director of Data Engineering at SoulCycle Inc. Dallas will co-present with Caserta to share his experience and lessons learned using Airflow on their internal GCP Data Analytics Platform project.

>> Register for the meetup.

>> Can’t make it? Register for our live webinar version on May 10th at 2pm ET.

Featured Speakers:

Joe Caserta

Caserta, Founder & President

Dallas S. Simpson

SoulCycle, Director, Data Engineering

Dovy Paukstys

Caserta, Solutions Architect


Details

Date: May 8, 2018
Time: 6:30 pm - 9:00 pm