Delta Lake Sessions at Spark+AI Summit North America 2020
by Denny Lee, June 22, 2020
We're really excited about the many Delta Lake training and conference sessions taking place throughout Spark+AI Summit NA 2020.
Training Sessions
Let's start off with the Delta Lake Training sessions at SAISNA20:
- Delta from a Data Engineer's Perspective by Palla Lentz and Jake Therianos (Databricks)
- Introduction to Delta Lake by Joshua Cook
Wednesday, June 24th, 2020
The conference sessions start off with the Wednesday Morning Keynotes, which include a great callout of Delta Lake usage at Starbucks!
- 10:30am: Modernize Your Data Warehouse and Data Lake to Databricks Delta with Informatica by Rodrigo Sanchez Bredee (Informatica)
- 10:30am: Delta Lake OSS AMA with Wesley Hoffman, Jacek Laskowski, Vini Jaiswal, Chris Hoshino-Fish, Palla Lentz, Burak Yavuz, Tathagata Das, and Denny Lee
- 11:30am: Building a Better Delta Lake with Talend and Databricks by Michael Destein and Cameron Davie (Talend)
- 12:10pm: Powering Interactive BI Analytics with Presto and Delta Lake by Kamil Bajda-Pawlikowski (Starburst)
- 2:30pm: Building Data Quality Audit Framework using Delta Lake at Cerner by Madhav Agni (Cerner)
- 3:05pm: Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with Delta Lake by Lara Minor (Columbia) and Bilal Obeidat (Databricks)
- 3:05pm: Simplify CDC Pipeline with Spark Streaming SQL and Delta Lake by Jun Song (Alibaba)
- 3:05pm: Operationalizing Big Data Pipelines At Scale by Brad May and Arjit Dhavale (Starbucks)
- 3:40pm: A Thorough Comparison of Delta Lake, Iceberg and Hudi by Junjie Chen and Junping Du (Tencent)
- 4:00pm: Delta Lake 0.7.0 Early Preview VIP AMA with Burak Yavuz, Tathagata Das, and Denny Lee (Databricks)
Thursday, June 25th, 2020
- 10:30am: Delta Lake VIP AMA with Joe Widen, Franco Patano, Andreas Neumann, Palla Lentz, Chris Hoshino-Fish, and Denny Lee
- 11:35am: Parallelization of Structured Streaming Jobs Using Delta Lake by Oliver Lewis (Tubi)
- 12:10pm: Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL Pipeline by George Claireaux and Kirby Prowting (Mars)
- 3:40pm: Automatic Forecasting using Prophet, Databricks, Delta Lake and MLflow by Perry Stephenson (Atlassian)
Friday, June 26th, 2020
- 10:35am: Best Practices for Building Robust Data Platform with Apache Spark and Delta by Vini Jaiswal (Databricks)
- 10:35am: Machine Learning Data Lineage with MLflow and Delta Lake by Richard Zang and Denny Lee (Databricks)
- 10:35am: Patterns and Operational Insights from the First Users of Delta Lake by Dominique Brezinski (Apple)
- 12:20pm: Real-Time Forecasting at Scale using Delta Lake and Delta Caching by Rashmina Menon and Jatinder Assi (GumGum)
- 12:20pm: Simplifying Disaster Recovery with Delta Lake by Zeashan Pappa and Itai Weiss (Databricks)