Delta Lake Tutorials
Try out the latest tutorials for the open-source Delta Lake project.
Join us for a three-part learning series: Diving Into Delta Lake. This series of tech talks takes you through the internals of Delta Lake, a popular open-source technology enabling ACID transactions, time travel, schema enforcement, and more on top of your data lakes. The Delta Lake engineering team, including Burak Yavuz, Andrea Neumann, Tathagata “TD”.
The rise of the Lakehouse architectural pattern is built upon tech innovations enabling the data lake to support ACID transactions and other features of traditional data warehouse workloads. Join us for a five-part learning series on Getting Started with Delta Lake. This series of tech talks takes you through the technology foundation of Delta Lake.
Delta Lake’s transaction log brings high reliability, performance, and ACID compliant transactions to data lakes. But exactly how does it accomplish this? Working through concrete examples, we will take a close look at how the transaction logs are managed and leveraged by Delta to supercharge data lakes.
We will demonstrate on Apache Spark™ 2.4.3 how to use Python and the Python APIs in Delta Lake 0.4.0 within the context of an on-time flight performance scenario. We will show how to upsert and delete data, query old versions of data with time travel and vacuum older versions for cleanup.
This tutorial goes through many features of Delta Lake features including schema enforcement and schema evolution, interoperability between batch and streaming workloads, time travel, and DML commands like Delete and Merge. It was originally given at Spark Summit 2019 Europe and is available in both Scala and Python.
This is the notebook primer for the Delta Lake workshop featuring Delta Lake and MLflow. It is also used extensively for the Delta Lake Hands-on Labs.
Take a walk through the daily struggles of a data engineer in this presentation as we cover what is truly needed to create robust end to end Big Data solutions.
This guide helps you quickly explore the main features of Delta Lake. It provides code snippets that show how to read from and write to Delta tables from interactive, batch, and streaming queries.