The Linux Foundation Projects
Delta Lake

Delta Lake Blogs

Thumbnail for Unlock Delta Lakes for PyTorch Training with DeltaTorch

Unlock Delta Lakes for PyTorch Training with DeltaTorch

By Daniel Liden , Michael Shtelma

This post demonstrates how to create PyTorch DataLoaders using Delta tables as data sources for training deep learning models.

Thumbnail for Introducing Delta Lake Table Features

Introducing Delta Lake Table Features

By Nick Karpov

This introduces Delta Lake Table Features, a discrete feature-based compatibility scheme that replaces the traditional integer protocol versioning for Delta Lake tables and clients.

Thumbnail for Delta Lake Change Data Feed (CDF)

Delta Lake Change Data Feed (CDF)

By Nick Karpov , Matthew Powers

This blog shows how to enable and use the Delta Lake Change Data Feed.

Thumbnail for Delta Lake’s transaction log protocol and its implementations

Delta Lake’s transaction log protocol and its implementations

By Matthew Powers

This blog explains the Delta Lake transaction log protocol and its various implementation.

Thumbnail for Delta Lake Deletion Vectors

Delta Lake Deletion Vectors

By Nick Karpov

This blog introduces the new Deletion Vectors table feature for Delta Lake tables, and explains how Deletion Vectors speed up operations that modify existing data in your lakehouse.

Thumbnail for Using Ibis with PySpark on Delta Lake tables

Using Ibis with PySpark on Delta Lake tables

By Marlene Mhangami , Matthew Powers

This post explains how to use Ibis to query Delta tables with PySpark

Thumbnail for Delta Lake Z Order

Delta Lake Z Order

By Matthew Powers

This post explains how to use Delta Lake Z Order to make your queries run faster

Thumbnail for Delta Lake 2.3.0 Released

Delta Lake 2.3.0 Released

By Allison Portis , Matthew Powers

This post explains some of the key features in the Delta Lake 2.3.0 release

Thumbnail for Open source self-hosted Delta Sharing server

Open source self-hosted Delta Sharing server

By Shingo Okawa

This post explains Kotosiro Delta Sharing server basic instructions

Thumbnail for How Delta Lake uses metadata to make certain aggregations much faster

How Delta Lake uses metadata to make certain aggregations much faster

By Matthew Powers , Scott Sandre

This post explains Delta Lake performance optimizations that make some aggregations execute quicker

Thumbnail for How to use Delta Lake generated columns

How to use Delta Lake generated columns

By Matthew Powers

How to create Delta Lake tables with generated columns and the benefits of this feature

Thumbnail for Introducing Support for Delta Lake Tables in AWS Lambda

Introducing Support for Delta Lake Tables in AWS Lambda

By Nick Karpov

How to use deltalake in AWS Lambda with AWS SDK for pandas