Delta Lake Blogs
Unlock Delta Lakes for PyTorch Training with DeltaTorch
By Daniel Liden, Michael Shtelma
This post demonstrates how to create PyTorch DataLoaders using Delta tables as data sources for training deep learning models.
Introducing Delta Lake Table Features
By Nick Karpov
This post introduces Delta Lake Table Features, a discrete feature-based compatibility scheme that replaces the traditional integer protocol versioning for Delta Lake tables and clients.
Delta Lake Change Data Feed (CDF)
By Nick Karpov, Matthew Powers
This blog shows how to enable and use the Delta Lake Change Data Feed.
Delta Lake’s transaction log protocol and its implementations
This blog explains the Delta Lake transaction log protocol and its various implementations.
Delta Lake Deletion Vectors
By Nick Karpov
This blog introduces the new Deletion Vectors table feature for Delta Lake tables and explains how Deletion Vectors speed up operations that modify existing data in your lakehouse.
Using Ibis with PySpark on Delta Lake tables
By Marlene Mhangami, Matthew Powers
This post explains how to use Ibis to query Delta tables with PySpark.
Delta Lake Z Order
This post explains how to use Delta Lake Z Order to make your queries run faster.
Delta Lake 2.3.0 Released
By Allison Portis, Matthew Powers
This post explains some of the key features in the Delta Lake 2.3.0 release.
Open source self-hosted Delta Sharing server
By Shingo Okawa
This post gives basic instructions for the Kotosiro Delta Sharing server.
How Delta Lake uses metadata to make certain aggregations much faster
By Matthew Powers, Scott Sandre
This post explains Delta Lake performance optimizations that make some aggregations run faster.
How to use Delta Lake generated columns
This post explains how to create Delta Lake tables with generated columns and describes the benefits of this feature.
Introducing Support for Delta Lake Tables in AWS Lambda
By Nick Karpov
This post explains how to use deltalake in AWS Lambda with the AWS SDK for pandas.