Delta Lake Blogs
Delta Lake’s transaction log protocol and its implementations
This blog explains the Delta Lake transaction log protocol and its various implementation.
Delta Lake Deletion Vectors
By Nick Karpov
This blog introduces the new Deletion Vectors table feature for Delta Lake tables, and explains how Deletion Vectors speed up operations that modify existing data in your lakehouse.
Using Ibis with PySpark on Delta Lake tables
By Marlene Mhangami , Matthew Powers
This post explains how to use Ibis to query Delta tables with PySpark
Delta Lake Z Order
This post explains how to use Delta Lake Z Order to make your queries run faster
Delta Lake 2.3.0 Released
By Allison Portis , Matthew Powers
This post explains some of the key features in the Delta Lake 2.3.0 release
Open source self-hosted Delta Sharing server
By Shingo Okawa
This post explains Kotosiro Delta Sharing server basic instructions
How Delta Lake uses metadata to make certain aggregations much faster
By Matthew Powers , Scott Sandre
This post explains Delta Lake performance optimizations that make some aggregations execute quicker
How to use Delta Lake generated columns
How to create Delta Lake tables with generated columns and the benefits of this feature
Introducing Support for Delta Lake Tables in AWS Lambda
By Nick Karpov
How to use deltalake in AWS Lambda with AWS SDK for pandas
How to create and append to Delta Lake tables with pandas
This post explains how to create and append to Delta Lake tables with pandas
Running ML Workflows with Delta Lake and Ray
By Jim Hibbard
This post explains how you can read Delta Lake with the Ray compute framework
How to Convert from CSV to Delta Lake
This post explains how to convert from a CSV data lake to Delta Lake, which offers much better features.