Reliable Data Lakes at Scale

Delta Lake is an open-source storage layer that brings ACID
transactions to Apache Spark™ and big data workloads.


Key Features

ACID Transactions
Scalable Metadata Handling
Time Travel (data versioning)
Open Format
Unified Batch and Streaming Source and Sink
Schema Enforcement
Schema Evolution
Audit History
Updates and Deletes
100% Compatible with Apache Spark API:
Instead of parquet...
dataframe
   .write
   .format("parquet")
   .save("/data")
… simply say delta
dataframe
   .write
   .format("delta")
   .save("/data")
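As a minimal sketch of the same idea in PySpark, the helpers below wrap the write shown above and add a matching read. The function names `save_table` and `load_table` are hypothetical and not part of Delta Lake. The sketch assumes `df` is a Spark DataFrame and `spark` is a SparkSession already configured with the Delta Lake package. The `versionAsOf` read option is how Delta Lake exposes time travel on reads.

```python
# Hypothetical helper names, for illustration only (not part of Delta Lake).
# Assumes a SparkSession that already has the Delta Lake package configured.

def save_table(df, path, fmt="delta"):
    """Write a DataFrame to `path`; only the format string differs
    between a Parquet write and a Delta write."""
    df.write.format(fmt).save(path)


def load_table(spark, path, fmt="delta", version=None):
    """Read the table back. When `version` is given, pass it as the
    `versionAsOf` read option to load an earlier snapshot of a Delta table."""
    reader = spark.read.format(fmt)
    if version is not None:
        reader = reader.option("versionAsOf", version)
    return reader.load(path)
```

With these in place, `save_table(dataframe, "/data")` would perform the Delta write shown above, and `load_table(spark, "/data", version=0)` would request the first committed version of that table, assuming the table exists and that version has not been removed.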


New Features Coming Soon

Expectations:

Users of Delta Lake

Hundreds of companies are processing exabytes of data per month with Delta Lake.

To add your organization here, email our user list at delta-users@googlegroups.com.

Join the Delta Lake Community

Communicate with fellow Delta Lake users and contributors, ask questions, and share tips.
Slack Channel | Google Group
© 2019