Delta Lake Newsletter, 2020-03-20 Edition
March 3, 2020
For this edition of the Delta Lake Newsletter, find out more about the latest and upcoming tech talks and videos. For this edition, we will focus on the weekly tech talk series. Please share this newsletter with anyone who would like to know more about Delta Lake!
Diving into Delta Lake: Unpacking the Transaction Log
March 26th, 2020
The transaction log is key to understanding Delta Lake because it is the common thread that runs through many of its most important features, including ACID transactions, scalable metadata handling, time travel, and more. In this session, we’ll explore what the Delta Lake transaction log is, how it works at the file level, and how it offers an elegant solution to the problem of multiple concurrent reads and writes.
Diving into Delta Lake: Enforcing and Evolving the Schema
April 2nd, 2020
As business problems and requirements evolve over time, so to does the structure of your data. With Delta Lake, as the data changes, incorporating new dimensions is easy. Users have access to simple semantics to control the schema of their tables. These tools include schema enforcement, which prevents users from accidentally polluting their tables with mistakes or garbage data, as well as schema evolution, which enables them to automatically add new columns of rich data when those columns belong. In this webinar, we’ll dive into the use of these tools.
Diving into Delta Lake: DML Internals
April 16th, 2020
In the earlier Delta Lake Internals webinar series sessions, we described how the Delta Lake transaction log works. In this session, we will dive deeper into how commits, snapshot isolation, and partition and files change when performing deletes, updates, merges, and structured streaming.
Previous EventsMachine Learning Lessons Learned from the Field: Interview with Brooke Wenig
March 18th, 2020
We had a great interview with Brooke Wenig, Machine Learning Practice Lead, on the best practices and patterns when developing, training, and deploying Machine Learning algorithms in production.
We also had updates to previous tech talks with new live demos and Q&A sessions all available on-demand now.
- [February 27, 2020] Getting Data Ready for Data Science with Delta Lake and MLflow
- [March 5, 2020] Beyond Lambda: Introducing Delta Architecture
- [March 12, 2020] Simplify and Scale Data Engineering Pipelines with Delta Lake
The Genesis of Delta Lake - An Interview with Burak Yavuz
February 19th, 2020
We interviewed Burak Yavuz to learn about the Delta Lake team's decision making process and why they designed, architected, and implemented the architecture that it is today. Understand technical challenges that the team faced, how those challenges were solved, and learn about the plans for the future.
In light of the times, instead of our usual Delta Lake Thursday Tech Talk sessions, we decided to do a session around the analysis of COVID-19 datasets with Vini Jaiswal, Chengyin Eng, Dhruv Kumar, and Denny Lee. For more information on the session, please refer to the background and the link below to watch the session on-demand.
Analyzing COVID-19: Can the Data Community Help?
March 19th, 2020
If you have any questions or feedback, please do not hesitate to provide feedback on the #deltalake-oss Slack channel. Join the Delta Lake Channel (Register | Login) and join the Delta Users Email Distribution List today!
Thanks!Denny Lee, Developer Advocate