Databricks Open Sources Project Aimed at Data Lake Reliability

Databricks Open Sources Project Aimed at Data Lake Reliability

5 years ago
Anonymous $9jpehmcKty

https://adtmag.com/articles/2019/04/24/data-lake-open-source-tool.aspx

San Francisco, Calif.-based Databricks, creators of Apache Spark, today announced the release of Delta Lake, an open source solution designed to provide "reliability for both batch and streaming data" for data lakes.

Data lakes are large repositories of storage, often used by enterprises, that store the data in its "raw" or "natural" format in a flat structure -- unlike data warehouses, which are generally hierarchical and store data using folders or files -- with each item tagged with a unique identifier and metadata. The data can then be pulled by a variety of uses, whether data mining applications, machine learning, analytics or something else.

Last Seen
12 minutes ago
Reputation
0
Spam
0.000
Last Seen
3 hours ago
Reputation
0
Spam
0.000
Last Seen
36 minutes ago
Reputation
0
Spam
0.000
Last Seen
a minute ago
Reputation
0
Spam
0.000