- Faster than traditional ETL tools
- Data is never thrown away
- Users can query and explore the data
- More flexible than a traditional data warehouse, since there is no requirement to ingest only structured data
So what is Azure Data Lake Storage Gen2?
ADLS Gen2 converges the capabilities of ADLS Gen1 with Azure Blob storage. In effect, it is an upgrade on top of Blob storage: it adds file system semantics, file-level security, and better scalability.
Because all these additional capabilities are built on Azure Blob storage, ADLS Gen2 also offers low-cost, tiered storage and high availability (the disaster recovery capabilities of Blob storage are inherited).
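To make "file system semantics" a little more concrete, here is a minimal sketch in plain Python. It does not call Azure at all: `FlatStore` and `HierarchicalStore` are hypothetical toy classes, and the paths are made up. It only illustrates one practical difference, namely that renaming a "directory" in a flat key namespace means rewriting every matching key, while a hierarchical namespace can rename a single directory entry.

```python
# Toy model only: these classes are hypothetical and do not talk to Azure.

class FlatStore:
    """Flat namespace: every object is a full key such as 'raw/2024/a.csv'."""

    def __init__(self):
        self.objects = {}

    def put(self, key, data):
        self.objects[key] = data

    def rename_prefix(self, old, new):
        """Emulate a directory rename by rewriting every matching key."""
        matches = [k for k in self.objects if k.startswith(old + "/")]
        for key in matches:
            self.objects[new + key[len(old):]] = self.objects.pop(key)
        return len(matches)  # number of objects touched


class HierarchicalStore:
    """Hierarchical namespace: directories are real nodes holding children."""

    def __init__(self):
        self.root = {}

    def put(self, path, data):
        *dirs, name = path.split("/")
        node = self.root
        for d in dirs:
            node = node.setdefault(d, {})
        node[name] = data

    def rename_dir(self, parent, old, new):
        """Rename one directory entry; its children move with it."""
        node = self.root
        for d in (parent.split("/") if parent else []):
            node = node[d]
        node[new] = node.pop(old)
        return 1  # a single directory entry is touched


flat = FlatStore()
tree = HierarchicalStore()
for name in ("a.csv", "b.csv", "c.csv"):
    flat.put(f"raw/2024/{name}", b"")
    tree.put(f"raw/2024/{name}", b"")

flat_touched = flat.rename_prefix("raw/2024", "raw/archive")
tree_touched = tree.rename_dir("raw", "2024", "archive")
print(flat_touched, tree_touched)
```

The real service is of course far more involved; the point of the sketch is only that the cost of the flat rename grows with the number of files, while the hierarchical rename does not.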
More details on data lakes can be found in the links shared below.
It is also important to understand that:
- A data lake is usually the first stop in the data flow, so the raw data still needs further processing using big data technologies.
- Dumping raw data into a data lake comes with a responsibility to put governance in place and to ensure the quality of the metadata. Data discovery and analytics capabilities should be developed in order to make proper use of the data stored in the data lake.
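
As a rough illustration of the metadata point, the sketch below flags files in the lake whose catalog entry is incomplete. Everything in it is an assumption made for the example: the required field names, the `catalog` dictionary, and the file paths are hypothetical, and a real setup would normally rely on a dedicated data catalog tool rather than a hand-rolled check.

```python
# Hypothetical set of fields every catalog entry is expected to carry.
REQUIRED_FIELDS = ("owner", "source_system", "description", "ingested_at")


def missing_metadata(entry):
    """Return the required fields that are absent or blank in an entry."""
    return [f for f in REQUIRED_FIELDS if not str(entry.get(f) or "").strip()]


# Made-up catalog: file path in the lake -> metadata recorded for it.
catalog = {
    "raw/sales/2024-01.csv": {
        "owner": "sales-team",
        "source_system": "crm",
        "description": "Monthly export",
        "ingested_at": "2024-02-01",
    },
    "raw/logs/app.json": {"owner": "", "source_system": "web"},
}

report = {path: missing_metadata(meta) for path, meta in catalog.items()}
undocumented = {path: gaps for path, gaps in report.items() if gaps}
print(undocumented)
```

Running a check like this at ingestion time is one simple way to keep raw data discoverable later on.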