Azure Data Lake includes all the capabilities required to make it easy for developers, data scientists and analysts to store data of any size, shape and speed, and do all types of processing and analytics across platforms and languages. It removes the complexities of ingesting and storing all of your data while making it faster to get up and running with batch, streaming and interactive analytics. Azure Data Lake works with existing IT investments for identity, management and security for simplified data management and governance. It also integrates seamlessly with operational stores and data warehouses so you can extend current data applications. We have drawn on the experience of working with enterprise customers and running some of the largest scale processing and analytics in the world for Microsoft businesses like Office 365, Xbox Live, Azure, Windows, Bing and Skype. Azure Data Lake solves many of the productivity and scalability challenges that prevent you from maximizing the value of your data assets with a service that is ready to meet your current and future business needs.
Currently there are two generations of Data Lake Storage:
- Data Lake Storage Gen1
- Data Lake Storage Gen2 (Preview)
Data Lake Storage Gen1
Azure Data Lake Store has been renamed to Azure Data Lake Storage Gen1, since an updated version has been released as Azure Data Lake Storage Gen2. Azure Data Lake Storage Gen1 is an enterprise-wide hyper-scale repository for big data analytic workloads. Azure Data Lake enables you to capture data of any size, type, and ingestion speed in one single place for operational and exploratory analytics.
Data Lake Storage Gen1 can be accessed from Hadoop (available with an HDInsight cluster) using the WebHDFS-compatible REST APIs. It is specifically designed to enable analytics on the stored data and is tuned for performance in data analytics scenarios. Out of the box, it includes all the enterprise-grade capabilities (security, manageability, scalability, reliability, and availability) essential for real-world enterprise use cases.
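As a rough illustration of the WebHDFS-compatible REST APIs, the sketch below only builds the request URL for a Gen1 account; it does not send anything. The account name `mydatalake` and the folder `raw/sales` are hypothetical, and a real request would also need an Azure AD OAuth bearer token, which is left out here.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(account: str, path: str, op: str, **params: str) -> str:
    """Build a WebHDFS-style REST URL for a Data Lake Storage Gen1 account."""
    # Gen1 accounts are served from <account>.azuredatalakestore.net,
    # and WebHDFS operations live under the /webhdfs/v1/ prefix.
    base = f"https://{account}.azuredatalakestore.net/webhdfs/v1/"
    # Percent-encode the path but keep "/" as the folder separator.
    encoded_path = quote(path.lstrip("/"), safe="/")
    # The operation (for example LISTSTATUS or OPEN) goes in the "op" query parameter.
    query = urlencode({"op": op, **params})
    return f"{base}{encoded_path}?{query}"


if __name__ == "__main__":
    # "mydatalake" and "raw/sales" are hypothetical names used for illustration.
    print(webhdfs_url("mydatalake", "/raw/sales", "LISTSTATUS"))
```

Keeping the URL construction in its own function makes it easy to check the shape of a request before wiring in authentication and an HTTP client.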
Data Lake Storage Gen2 (Preview)
Azure Data Lake Storage Gen2 Preview is a set of capabilities dedicated to big data analytics, built into Azure Blob storage. It allows you to interface with your data using both file system and object storage paradigms. The addition of Data Lake Storage Gen2 makes Azure Storage the only cloud-based multi-modal platform, allowing you to extract analytics value from all of your data.
Data Lake Storage Gen2 brings all the qualities that are required for the full lifecycle of analytics data to Azure Storage. It is the result of converging the capabilities of our two existing storage services, Azure Blob Storage and Azure Data Lake Storage Gen1. Features from Azure Data Lake Storage Gen1, such as file system semantics, file-level security, and scale, are combined with the low-cost, tiered storage and high availability/disaster recovery capabilities of Azure Blob storage.
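To make the two access paradigms concrete, here is a small sketch that formats the same stored file in both styles: as an `abfss://` file system URI and as a Blob storage URL. The account, file system, and path names are hypothetical, and the helpers only build strings; they do not contact Azure.

```python
def abfss_uri(account: str, filesystem: str, path: str) -> str:
    """File system view: the URI scheme used by Hadoop-style drivers for Gen2."""
    return f"abfss://{filesystem}@{account}.dfs.core.windows.net/{path.lstrip('/')}"


def blob_url(account: str, container: str, blob: str) -> str:
    """Object storage view: the Blob endpoint URL for the same data."""
    return f"https://{account}.blob.core.windows.net/{container}/{blob.lstrip('/')}"


if __name__ == "__main__":
    # "mystorage", "analytics" and the path are hypothetical example names.
    # In Gen2 a file system and a Blob container refer to the same entity.
    print(abfss_uri("mystorage", "analytics", "/raw/2018/events.csv"))
    print(blob_url("mystorage", "analytics", "/raw/2018/events.csv"))
```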
Steps to Create Data Lake Storage in the Portal
Select the Data Lake service in the Azure portal.

Give the Data Lake Storage account a name.

The newly created Data Lake will be listed in the Data Lake pool.

Select Data explorer to see the files listed in the Data Lake.

Select New Folder to create a folder to place the files in. Blob storage organizes data in containers, whereas Data Lake uses a file system.

We can grant access to the Data Lake, up to the highest permission level.

Access can be given to users in the Azure AD directory and also to guest users.

Access can be limited to specific folders, subfolders, or files.
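Folder and file access in Data Lake Storage Gen1 follows POSIX-style read, write, and execute permissions. The sketch below is a simplified illustration of how such an entry could be read; the `scope:principal:rwx` entry format, the `AclEntry` type, and the helper names are assumptions made for this example, and masks and default ACLs are not modelled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AclEntry:
    scope: str      # "user", "group", or "other"
    principal: str  # hypothetical Azure AD object ID
    read: bool
    write: bool
    execute: bool


def parse_acl_entry(entry: str) -> AclEntry:
    """Parse a 'scope:principal:rwx' style entry (assumed format)."""
    scope, principal, perms = entry.split(":")
    if len(perms) != 3:
        raise ValueError(f"expected three permission characters, got {perms!r}")
    # Each position must hold its letter (r, w, x) or "-" for "not granted".
    for got, expected in zip(perms, "rwx"):
        if got not in (expected, "-"):
            raise ValueError(f"unexpected permission character {got!r}")
    return AclEntry(scope, principal, perms[0] == "r", perms[1] == "w", perms[2] == "x")


def can_list_folder(entry: AclEntry) -> bool:
    # Listing a folder's contents needs both read and execute on that folder.
    return entry.read and entry.execute


if __name__ == "__main__":
    # "1234" stands in for a real object ID.
    entry = parse_acl_entry("user:1234:r-x")
    print(can_list_folder(entry))
```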


Folders and files can be uploaded using the Upload tab, just as in a normal file system.
