Overview of Azure Data Lake Store
8 Slides94.57 KB
Overview of Azure Data Lake Store
Fundamentals Reliable Automatically replicates your data Three copies within a single region Highly available Unlimited Storage Optimized for Analytics Unlimited account sizes Built for running large analytics systems that require massive throughput Individual file sizes from gigabytes to petabytes No limits to scale Optimized for parallel computation over petabytes of data Automatically optimizes for any
Secure your Data Access control Auditing Encryption POSIX-compliant Access Control Lists (ACLs) on Files and Folders * Audit logs for all operations Transparent serverside encryption * Audit logs that can be analyzed with ADL U-SQL Scripts Azure-managed (Azure Key Vault) and customer-managed keys* Integrated with Azure Active Directory * Features arriving by GA
HDFS for the Cloud Built from the ground up as a Hadoop file system Tools running HDI Cluster Types Hadoop Works Today Storm Works Today HBase Works Today Spark By GA Hadoop Distros Hortonwor ks* Cloudera* in HDI By GA Sqoop By GA Distcp Works Today Other Microsoft R Services Works Today (Revolution R) Works Today Apache Hadoop Version 2.8 and above * Features arriving by GA
ADL Store Scenarios Billing Optimized for Analytics Azure Blob Storage General purpose bulk storage Pay for amount stored and for I/O operations WebHDFS Implements WebHDFS No WebHDFS Authentication Azure Active Directory Access Keys POSIX-style ACLs Access Keys Transparent Server-side Encryption* Client-Side Encryption Authorization Data Encryption * Features arriving by GA
Ingress and Egress Services ADL SDKs Tools ADL REST endpoints Azure Data Factory ADL Copy Service Azure Import/Export Service Azure Stream Analytics* Apache Sqoop DistCp Azure Portal Azure PowerShell Azure X-Platform CLI .NET SDK Node.Js SDK Java SDK * Python SDK * Curl Any HTTP REST Client * Features arriving by GA
Integration with Azure Data Factory Sources Sinks Azure Blob Azure Table Azure Blob Azure SQL Database Azure SQL Data Warehouse Azure Table Azure DocumentDB Azure Data Lake Store Azure SQL Database SQL Server File system Azure SQL Data Warehouse Oracle database MySQL database Azure DocumentDB DB2 database Teradata database Azure Data Lake Store Sybase database PostgreSQL database SQL Server
http://aka.ms/ AzureDataLake