Warehouse of the Future Lakes Vs Marts Simon Whiteley|
41 Slides2.31 MB
Warehouse of the Future Lakes Vs Marts Simon Whiteley Adatis 06/04/2016
Why Cloud? Azure Data Warehouse Azure Data Lakes The New BI Architecture Q&A
Management Responsibility On Prem IaaS PaaS You Microsoft SaaS
Traditional Warehouse Architecture SSIS SSIS Stage SSIS Clean Warehous e
ETL
? Azur e On Prem AzCopy SSIS Feature Pack for Azure SSIS C# Task Azure Storage Explorer More!
Why Cloud? Azure Data Warehouse
Data Warehouse Massively Parallel Processing Clustered ColumnStores Distribution & Partitioning
PDW/APS Scaling Control ¼ Rack 8 Readers 8 Writers 8 Distributions
PDW/APS Scaling Control ¼ Rack 16 Readers 16 Writers 16 Distributions ¼ Rack
PDW/APS Scaling Control ¼ Rack 32 Readers ¼ Rack 32 Writers 32 Distributions ¼ Rack ¼ Rack
60 Distribution Nodes 600,000,000 Records
60 Distribution Nodes
Distribute by client – select * where clientId 321 Query Time Node 33 0 2 4 6 8 10 12 14 16
Query Time Node 58 Distribute by other – select * where clientId 321 Node 55 Node 52 Node 49 Node 46 Node 43 Node 40 Node 37 Node 34 Node 31 Node 28 Node 25 Node 22 Node 19 Node 16 Node 13 Node 10 Node 7 Node 4 Node 1 0 0.5 1 1.5 2 2.5 Query Time 3 3.5 4 4.5
100 DWUs Control Compute 8 Readers, 60 Writers 8 Readers
200 DWUs Control Compute 8 Readers Compute 8 Readers 16 Readers, 60 Writers
400 DWUs Control Compute 8 Readers Compute 8 Readers Compute 8 Readers Compute 8 Readers 32 Readers, 60 Writers
SSIS ADF BCP File, general Control Compute Compute Compute Compute
SSIS Control File, general Compute Compute Compute File, general File, general Compute
Control Compute File, general Compute Compute File, general Compute File, general PolyBas e File, general Source Files
Demo
Azure Data Warehouse External Table File, general Transform Table Warehouse Table ELT
Why Cloud? Azure Data Warehouse Azure Data Lakes
Data Lake Store Web HDFS Any File Type Any Size
Data Lake Analytics PAYG Scalable Processing New, Simple Querying Language
U-SQL C# meets SQL Natively Parallel VS Data Lake Tools
Demo
U-SQL File, general File, general File, general File, general File, general File, general File, general Data Lake Store File, general External Table TEL Warehouse Table Azure Data Warehouse
Why Cloud? Azure Data Warehouse Azure Data Lakes The New BI Architecture
Data Lake Store Scales as a system TEL ELT Scales for each job independently Data Warehouse
TELT
What’s Missing? Polybase over Data Lake Store Azure DW as U-SQL Destination Upload to Data Lake Store Component Azure DW Development Tools
Why Cloud? Azure Data Warehouse Azure Data Lakes The New BI Architecture Q&A
sqlsurrey.sqlpass.org @SQLSurrey
Thanks for Listening Simon Whiteley Principal BI Consultant @MrSiWhiteley http://blogs.adatis.co.uk