SlideShare une entreprise Scribd logo
1  sur  24
1© Cloudera, Inc. All rights reserved.
Cloudera’s Analytic Database:
BI & SQL Analytics in a Hybrid
Cloud World
Alex Gutow | Product Marketing
Greg Rahn | Product Management
2© Cloudera, Inc. All rights reserved.
What’s Driving Analytics to the Cloud?
Big data deployments in cloud
are accelerating:
● Increased Agility: End-user self-service
● Elasticity: Optimize infrastructure usage
● Lower Overall TCO
● Executive Mandate: Minimize on-prem
datacenter footprint
3© Cloudera, Inc. All rights reserved.
POLL
What best describes your cloud strategy today?
•Standardizing on single-cloud deployment (now/future)
•Multi-cloud deployment (now/future)
•Hybrid on-prem and cloud (now/future)
•Not sure
•No cloud plans yet
4© Cloudera, Inc. All rights reserved.
Key Applications
EDW
Optimization
Data Preparation
Self-Service BI &
Exploration
Use your EDW more
efficiently by offloading
workloads to Hadoop
Fast, flexible ETL over large
data volumes, so data is always
ready for your business
Fastest time-to-insights with a modern
analytic database designed with
Hadoop’s flexibility and agility
5© Cloudera, Inc. All rights reserved.
Key Benefits
An analytic database designed for Hadoop
High-Performance BI and SQL Analytics
Flexibility for Data and Use Case Variety
Cost-effective Scale for Today and Tomorrow
Go Beyond SQL with an Open Architecture
6© Cloudera, Inc. All rights reserved.
Anatomy of an Analytic Database
Cloudera Decoupled by Design
Query Engine
Storage Engine
Catalog
Query Engine
(Impala)
Catalog
(HMS)
Monolithic Analytic Database Modern Analytic Database
Storage
(Kudu)
Storage
(S3)
Storage
(HDFS)
7© Cloudera, Inc. All rights reserved.
Traditional Monolithic Analytic Databases
No Cloud Elasticity or Cloud
Storage Integration
Rigid Data Model with Tightly
Coupled Storage/Compute
Limited to SQL with Data
Movement Necessary
Static Sizing
∞
COMPUTE
STORE
8© Cloudera, Inc. All rights reserved.
Advantages of Our Approach
Cloud-Native & On-Premise
Go Beyond SQL
• Open Architecture: Open
formats and open storage
• Shared data across SQL and
non-SQL workloads
Data Flexibility
• Faster, more agile data
acquisition
• Data portability: Open
formats and open storage
Cost-Effective Scalability
• Elastic scale on-prem or in
the cloud
• Cloud-native pay-per-use
and transience
• Proven at big data scale
Hybrid
• Runs across multi-cloud &
on-prem
• Multi-storage over S3, HDFS,
Kudu, Isilon, DSSD, etcShared Data
9© Cloudera, Inc. All rights reserved.
Cost-Efficiencies & Flexibility in the Cloud
Primary Analytic Database Patterns
Only pay for what you need,
when you need it
▪ Transient clusters
▪ Object storage centric
▪ Cloud-native deployment
ETL
Reduce Operating Costs New Insights, New Revenue
BI/Analytics
Explore and analyze all data,
wherever it lives
▪ Long-running clusters
▪ Object storage or local storage
▪ Lift-and-shift deployment
10© Cloudera, Inc. All rights reserved.
Add Use Cases, Analytics,
and Data On-Demand
• Avoid the IT backlog with instant
access to all data
• On-demand clusters query directly
on shared object storage
Predictable Results
Whenever You Want
• Consistent query performance,
even during peak times
• Multi-tenancy via isolated clusters
on shared data
Just-in-Time Resources
• Real-time capacity for your needs,
as they change
• Elastically grow/shrink your cluster
via decoupled architecture
Contention-Free ETL
• ETL anytime without impacting
other workloads or risking SLAs
• Separate ETL clusters as-needed on
shared data
Benefits Across the Business
ETL and BI/Analytics in the cloud
11© Cloudera, Inc. All rights reserved.
Demo
Querying against cloud object stores
12© Cloudera, Inc. All rights reserved.
Deployment Patterns: ETL in the Cloud
Object Storage
Batch
Cluster
Transient Batch (most flexible)
Spin up clusters as needed
● On-demand/spot instances
● Usage-based pricing
● Sized for workload
● Cluster per tenant/user
Batch
Cluster
Batch
Cluster
Persistent Batch (most control)
Persistent cluster(s) for frequent ETL
● Reserved instances
● Node-based pricing
● Grow/shrink
● Cluster per tenant group
Persistent
Cluster
Batch
Persistent Batch on HDFS (fastest)
Top performance for frequent ETL
● Reserved instances
● Node-based pricing
● Grow/shrink
● Shared across tenant groups
Batch
Persistent
Cluster
Batch Batch
Persistent Cluster
HDFS
Batch Batch
13© Cloudera, Inc. All rights reserved.
Deployment Patterns: BI/Analytics in the Cloud
Three Architecture Options to Optimize Price/Performance
Object Storage
Transient
Cluster
Transient BI (infrequent usage)
Spin up clusters when needed
● On-demand instances
● Usage-based pricing
● Grow/shrink
● Cluster per tenant or user
Persistent BI (regular usage)
Persistent clusters for BI any time
● Reserved instances
● Node-based pricing
● Grow/shrink
● Cluster per tenant group
Persistent
Cluster
Persistent BI with Local Storage (fastest)
Max speed for more regular workloads
● Reserved instances
● Node-based pricing
● Less frequent grow/shrink
● Shared cluster for shared local data
Persistent Cluster HDFS and/or
Kudu
Persistent
Cluster
Transient
Cluster
Default Choice
14© Cloudera, Inc. All rights reserved.
Persistent BI on Object Storage
Best for elasticity (and speed vs transient)
● This is usually the best choice
● Best when workloads are:
o Flexible and changing
o Frequent during most working days
o Not scheduled for fixed hours
● Benefits include:
o Predictable results readily available
o Full multi-tenant isolation
o Common data in shared object storage
o Grow/shrink for TCO efficiency
● Tradeoffs:
o Per node performance of object storage (use
more, cheaper nodes)
Object Storage
Persistent BI (regular usage)
Persistent clusters for ready availability
● Reserved instances
● Node-based pricing
● Grow/shrink
● Cluster per tenant group
Persistent
Cluster
Persistent
Cluster
Default Choice
15© Cloudera, Inc. All rights reserved.
Persistent BI with Locally-Attached Storage
Best performance for consistent workloads
● Best when workloads are:
o Regular and consistent
o Consistently querying common data
o Tight SLAs for performance
o Fast changing data (that needs Kudu)
o Running without object storage (eg. Azure, GCE)
● Benefits include:
o Faster performance per node on local data
o Ability to query object storage for rest of data
● Tradeoffs:
o Less elastic than object stored based clusters
o Less isolation for multi-tenant workloads using
same HDFS data
o Cost if there are off-peak hours
Object Storage
Persistent BI with HDFS (fastest)
Max speed for more regular workloads
● Reserved instances
● Node-based pricing
● Less frequent grow/shrink
● Shared cluster for shared HDFS data
Persistent
Cluster
HDFS and/or
Kudu
16© Cloudera, Inc. All rights reserved.
Transient BI on Object Storage
Best TCO for infrequent usage
Object Storage
Cloudera
Director
● Best when workloads are:
o Infrequent or scheduled
● Benefits include:
o Lowest TCO with clusters only when needed
o Full multi-tenant isolation
o Common data in shared object storage
● Tradeoffs:
o Delay to spin-up clusters when needed
o Capability of BI users to spin up clusters
o Per node performance of object storage (use
more, cheaper nodes)
Shared
HMS DB
Transient
Cluster
Transient BI (infrequent usage)
Spin up clusters when needed.
● On-demand instances
● Usage-based pricing
● Grow/shrink
● Cluster per tenant or user
Transient
Cluster
17© Cloudera, Inc. All rights reserved.
• Engine: Impala
• Data Formats:
• Use Parquet, Snappy, and 256 MB blocks for best I/O efficiency (especially on object storage)
• Avoid small files to avoid performance issues (especially with object storage)
• Metadata (Hive metastore):
• Use local RDBMS for clusters that use locally-stored HDFS/Kudu data
• Use local RDBMS for persisted clusters (or transient clusters with Kerberos or Sentry)
• Use a shared RDBMS for transient single-tenant clusters without Kerberos or Sentry
• Avoid sharing RDBMS for non-data sharing clusters
• Deployment and Administration:
• Use Cloudera Manager for persistent clusters
• Use Cloudera Director for transient clusters
• Deploy NLB for persistent clusters as usual (and only when needed for transience)
• Monitor workloads with Cloudera Manager
• Security:
• Transient BI clusters: use VPC with strict security groups
• Persistent BI clusters: (see next slide)
Basic Guidelines for BI in the cloud
18© Cloudera, Inc. All rights reserved.
Security Guidelines for BI in the cloud
Persistent clusters
• Identity:
• Tie the cluster to the corporate user directory (AD or LDAP) using Linux SSSD
• Authentication:
• Kerberos for internal CDH services
• AD (LDAP) for BI user/tools with Impala
• Authorization:
• Use Sentry for authorization per BI cluster (shared RDBMS with Sentry is not yet available)
• Encryption:
• With AWS, use SSE-S3 server-side encryption
• If you need HDFS-level encryption or keys outside AWS, then use Persisted Cluster with Local Storage using CM
NOTE: if all users for a cluster have access to all data in the cluster, skip steps above and just use VPC with
strict security groups
19© Cloudera, Inc. All rights reserved.
Real-World Use Cases
20© Cloudera, Inc. All rights reserved.
Helping Companies with a Global Value
Chain Save Millions
• 360° view of supply chain process in
seconds with data from suppliers,
manufacturing, equipment, field
service, IoT and repair
• Improved product quality by identifying
and addressing supply chain issues in
near-real time
• $15-$25 million savings annually for
Siemens clients
• Costs 90% less per TB than RDBMS; 75%
less per TB than Netezza
CUSTOMER 360
21© Cloudera, Inc. All rights reserved.
Providing a complete view of consumer
watching and buying habits
• Helps customers optimize their ad
spend for greater campaign ROI
• Improves processing performance as
data volumes double
• Boosts agility and flexibility and
reduces risk with hybrid and
multi-cloud strategy
CUSTOMER 360
22© Cloudera, Inc. All rights reserved.
Measure user interaction across the
ecosystem, help direct R&D and
development spend
• Virtuous cycle: Identify features that
facilitate sharing of content that drive
new customers
• Real-time streaming and batch data
from product logs, web analytics,
channel data and ERP
• Impala connects to third-party data
wrangling and BI tools for fast reporting
23© Cloudera, Inc. All rights reserved.
Next Steps
Learn about Operational and Data Engineering
Workloads in the Cloud
www.cloudera.com/about-cloudera/events/webinars/cloud-webinar-series.html
Get Started with Impala in
the Cloud
Check Out the Latest
Benchmarks
• www.cloudera.com/downloads
• www.cloudera.com/documentation/e
nterprise/latest/topics/impala_s3.html
http://blog.cloudera.com/blog/2016/09/apache-
impala-incubating-vs-amazon-redshift-s3-
integration-elasticity-agility-and-cost-
performance-benefits-on-aws/
24© Cloudera, Inc. All rights reserved.
Thank you

Contenu connexe

Tendances

Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester WebinarCloudera, Inc.
 
The Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnThe Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnCloudera, Inc.
 
Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Cloudera, Inc.
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduCloudera, Inc.
 
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...Cloudera, Inc.
 
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr
Cloudera, Inc.
 
Intuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchIntuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchCloudera, Inc.
 
Transforming Insurance Analytics with Big Data and Automated Machine Learning

Transforming Insurance Analytics with Big Data and Automated Machine Learning
Transforming Insurance Analytics with Big Data and Automated Machine Learning

Transforming Insurance Analytics with Big Data and Automated Machine Learning
Cloudera, Inc.
 
Part 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to EndPart 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to EndCloudera, Inc.
 
How to Lower TCO and Avoid Cloud Lock-in

How to Lower TCO and Avoid Cloud Lock-in
How to Lower TCO and Avoid Cloud Lock-in

How to Lower TCO and Avoid Cloud Lock-in
Cloudera, Inc.
 
Big data journey to the cloud rohit pujari 5.30.18
Big data journey to the cloud   rohit pujari 5.30.18Big data journey to the cloud   rohit pujari 5.30.18
Big data journey to the cloud rohit pujari 5.30.18Cloudera, Inc.
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Cloudera, Inc.
 
Part 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache KuduPart 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache KuduCloudera, Inc.
 
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudPart 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudCloudera, Inc.
 
Building a Data Hub that Empowers Customer Insight (Technical Workshop)
Building a Data Hub that Empowers Customer Insight (Technical Workshop)Building a Data Hub that Empowers Customer Insight (Technical Workshop)
Building a Data Hub that Empowers Customer Insight (Technical Workshop)Cloudera, Inc.
 
A Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber ThreatsA Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber ThreatsCloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Moving Beyond Lambda Architectures with Apache Kudu
Moving Beyond Lambda Architectures with Apache KuduMoving Beyond Lambda Architectures with Apache Kudu
Moving Beyond Lambda Architectures with Apache KuduCloudera, Inc.
 
Apache Impala (incubating) 2.5 Performance Update
Apache Impala (incubating) 2.5 Performance UpdateApache Impala (incubating) 2.5 Performance Update
Apache Impala (incubating) 2.5 Performance UpdateCloudera, Inc.
 
Customer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWSCustomer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWSCloudera, Inc.
 

Tendances (20)

Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester Webinar
 
The Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnThe Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in Churn
 
Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
 
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
 
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr

 
Intuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchIntuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with Search
 
Transforming Insurance Analytics with Big Data and Automated Machine Learning

Transforming Insurance Analytics with Big Data and Automated Machine Learning
Transforming Insurance Analytics with Big Data and Automated Machine Learning

Transforming Insurance Analytics with Big Data and Automated Machine Learning

 
Part 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to EndPart 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to End
 
How to Lower TCO and Avoid Cloud Lock-in

How to Lower TCO and Avoid Cloud Lock-in
How to Lower TCO and Avoid Cloud Lock-in

How to Lower TCO and Avoid Cloud Lock-in

 
Big data journey to the cloud rohit pujari 5.30.18
Big data journey to the cloud   rohit pujari 5.30.18Big data journey to the cloud   rohit pujari 5.30.18
Big data journey to the cloud rohit pujari 5.30.18
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera

 
Part 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache KuduPart 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache Kudu
 
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudPart 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
 
Building a Data Hub that Empowers Customer Insight (Technical Workshop)
Building a Data Hub that Empowers Customer Insight (Technical Workshop)Building a Data Hub that Empowers Customer Insight (Technical Workshop)
Building a Data Hub that Empowers Customer Insight (Technical Workshop)
 
A Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber ThreatsA Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber Threats
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Moving Beyond Lambda Architectures with Apache Kudu
Moving Beyond Lambda Architectures with Apache KuduMoving Beyond Lambda Architectures with Apache Kudu
Moving Beyond Lambda Architectures with Apache Kudu
 
Apache Impala (incubating) 2.5 Performance Update
Apache Impala (incubating) 2.5 Performance UpdateApache Impala (incubating) 2.5 Performance Update
Apache Impala (incubating) 2.5 Performance Update
 
Customer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWSCustomer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWS
 

Similaire à Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World

Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18Cloudera, Inc.
 
Cloudian Webinar - 7 Key Reasons why Object Storage lowers Storage TCO
Cloudian Webinar - 7 Key Reasons why Object Storage lowers Storage TCOCloudian Webinar - 7 Key Reasons why Object Storage lowers Storage TCO
Cloudian Webinar - 7 Key Reasons why Object Storage lowers Storage TCOStorage Switzerland
 
Webinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it BetterWebinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it BetterStorage Switzerland
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAlluxio, Inc.
 
IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015Doug O'Flaherty
 
Big data journey to the cloud 5.30.18 asher bartch
Big data journey to the cloud 5.30.18   asher bartchBig data journey to the cloud 5.30.18   asher bartch
Big data journey to the cloud 5.30.18 asher bartchCloudera, Inc.
 
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera, Inc.
 
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5Doug O'Flaherty
 
How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...Alluxio, Inc.
 
Presentation dell™ power vault™ md3
Presentation   dell™ power vault™ md3Presentation   dell™ power vault™ md3
Presentation dell™ power vault™ md3xKinAnx
 
Choosing the Right Data Storage Solution
Choosing the Right Data Storage SolutionChoosing the Right Data Storage Solution
Choosing the Right Data Storage SolutionAmazon Web Services
 
Five Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSFive Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSCloudera, Inc.
 
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Community
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAlluxio, Inc.
 
Getting more into GCP.pdf
Getting more into GCP.pdfGetting more into GCP.pdf
Getting more into GCP.pdfKnoldus Inc.
 
Big Data on Cloud Native Platform
Big Data on Cloud Native PlatformBig Data on Cloud Native Platform
Big Data on Cloud Native PlatformSunil Govindan
 
Big Data on Cloud Native Platform
Big Data on Cloud Native PlatformBig Data on Cloud Native Platform
Big Data on Cloud Native PlatformSunil Govindan
 
OpenStack Cinder, Implementation Today and New Trends for Tomorrow
OpenStack Cinder, Implementation Today and New Trends for TomorrowOpenStack Cinder, Implementation Today and New Trends for Tomorrow
OpenStack Cinder, Implementation Today and New Trends for TomorrowEd Balduf
 

Similaire à Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World (20)

Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18
 
Cloudian Webinar - 7 Key Reasons why Object Storage lowers Storage TCO
Cloudian Webinar - 7 Key Reasons why Object Storage lowers Storage TCOCloudian Webinar - 7 Key Reasons why Object Storage lowers Storage TCO
Cloudian Webinar - 7 Key Reasons why Object Storage lowers Storage TCO
 
Webinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it BetterWebinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it Better
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
 
IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015
 
Big data journey to the cloud 5.30.18 asher bartch
Big data journey to the cloud 5.30.18   asher bartchBig data journey to the cloud 5.30.18   asher bartch
Big data journey to the cloud 5.30.18 asher bartch
 
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
 
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
 
How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...
 
Presentation dell™ power vault™ md3
Presentation   dell™ power vault™ md3Presentation   dell™ power vault™ md3
Presentation dell™ power vault™ md3
 
Choosing the Right Data Storage Solution
Choosing the Right Data Storage SolutionChoosing the Right Data Storage Solution
Choosing the Right Data Storage Solution
 
Five Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSFive Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWS
 
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
 
Getting more into GCP.pdf
Getting more into GCP.pdfGetting more into GCP.pdf
Getting more into GCP.pdf
 
Big Data on Cloud Native Platform
Big Data on Cloud Native PlatformBig Data on Cloud Native Platform
Big Data on Cloud Native Platform
 
Big Data on Cloud Native Platform
Big Data on Cloud Native PlatformBig Data on Cloud Native Platform
Big Data on Cloud Native Platform
 
OpenStack Cinder, Implementation Today and New Trends for Tomorrow
OpenStack Cinder, Implementation Today and New Trends for TomorrowOpenStack Cinder, Implementation Today and New Trends for Tomorrow
OpenStack Cinder, Implementation Today and New Trends for Tomorrow
 

Plus de Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 
Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Cloudera, Inc.
 

Plus de Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 
Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18
 

Dernier

Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfStefano Stabellini
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Mater
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...Akihiro Suda
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfInnovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfYashikaSharma391629
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceBrainSell Technologies
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxAndreas Kunz
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 

Dernier (20)

Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdf
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfInnovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. Salesforce
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 

Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World

  • 1. 1© Cloudera, Inc. All rights reserved. Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World Alex Gutow | Product Marketing Greg Rahn | Product Management
  • 2. 2© Cloudera, Inc. All rights reserved. What’s Driving Analytics to the Cloud? Big data deployments in cloud are accelerating: ● Increased Agility: End-user self-service ● Elasticity: Optimize infrastructure usage ● Lower Overall TCO ● Executive Mandate: Minimize on-prem datacenter footprint
  • 3. 3© Cloudera, Inc. All rights reserved. POLL What best describes your cloud strategy today? •Standardizing on single-cloud deployment (now/future) •Multi-cloud deployment (now/future) •Hybrid on-prem and cloud (now/future) •Not sure •No cloud plans yet
  • 4. 4© Cloudera, Inc. All rights reserved. Key Applications EDW Optimization Data Preparation Self-Service BI & Exploration Use your EDW more efficiently by offloading workloads to Hadoop Fast, flexible ETL over large data volumes, so data is always ready for your business Fastest time-to-insights with a modern analytic database designed with Hadoop’s flexibility and agility
  • 5. 5© Cloudera, Inc. All rights reserved. Key Benefits An analytic database designed for Hadoop High-Performance BI and SQL Analytics Flexibility for Data and Use Case Variety Cost-effective Scale for Today and Tomorrow Go Beyond SQL with an Open Architecture
  • 6. 6© Cloudera, Inc. All rights reserved. Anatomy of an Analytic Database Cloudera Decoupled by Design Query Engine Storage Engine Catalog Query Engine (Impala) Catalog (HMS) Monolithic Analytic Database Modern Analytic Database Storage (Kudu) Storage (S3) Storage (HDFS)
  • 7. 7© Cloudera, Inc. All rights reserved. Traditional Monolithic Analytic Databases No Cloud Elasticity or Cloud Storage Integration Rigid Data Model with Tightly Coupled Storage/Compute Limited to SQL with Data Movement Necessary Static Sizing ∞ COMPUTE STORE
  • 8. 8© Cloudera, Inc. All rights reserved. Advantages of Our Approach Cloud-Native & On-Premise Go Beyond SQL • Open Architecture: Open formats and open storage • Shared data across SQL and non-SQL workloads Data Flexibility • Faster, more agile data acquisition • Data portability: Open formats and open storage Cost-Effective Scalability • Elastic scale on-prem or in the cloud • Cloud-native pay-per-use and transience • Proven at big data scale Hybrid • Runs across multi-cloud & on-prem • Multi-storage over S3, HDFS, Kudu, Isilon, DSSD, etcShared Data
  • 9. 9© Cloudera, Inc. All rights reserved. Cost-Efficiencies & Flexibility in the Cloud Primary Analytic Database Patterns Only pay for what you need, when you need it ▪ Transient clusters ▪ Object storage centric ▪ Cloud-native deployment ETL Reduce Operating Costs New Insights, New Revenue BI/Analytics Explore and analyze all data, wherever it lives ▪ Long-running clusters ▪ Object storage or local storage ▪ Lift-and-shift deployment
  • 10. 10© Cloudera, Inc. All rights reserved. Add Use Cases, Analytics, and Data On-Demand • Avoid the IT backlog with instant access to all data • On-demand clusters query directly on shared object storage Predictable Results Whenever You Want • Consistent query performance, even during peak times • Multi-tenancy via isolated clusters on shared data Just-in-Time Resources • Real-time capacity for your needs, as they change • Elastically grow/shrink your cluster via decoupled architecture Contention-Free ETL • ETL anytime without impacting other workloads or risking SLAs • Separate ETL clusters as-needed on shared data Benefits Across the Business ETL and BI/Analytics in the cloud
  • 11. 11© Cloudera, Inc. All rights reserved. Demo Querying against cloud object stores
  • 12. 12© Cloudera, Inc. All rights reserved. Deployment Patterns: ETL in the Cloud Object Storage Batch Cluster Transient Batch (most flexible) Spin up clusters as needed ● On-demand/spot instances ● Usage-based pricing ● Sized for workload ● Cluster per tenant/user Batch Cluster Batch Cluster Persistent Batch (most control) Persistent cluster(s) for frequent ETL ● Reserved instances ● Node-based pricing ● Grow/shrink ● Cluster per tenant group Persistent Cluster Batch Persistent Batch on HDFS (fastest) Top performance for frequent ETL ● Reserved instances ● Node-based pricing ● Grow/shrink ● Shared across tenant groups Batch Persistent Cluster Batch Batch Persistent Cluster HDFS Batch Batch
  • 13. 13© Cloudera, Inc. All rights reserved. Deployment Patterns: BI/Analytics in the Cloud Three Architecture Options to Optimize Price/Performance Object Storage Transient Cluster Transient BI (infrequent usage) Spin up clusters when needed ● On-demand instances ● Usage-based pricing ● Grow/shrink ● Cluster per tenant or user Persistent BI (regular usage) Persistent clusters for BI any time ● Reserved instances ● Node-based pricing ● Grow/shrink ● Cluster per tenant group Persistent Cluster Persistent BI with Local Storage (fastest) Max speed for more regular workloads ● Reserved instances ● Node-based pricing ● Less frequent grow/shrink ● Shared cluster for shared local data Persistent Cluster HDFS and/or Kudu Persistent Cluster Transient Cluster Default Choice
  • 14. 14© Cloudera, Inc. All rights reserved. Persistent BI on Object Storage Best for elasticity (and speed vs transient) ● This is usually the best choice ● Best when workloads are: o Flexible and changing o Frequent during most working days o Not scheduled for fixed hours ● Benefits include: o Predictable results readily available o Full multi-tenant isolation o Common data in shared object storage o Grow/shrink for TCO efficiency ● Tradeoffs: o Per node performance of object storage (use more, cheaper nodes) Object Storage Persistent BI (regular usage) Persistent clusters for ready availability ● Reserved instances ● Node-based pricing ● Grow/shrink ● Cluster per tenant group Persistent Cluster Persistent Cluster Default Choice
  • 15. 15© Cloudera, Inc. All rights reserved. Persistent BI with Locally-Attached Storage Best performance for consistent workloads ● Best when workloads are: o Regular and consistent o Consistently querying common data o Tight SLAs for performance o Fast changing data (that needs Kudu) o Running without object storage (eg. Azure, GCE) ● Benefits include: o Faster performance per node on local data o Ability to query object storage for rest of data ● Tradeoffs: o Less elastic than object stored based clusters o Less isolation for multi-tenant workloads using same HDFS data o Cost if there are off-peak hours Object Storage Persistent BI with HDFS (fastest) Max speed for more regular workloads ● Reserved instances ● Node-based pricing ● Less frequent grow/shrink ● Shared cluster for shared HDFS data Persistent Cluster HDFS and/or Kudu
  • 16. 16© Cloudera, Inc. All rights reserved. Transient BI on Object Storage Best TCO for infrequent usage Object Storage Cloudera Director ● Best when workloads are: o Infrequent or scheduled ● Benefits include: o Lowest TCO with clusters only when needed o Full multi-tenant isolation o Common data in shared object storage ● Tradeoffs: o Delay to spin-up clusters when needed o Capability of BI users to spin up clusters o Per node performance of object storage (use more, cheaper nodes) Shared HMS DB Transient Cluster Transient BI (infrequent usage) Spin up clusters when needed. ● On-demand instances ● Usage-based pricing ● Grow/shrink ● Cluster per tenant or user Transient Cluster
  • 17. 17© Cloudera, Inc. All rights reserved. • Engine: Impala • Data Formats: • Use Parquet, Snappy, and 256 MB blocks for best I/O efficiency (especially on object storage) • Avoid small files to avoid performance issues (especially with object storage) • Metadata (Hive metastore): • Use local RDBMS for clusters that use locally-stored HDFS/Kudu data • Use local RDBMS for persisted clusters (or transient clusters with Kerberos or Sentry) • Use a shared RDBMS for transient single-tenant clusters without Kerberos or Sentry • Avoid sharing RDBMS for non-data sharing clusters • Deployment and Administration: • Use Cloudera Manager for persistent clusters • Use Cloudera Director for transient clusters • Deploy NLB for persistent clusters as usual (and only when needed for transience) • Monitor workloads with Cloudera Manager • Security: • Transient BI clusters: use VPC with strict security groups • Persistent BI clusters: (see next slide) Basic Guidelines for BI in the cloud
  • 18. 18© Cloudera, Inc. All rights reserved. Security Guidelines for BI in the cloud Persistent clusters • Identity: • Tie the cluster to the corporate user directory (AD or LDAP) using Linux SSSD • Authentication: • Kerberos for internal CDH services • AD (LDAP) for BI user/tools with Impala • Authorization: • Use Sentry for authorization per BI cluster (shared RDBMS with Sentry is not yet available) • Encryption: • With AWS, use SSE-S3 server-side encryption • If you need HDFS-level encryption or keys outside AWS, then use Persisted Cluster with Local Storage using CM NOTE: if all users for a cluster have access to all data in the cluster, skip steps above and just use VPC with strict security groups
  • 19. 19© Cloudera, Inc. All rights reserved. Real-World Use Cases
  • 20. 20© Cloudera, Inc. All rights reserved. Helping Companies with a Global Value Chain Save Millions • 360° view of supply chain process in seconds with data from suppliers, manufacturing, equipment, field service, IoT and repair • Improved product quality by identifying and addressing supply chain issues in near-real time • $15-$25 million savings annually for Siemens clients • Costs 90% less per TB than RDBMS; 75% less per TB than Netezza CUSTOMER 360
  • 21. 21© Cloudera, Inc. All rights reserved. Providing a complete view of consumer watching and buying habits • Helps customers optimize their ad spend for greater campaign ROI • Improves processing performance as data volumes double • Boosts agility and flexibility and reduces risk with hybrid and multi-cloud strategy CUSTOMER 360
  • 22. 22© Cloudera, Inc. All rights reserved. Measure user interaction across the ecosystem, help direct R&D and development spend • Virtuous cycle: Identify features that facilitate sharing of content that drive new customers • Real-time streaming and batch data from product logs, web analytics, channel data and ERP • Impala connects to third-party data wrangling and BI tools for fast reporting
  • 23. 23© Cloudera, Inc. All rights reserved. Next Steps Learn about Operational and Data Engineering Workloads in the Cloud www.cloudera.com/about-cloudera/events/webinars/cloud-webinar-series.html Get Started with Impala in the Cloud Check Out the Latest Benchmarks • www.cloudera.com/downloads • www.cloudera.com/documentation/e nterprise/latest/topics/impala_s3.html http://blog.cloudera.com/blog/2016/09/apache- impala-incubating-vs-amazon-redshift-s3- integration-elasticity-agility-and-cost- performance-benefits-on-aws/
  • 24. 24© Cloudera, Inc. All rights reserved. Thank you