SlideShare a Scribd company logo
1 of 30
1© Cloudera, Inc. All rights reserved.
Operational Database
in the Public Cloud
Ryan Lippert | Cloudera Product Marketing
2© Cloudera, Inc. All rights reserved.
3© Cloudera, Inc. All rights reserved.
What’s Driving Operations to the Cloud?
● Increased Agility: End-user self-service
● Elasticity: Optimize infrastructure usage
● Lower Overall TCO
● Executive Mandate: Minimize on-
premises datacenter footprint
Big data deployments in cloud are accelerating; why?
4© Cloudera, Inc. All rights reserved.
Overview
Cloudera’s Operational Database
5© Cloudera, Inc. All rights reserved.
Cloudera’s Operational Database
Build Data-Driven Applications to Deliver Real-Time Insights
Operational Database
Attributes
Fast
• Real time model serving w/ <15ms latency
• Limitless concurrency (>100M updates/sec)
Secure
• Native encryption
• Audit
Easy
• Stream ingest, processing, NoSQL, and real-time
analytics together
• Best-in-class management and cloud automation
6© Cloudera, Inc. All rights reserved.
Visualization
Processing/
Exploration
Storage
Unique Components
Cloudera’s Operational Database
Fast/random reads and writes
via a high-performance,
distributed NoSQL data store
HBase
Fast analytics on fast data
with a relational structure
Kudu
Integration with the leading BI
tools
BI Partners
Faceted, text-based search for
data exploration and
democratization
Cloudera
Search
Powerful and flexible
processing, streaming, and
SQL
Spark
Multi-Storage
Multi-Environment
Encryption, Key Trustee
Navigator
Storage & Governance
7© Cloudera, Inc. All rights reserved.
Operational
Database
Durable, low latency storage for web
applications, message stores, and
mission critical operational activities.
Web-Scale Data Depot
Identifying meaningful events
based on multiple data streams
and taking action.
Complex Event Processing
Use data and current/past
events to score and serve the
likelihood of subsequent
events.
Model Scoring/Serving
8© Cloudera, Inc. All rights reserved.
Web Scale Data Depot
Key Applications for Operational Database
Real-Time Data Access
Low-latency and high concurrency enable broad-based
access to real-time information, yielding informed
decisions
Enterprise Data Apps
Build company-wide, easy-to-use apps to enable
employees or customers to interact with pertinent data
IoT Data Ingestion and Collection
Ingest, process, and serve IoT data in real time to take
advantage of instrumentation investments
Web-Scale Data Management
Store information from broad sets of customer
interaction occurring online, in-app, or in-store
Ingests over 4 million
homes worth of
energy data, and
provides reports that
help customers save
millions
Serves real-time
market data for over
40M instruments;
ingests 2.5M
transactions/second,
serves 3.5M
messages/second
9© Cloudera, Inc. All rights reserved.
Complex Event Processing
Key Applications for Operational Database
Cybersecurity and
Advanced Persistent Threats (APT)
Protect data with data; by keeping full-fidelity records
of network activity, anomalies can be surfaced and
thwarted
Network Health Monitoring
Maintain performance within an enterprise network by
identifying and remedying problems in real-time
IoT Predictive Maintenance
Use IoT sensor data from an unlimited number of
sources to proactively predict and fix problems with
physical equipment
Reduced detection of
APT from hundreds of
days to minutes; now
scales to thousands of
endpoints vs.
hundreds
Remote diagnostics
IoT platform reduces
fleet maintenance on
180,000+ vehicles by
30-40%
10© Cloudera, Inc. All rights reserved.
Model Scoring & Serving
Key Applications for Operational Database
Cross-Sell/Up-Sell & Personalization
Leverage a long history of purchases among a broad
population to create personalized offers, in real-time,
that are likely to be actioned by shoppers
Fraud Prevention
Compare recent financial transactions/claims with a
company-wide history of nefarious transactions to
identify and prevent fraud in real-time
Customer Profitability
Quickly identify high-value customers via individual
characteristics that correlate with profitability; focus
acquisition and retention on these segments
Lower cart
abandonment; 3x
higher open email
rate; decreased
bounce rate by 20%;
time to update
indexes from a day to
15 min
Can now provide
customers with 300-
400% higher CTR, 10x
more return visits, and
longer sessions
11© Cloudera, Inc. All rights reserved.
Benefits of the Public Cloud
Cloudera’s Operational Database
12© Cloudera, Inc. All rights reserved.
What’s Driving Operations to the Cloud?
● Increased Agility: End-user self-service
● Elasticity: Optimize infrastructure usage
● Lower overall TCO
● Executive Mandate: Minimize on-
premises datacenter footprint
Big data deployments in cloud are accelerating; why?
13© Cloudera, Inc. All rights reserved.
Advantages of Our Approach
Cloud-Native & On-Premises
Go Beyond SQL
• Open Architecture: Open
formats and open storage
• Shared data across SQL and
non-SQL workloads
Data Flexibility
• Faster, more agile data
acquisition
• Data portability: Open
formats and open storage
Cost-Effective Scalability
• Elastic scale on-prem or in
the cloud
• Cloud-native pay-per-use
and transience
• Proven at big data scale
Hybrid
• Runs across multi-cloud &
on-prem
• Multi-storage over S3, HDFS,
Kudu, Isilon, DSSD, etcShared Data
14© Cloudera, Inc. All rights reserved.
Operational Database in the Cloud
Public Cloud Benefits
Cost
Considerations
• Low-cost backup and
disaster recovery
• Development and
testing environments
easy to deploy and
decommission
Convenience
Considerations
• Elastic growth for
tightly provisioned
workloads makes
expansion easy, and
enables a lower-cost
steady state
• Fast and easy
provisioning of
additional clusters
helps projects move
quickly
15© Cloudera, Inc. All rights reserved.
Operational Database Cloud Architecture
Applications
Long lived
Prod Cluster
Operational
DB
Director
Provision
CM
Manage/Provision
Operational
DB
Temporary
Dev/Test Cluster
Burst Batch
Processing
Data copy for burst processing
or read/write temp clusters
1
Easy
Provisioning
2
Dev/Test
Provisioning
3
Burst processing of
large amounts of data
4
Low cost
backups
Data
Sources Spark
Streaming
S31
EBS2
AWS
Infrastructure
Benefits
1. S3, Azure Blob Storage, etc.
2. EBS, Azure Premium Storage, etc.
16© Cloudera, Inc. All rights reserved.
Easy Provisioning of New Clusters
Operational Database in the Public Cloud
Easy
Provisioning
Business Challenge
• On-premises installations can be slow to roll out, particularly for PoC
engagements with long procurement cycles
• Some organizations can take 3-6 months to execute this process,
slowing developers and creating anchors to legacy technology
Cloud-Enabled Solution
• Cloudera enables customers to go to the public cloud with industry-
leading software
• Workloads can be moved across clouds or to on premise clusters,
preventing cloud lock-in
Details
• Cloudera provides the ability to quickly provision a new cluster for
operational use cases in the public cloud
• Rapid provisioning without the permanent cost of internal
infrastructure helps make Cloudera a fast and easy choice for PoC’s
• Companies with Cloudera-trained employees have the ability to test
and prove/disprove new use cases quickly, delivering more value to
business
17© Cloudera, Inc. All rights reserved.
Easy Provisioning of New Clusters
Operational Database in the Public Cloud
Application
Operational
DB
Cloud Instances
Director
Provision
Cloud Storage
Application
Cloud Instances
Direct Attached Storage
Fast Cloud
Storage1
Operational
DB
Instance
Storage
Director
Provision
1. EBS, Azure Premium Storage, etc.
18© Cloudera, Inc. All rights reserved.
Spark Streaming and Operational DB in the cloud
For real-time processing and serving architectures
Availability Zone Applications
Ingest
Streaming
Data
Spark Streaming
running on a
dedicated
permanent cluster
Spark
Streaming
Operational
DB
Operational DB on a
dedicated permanent
cluster
Both clusters in the
same availability
zone
Deliver/
Serve
Data
19© Cloudera, Inc. All rights reserved.
Creating Dev and Test Environments
Operational Database in the Public Cloud
Development
and Testing
Environments
Business Challenge
• Development and testing environments are expensive to
maintain/configure, difficult to secure with real data, and have
different projects competing for a finite pool of resources
Cloud-Enabled Solution
• Cloudera in the public cloud enables development and testing
environments to be provisioned quickly, securely, and for the
required period of time
Details
• Public cloud offerings with Cloudera enable the ability to easily and
quickly replicate a production instance of data to a testing
environment
• Test environments can be configured with all the security of the
production, without the risk of overloading critical infrastructure
• Temporary instances mean environments are purpose-created,
time-bound, and less competition for test/dev resources
20© Cloudera, Inc. All rights reserved.
Creating Development and Testing Environments
Fast, Easy, and Secure Development & Testing in the Cloud
Production
Ready
Data
Application
Production
Instance Delivers
Data to Users
and Applications
Cloud
Object
Storage1
Fast Cloud
Storage2
Develop-
ment and
Testing
Production Environment Dev/Test Environment
Secure Dev/Test
Environment
1. S3, Azure Blob Storage, etc.
2. EBS, Azure Premium Storage, etc.
21© Cloudera, Inc. All rights reserved.
Burst Provisioning for ETL
Operational Database in the Public Cloud
Leverage Cloud
for ETL
Business Challenge
• ETL is a difficult process that consumes a large amount of resources
and can create bottlenecks depending on the nature of batch
processes or data spikes
Cloud-Enabled Solution
• Cloudera can leverage the elasticity of resources in the public cloud
to help businesses handle large ETL jobs, regardless of whether they
are anticipated or not
Details
• Unexpected surges in traffic, regular batch jobs that are growing in
size, and other high-volume data ingestion issues can create
bottlenecks that slow insight into the business; standard on-
premises ETL may have a difficult time recovering from the surge,
resulting in lost data
• Public cloud instances of Cloudera enable additional ETL resources
to be added temporarily, overcoming the deluge
22© Cloudera, Inc. All rights reserved.
Burst Provisioning for ETL
Keeping your Operational Database Real-Time in the Cloud
Operational
DBData
Surge
1
Data
Sources
Cloud
StorageData Pushed
to Cloud
Storage
2
Data to
Cloud
Instances
for Batch
Processing
3
AWS EBS
Instances
4
Transformed
Data Returned
to Cloud
Storage
5 Transformed Data
Sent to HBase
Application
6 Data Served to Application
23© Cloudera, Inc. All rights reserved.
Backup and Disaster Recovery
Operational Database in the Public Cloud
Backup and
Disaster Recovery
Business Challenge
• Businesses struggle to create backup of their critical data, including
challenges with geographical dispersion, maintenance costs,
frequency of backup, etc.
Cloud-Enabled Solution
• Public clouds offer the ability to take frequent snapshots of the data
within your CDH cluster
Details
• By snapshotting to a remote public cloud datacenter, companies can
take advantage of geographically dispersed data copies
• Cheap storage enables more frequent backups, enabling a more
recent copy of data to be recovered in case of problems
• Navigate issues associated with data sovereignty
24© Cloudera, Inc. All rights reserved.
Backup and Disaster Recovery
Reduce Risk by Backing-Up Operational Database Data in the Public Cloud
On-Premises
Instance
Cloud Object
Storage1
Cloud
Instance
Snapshot
of Data
1
Cloud Object
Storage1
Restore
2
Snapshot
of Data
1
Restore
2
1. S3, Azure Blob Storage, etc.
25© Cloudera, Inc. All rights reserved.
CDH HBase on EBS vs. EMR HBase on S3
EMR and S3 options for HBase
- Amazon can run HBase on S3 and get consistency via
a proprietary EMR-FS connector
- Cloudera leverages EBS for HBase cloud deployments
Customer aim: performance and price
- S3 saves on storage costs relative to EBS, but
increases compute costs as you need more EC2
instances to get to the same metrics
- So, for S3 storage costs go down, but compute costs
go up; few use cases can combine low cost on both
axises
Qualitative customer concerns
- From Cloudera: High availability, automated
configuration, manageability (monitoring/alerts),
support (from Cloudera’s HBase developers)
26© Cloudera, Inc. All rights reserved.
Instance Recommendations
Easy Provisioning of always on clusters:
Model vCPU Mem(GiB) Storage(GB)
d2.xlarge 4 30.5 3 x 2000 HDD
d2.2xlarge 8 61 6 x 2000 HDD
d2.4xlarge 16 122 12 x 2000 HDD
d2.8xlarge 36 244 24 x 2000 HDD
Data Nodes:
Master Nodes:
Model vCPU Mem(GiB) Storage(GB)
c3.8xlarge 32 60 2 x 320 SSD
Snapshot Backups: S3
Smaller versions are
recommended to stagger the
impact of full block reports and
garbage collection.
The master node memory
should be sized inline with the
cluster size, c3.xlarge supports
very large cluster sizes but
smaller master nodes are
possible.
27© Cloudera, Inc. All rights reserved.
Instance Recommendations
Transient clusters with permanent storage using Director:
Model vCPU Mem(GiB) Storage
C4.large 4 30.5 EBS (4000 Mbps dedicated)
Data Nodes:
Master Nodes:
Model vCPU Mem(GiB) Storage (GB)
c3.8xlarge 32 60 2 x 320 SSD
Storage, throughput workloads (ETL, etc.):
Volume Type Volume Size IOPS Throughput
st1 500 GiB – 16 TiB 500 800 MiB/s
Volume Type Volume Size IOPS Throughput
io1 4 GiB – 16 TiB 20,000 800 MiB/s
Storage, real time workloads (HBase, etc.):
These are default
recommendations; they will
vary based on the specifics of
each use case.
We recommend deploying
more than the lowest tier, SC1,
as the throttling limits are
reached quickly, which brings
down the database.
28© Cloudera, Inc. All rights reserved.
Instance Recommendations
Always on Spark Streaming cluster
• Spark Clusters have homogenous nodes i.e. no special “master” node
Model vCPU Mem(GiB) Storage
m4.2xlarge 8 32 EBS (1000 Mbps dedicated)
Default:
Model vCPU Mem(GiB) Storage
m3.2xlarge 8 61 160GB SSD
Very Memory Intensive Workloads:
Best balance of memory and
compute
Examples are workloads that
cache RDDs/Dataframes or
maintain in-memory state via the
updateStateByKey(…) function.
Model vCPU Mem(GiB) Storage
c4.2xlarge 8 15 EBS (1000 Mbps dedicated)
Very Compute Intensive Workloads:
Examples are workloads that may
perform compute intensive
machine learning operations to
score incoming events.
29© Cloudera, Inc. All rights reserved.
• Get Started with Cloudera in the Cloud:
• www.cloudera.com/downloads
• Learn more about Cloudera’s Operational DB:
• https://www.cloudera.com/solutions/operational-database.html
• Learn about Data Engineering Workloads in the Cloud:
• www.cloudera.com/about-cloudera/events/webinars/cloud-webinar-series.html
Next Steps
30© Cloudera, Inc. All rights reserved.
Thank You

More Related Content

What's hot

Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr
Cloudera, Inc.
 
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud WorldPart 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud WorldCloudera, Inc.
 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester WebinarCloudera, Inc.
 
Data Engineering: Elastic, Low-Cost Data Processing in the Cloud
Data Engineering: Elastic, Low-Cost Data Processing in the CloudData Engineering: Elastic, Low-Cost Data Processing in the Cloud
Data Engineering: Elastic, Low-Cost Data Processing in the CloudCloudera, Inc.
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...Cloudera, Inc.
 
Hadoop Distributed File System (HDFS) Encryption with Cloudera Navigator Key ...
Hadoop Distributed File System (HDFS) Encryption with Cloudera Navigator Key ...Hadoop Distributed File System (HDFS) Encryption with Cloudera Navigator Key ...
Hadoop Distributed File System (HDFS) Encryption with Cloudera Navigator Key ...Cloudera, Inc.
 
Big data journey to the cloud rohit pujari 5.30.18
Big data journey to the cloud   rohit pujari 5.30.18Big data journey to the cloud   rohit pujari 5.30.18
Big data journey to the cloud rohit pujari 5.30.18Cloudera, Inc.
 
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac... Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...Cloudera, Inc.
 
A Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber ThreatsA Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber ThreatsCloudera, Inc.
 
Big data journey to the cloud 5.30.18 asher bartch
Big data journey to the cloud 5.30.18   asher bartchBig data journey to the cloud 5.30.18   asher bartch
Big data journey to the cloud 5.30.18 asher bartchCloudera, Inc.
 
How to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data PlatformHow to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data PlatformCloudera, Inc.
 
Part 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchPart 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchCloudera, Inc.
 
Data Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseData Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseCloudera, Inc.
 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 
Cloudera, Inc.
 
The Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnThe Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnCloudera, Inc.
 
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)Cloudera, Inc.
 
Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Cloudera, Inc.
 
Data Science and CDSW
Data Science and CDSWData Science and CDSW
Data Science and CDSWJason Hubbard
 
Five Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSFive Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSCloudera, Inc.
 

What's hot (20)

Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr

 
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud WorldPart 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester Webinar
 
Data Engineering: Elastic, Low-Cost Data Processing in the Cloud
Data Engineering: Elastic, Low-Cost Data Processing in the CloudData Engineering: Elastic, Low-Cost Data Processing in the Cloud
Data Engineering: Elastic, Low-Cost Data Processing in the Cloud
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
 
Hadoop Distributed File System (HDFS) Encryption with Cloudera Navigator Key ...
Hadoop Distributed File System (HDFS) Encryption with Cloudera Navigator Key ...Hadoop Distributed File System (HDFS) Encryption with Cloudera Navigator Key ...
Hadoop Distributed File System (HDFS) Encryption with Cloudera Navigator Key ...
 
Big data journey to the cloud rohit pujari 5.30.18
Big data journey to the cloud   rohit pujari 5.30.18Big data journey to the cloud   rohit pujari 5.30.18
Big data journey to the cloud rohit pujari 5.30.18
 
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac... Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 
A Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber ThreatsA Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber Threats
 
Big data journey to the cloud 5.30.18 asher bartch
Big data journey to the cloud 5.30.18   asher bartchBig data journey to the cloud 5.30.18   asher bartch
Big data journey to the cloud 5.30.18 asher bartch
 
How to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data PlatformHow to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
 
Part 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchPart 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science Workbench
 
Data Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseData Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the Enterprise
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data Fundamentals
 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 

 
The Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnThe Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in Churn
 
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
 
Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?
 
Data Science and CDSW
Data Science and CDSWData Science and CDSW
Data Science and CDSW
 
Five Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSFive Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWS
 

Viewers also liked

Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1
Cloudera, Inc.
 
Enabling the Connected Car Revolution

Enabling the Connected Car Revolution
Enabling the Connected Car Revolution

Enabling the Connected Car Revolution
Cloudera, Inc.
 
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)Cloudera, Inc.
 
Building a Data Hub that Empowers Customer Insight (Technical Workshop)
Building a Data Hub that Empowers Customer Insight (Technical Workshop)Building a Data Hub that Empowers Customer Insight (Technical Workshop)
Building a Data Hub that Empowers Customer Insight (Technical Workshop)Cloudera, Inc.
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)Cloudera, Inc.
 
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...Cloudera, Inc.
 
Codemotion fuse presentation
Codemotion fuse presentationCodemotion fuse presentation
Codemotion fuse presentationUgo Landini
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekData Driven Innovation
 
Introduction to Spark: Data Analysis and Use Cases in Big Data
Introduction to Spark: Data Analysis and Use Cases in Big Data Introduction to Spark: Data Analysis and Use Cases in Big Data
Introduction to Spark: Data Analysis and Use Cases in Big Data Jongwook Woo
 

Viewers also liked (12)

Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1

 
Top 5 IoT Use Cases
Top 5 IoT Use CasesTop 5 IoT Use Cases
Top 5 IoT Use Cases
 
Enabling the Connected Car Revolution

Enabling the Connected Car Revolution
Enabling the Connected Car Revolution

Enabling the Connected Car Revolution

 
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
 
Building a Data Hub that Empowers Customer Insight (Technical Workshop)
Building a Data Hub that Empowers Customer Insight (Technical Workshop)Building a Data Hub that Empowers Customer Insight (Technical Workshop)
Building a Data Hub that Empowers Customer Insight (Technical Workshop)
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)
 
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
The role of Big Data and Modern Data Management in Driving a Customer 360 fro...
 
Codemotion fuse presentation
Codemotion fuse presentationCodemotion fuse presentation
Codemotion fuse presentation
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
 
The Impala Cookbook
The Impala CookbookThe Impala Cookbook
The Impala Cookbook
 
Introduction to Spark: Data Analysis and Use Cases in Big Data
Introduction to Spark: Data Analysis and Use Cases in Big Data Introduction to Spark: Data Analysis and Use Cases in Big Data
Introduction to Spark: Data Analysis and Use Cases in Big Data
 
UAV-based remote sensing as a monitoring tool for smallholder cropping systems
UAV-based remote sensing as a monitoring tool for smallholder cropping systemsUAV-based remote sensing as a monitoring tool for smallholder cropping systems
UAV-based remote sensing as a monitoring tool for smallholder cropping systems
 

Similar to Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud

Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Cloudera, Inc.
 
Turning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformTurning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformCloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Denodo
 
A deep dive into running data analytic workloads in the cloud
A deep dive into running data analytic workloads in the cloudA deep dive into running data analytic workloads in the cloud
A deep dive into running data analytic workloads in the cloudCloudera, Inc.
 
Cloud computing(Basic).pptx
Cloud computing(Basic).pptxCloud computing(Basic).pptx
Cloud computing(Basic).pptxnischal52
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
C5 accelerating your journey to self-service it
C5   accelerating your journey to self-service itC5   accelerating your journey to self-service it
C5 accelerating your journey to self-service itDr. Wilfred Lin (Ph.D.)
 
5 Applications of Cloud Computing
5 Applications of Cloud Computing5 Applications of Cloud Computing
5 Applications of Cloud ComputingCentriLogic
 
Modernizing your organization's data protection approach, with Yamen Alahmad
Modernizing your organization's data protection approach, with Yamen AlahmadModernizing your organization's data protection approach, with Yamen Alahmad
Modernizing your organization's data protection approach, with Yamen AlahmadVeritas Technologies LLC
 
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | QuboleVasu S
 
Move your oracle apps to oci
Move your oracle apps to ociMove your oracle apps to oci
Move your oracle apps to ociVamsiKrishna815
 
Welcome to Cloud Computing
Welcome to Cloud ComputingWelcome to Cloud Computing
Welcome to Cloud Computingimogokate
 
The 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: ExposedThe 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: ExposedCloudera, Inc.
 
Oracle GoldenGate Cloud Service Overview
Oracle GoldenGate Cloud Service OverviewOracle GoldenGate Cloud Service Overview
Oracle GoldenGate Cloud Service OverviewJinyu Wang
 

Similar to Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud (20)

Introduction of microsoft azure
Introduction of microsoft azureIntroduction of microsoft azure
Introduction of microsoft azure
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Ms.azure in detail
Ms.azure in detailMs.azure in detail
Ms.azure in detail
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
 
Turning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformTurning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data Platform
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
 
A deep dive into running data analytic workloads in the cloud
A deep dive into running data analytic workloads in the cloudA deep dive into running data analytic workloads in the cloud
A deep dive into running data analytic workloads in the cloud
 
Cloud computing(Basic).pptx
Cloud computing(Basic).pptxCloud computing(Basic).pptx
Cloud computing(Basic).pptx
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
IT Resilience Use Case
IT Resilience Use CaseIT Resilience Use Case
IT Resilience Use Case
 
C5 accelerating your journey to self-service it
C5   accelerating your journey to self-service itC5   accelerating your journey to self-service it
C5 accelerating your journey to self-service it
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
5 Applications of Cloud Computing
5 Applications of Cloud Computing5 Applications of Cloud Computing
5 Applications of Cloud Computing
 
Modernizing your organization's data protection approach, with Yamen Alahmad
Modernizing your organization's data protection approach, with Yamen AlahmadModernizing your organization's data protection approach, with Yamen Alahmad
Modernizing your organization's data protection approach, with Yamen Alahmad
 
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
 
Move your oracle apps to oci
Move your oracle apps to ociMove your oracle apps to oci
Move your oracle apps to oci
 
Welcome to Cloud Computing
Welcome to Cloud ComputingWelcome to Cloud Computing
Welcome to Cloud Computing
 
The 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: ExposedThe 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: Exposed
 
Oracle GoldenGate Cloud Service Overview
Oracle GoldenGate Cloud Service OverviewOracle GoldenGate Cloud Service Overview
Oracle GoldenGate Cloud Service Overview
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 
Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Cloudera, Inc.
 
Get started with Cloudera's cyber solution
Get started with Cloudera's cyber solutionGet started with Cloudera's cyber solution
Get started with Cloudera's cyber solutionCloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 
Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18
 
Get started with Cloudera's cyber solution
Get started with Cloudera's cyber solutionGet started with Cloudera's cyber solution
Get started with Cloudera's cyber solution
 

Recently uploaded

Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
How To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTROHow To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTROmotivationalword821
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxAndreas Kunz
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZABSYZ Inc
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Developmentvyaparkranti
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 

Recently uploaded (20)

Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
How To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTROHow To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTRO
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZ
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Development
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 

Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud

  • 1. 1© Cloudera, Inc. All rights reserved. Operational Database in the Public Cloud Ryan Lippert | Cloudera Product Marketing
  • 2. 2© Cloudera, Inc. All rights reserved.
  • 3. 3© Cloudera, Inc. All rights reserved. What’s Driving Operations to the Cloud? ● Increased Agility: End-user self-service ● Elasticity: Optimize infrastructure usage ● Lower Overall TCO ● Executive Mandate: Minimize on- premises datacenter footprint Big data deployments in cloud are accelerating; why?
  • 4. 4© Cloudera, Inc. All rights reserved. Overview Cloudera’s Operational Database
  • 5. 5© Cloudera, Inc. All rights reserved. Cloudera’s Operational Database Build Data-Driven Applications to Deliver Real-Time Insights Operational Database Attributes Fast • Real time model serving w/ <15ms latency • Limitless concurrency (>100M updates/sec) Secure • Native encryption • Audit Easy • Stream ingest, processing, NoSQL, and real-time analytics together • Best-in-class management and cloud automation
  • 6. 6© Cloudera, Inc. All rights reserved. Visualization Processing/ Exploration Storage Unique Components Cloudera’s Operational Database Fast/random reads and writes via a high-performance, distributed NoSQL data store HBase Fast analytics on fast data with a relational structure Kudu Integration with the leading BI tools BI Partners Faceted, text-based search for data exploration and democratization Cloudera Search Powerful and flexible processing, streaming, and SQL Spark Multi-Storage Multi-Environment Encryption, Key Trustee Navigator Storage & Governance
  • 7. 7© Cloudera, Inc. All rights reserved. Operational Database Durable, low latency storage for web applications, message stores, and mission critical operational activities. Web-Scale Data Depot Identifying meaningful events based on multiple data streams and taking action. Complex Event Processing Use data and current/past events to score and serve the likelihood of subsequent events. Model Scoring/Serving
  • 8. 8© Cloudera, Inc. All rights reserved. Web Scale Data Depot Key Applications for Operational Database Real-Time Data Access Low-latency and high concurrency enable broad-based access to real-time information, yielding informed decisions Enterprise Data Apps Build company-wide, easy-to-use apps to enable employees or customers to interact with pertinent data IoT Data Ingestion and Collection Ingest, process, and serve IoT data in real time to take advantage of instrumentation investments Web-Scale Data Management Store information from broad sets of customer interaction occurring online, in-app, or in-store Ingests over 4 million homes worth of energy data, and provides reports that help customers save millions Serves real-time market data for over 40M instruments; ingests 2.5M transactions/second, serves 3.5M messages/second
  • 9. 9© Cloudera, Inc. All rights reserved. Complex Event Processing Key Applications for Operational Database Cybersecurity and Advanced Persistent Threats (APT) Protect data with data; by keeping full-fidelity records of network activity, anomalies can be surfaced and thwarted Network Health Monitoring Maintain performance within an enterprise network by identifying and remedying problems in real-time IoT Predictive Maintenance Use IoT sensor data from an unlimited number of sources to proactively predict and fix problems with physical equipment Reduced detection of APT from hundreds of days to minutes; now scales to thousands of endpoints vs. hundreds Remote diagnostics IoT platform reduces fleet maintenance on 180,000+ vehicles by 30-40%
  • 10. 10© Cloudera, Inc. All rights reserved. Model Scoring & Serving Key Applications for Operational Database Cross-Sell/Up-Sell & Personalization Leverage a long history of purchases among a broad population to create personalized offers, in real-time, that are likely to be actioned by shoppers Fraud Prevention Compare recent financial transactions/claims with a company-wide history of nefarious transactions to identify and prevent fraud in real-time Customer Profitability Quickly identify high-value customers via individual characteristics that correlate with profitability; focus acquisition and retention on these segments Lower cart abandonment; 3x higher open email rate; decreased bounce rate by 20%; time to update indexes from a day to 15 min Can now provide customers with 300- 400% higher CTR, 10x more return visits, and longer sessions
  • 11. 11© Cloudera, Inc. All rights reserved. Benefits of the Public Cloud Cloudera’s Operational Database
  • 12. 12© Cloudera, Inc. All rights reserved. What’s Driving Operations to the Cloud? ● Increased Agility: End-user self-service ● Elasticity: Optimize infrastructure usage ● Lower overall TCO ● Executive Mandate: Minimize on- premises datacenter footprint Big data deployments in cloud are accelerating; why?
  • 13. 13© Cloudera, Inc. All rights reserved. Advantages of Our Approach Cloud-Native & On-Premises Go Beyond SQL • Open Architecture: Open formats and open storage • Shared data across SQL and non-SQL workloads Data Flexibility • Faster, more agile data acquisition • Data portability: Open formats and open storage Cost-Effective Scalability • Elastic scale on-prem or in the cloud • Cloud-native pay-per-use and transience • Proven at big data scale Hybrid • Runs across multi-cloud & on-prem • Multi-storage over S3, HDFS, Kudu, Isilon, DSSD, etcShared Data
  • 14. 14© Cloudera, Inc. All rights reserved. Operational Database in the Cloud Public Cloud Benefits Cost Considerations • Low-cost backup and disaster recovery • Development and testing environments easy to deploy and decommission Convenience Considerations • Elastic growth for tightly provisioned workloads makes expansion easy, and enables a lower-cost steady state • Fast and easy provisioning of additional clusters helps projects move quickly
  • 15. 15© Cloudera, Inc. All rights reserved. Operational Database Cloud Architecture Applications Long lived Prod Cluster Operational DB Director Provision CM Manage/Provision Operational DB Temporary Dev/Test Cluster Burst Batch Processing Data copy for burst processing or read/write temp clusters 1 Easy Provisioning 2 Dev/Test Provisioning 3 Burst processing of large amounts of data 4 Low cost backups Data Sources Spark Streaming S31 EBS2 AWS Infrastructure Benefits 1. S3, Azure Blob Storage, etc. 2. EBS, Azure Premium Storage, etc.
  • 16. 16© Cloudera, Inc. All rights reserved. Easy Provisioning of New Clusters Operational Database in the Public Cloud Easy Provisioning Business Challenge • On-premises installations can be slow to roll out, particularly for PoC engagements with long procurement cycles • Some organizations can take 3-6 months to execute this process, slowing developers and creating anchors to legacy technology Cloud-Enabled Solution • Cloudera enables customers to go to the public cloud with industry- leading software • Workloads can be moved across clouds or to on premise clusters, preventing cloud lock-in Details • Cloudera provides the ability to quickly provision a new cluster for operational use cases in the public cloud • Rapid provisioning without the permanent cost of internal infrastructure helps make Cloudera a fast and easy choice for PoC’s • Companies with Cloudera-trained employees have the ability to test and prove/disprove new use cases quickly, delivering more value to business
  • 17. 17© Cloudera, Inc. All rights reserved. Easy Provisioning of New Clusters Operational Database in the Public Cloud Application Operational DB Cloud Instances Director Provision Cloud Storage Application Cloud Instances Direct Attached Storage Fast Cloud Storage1 Operational DB Instance Storage Director Provision 1. EBS, Azure Premium Storage, etc.
  • 18. 18© Cloudera, Inc. All rights reserved. Spark Streaming and Operational DB in the cloud For real-time processing and serving architectures Availability Zone Applications Ingest Streaming Data Spark Streaming running on a dedicated permanent cluster Spark Streaming Operational DB Operational DB on a dedicated permanent cluster Both clusters in the same availability zone Deliver/ Serve Data
  • 19. 19© Cloudera, Inc. All rights reserved. Creating Dev and Test Environments Operational Database in the Public Cloud Development and Testing Environments Business Challenge • Development and testing environments are expensive to maintain/configure, difficult to secure with real data, and have different projects competing for a finite pool of resources Cloud-Enabled Solution • Cloudera in the public cloud enables development and testing environments to be provisioned quickly, securely, and for the required period of time Details • Public cloud offerings with Cloudera enable the ability to easily and quickly replicate a production instance of data to a testing environment • Test environments can be configured with all the security of the production, without the risk of overloading critical infrastructure • Temporary instances mean environments are purpose-created, time-bound, and less competition for test/dev resources
  • 20. 20© Cloudera, Inc. All rights reserved. Creating Development and Testing Environments Fast, Easy, and Secure Development & Testing in the Cloud Production Ready Data Application Production Instance Delivers Data to Users and Applications Cloud Object Storage1 Fast Cloud Storage2 Develop- ment and Testing Production Environment Dev/Test Environment Secure Dev/Test Environment 1. S3, Azure Blob Storage, etc. 2. EBS, Azure Premium Storage, etc.
  • 21. 21© Cloudera, Inc. All rights reserved. Burst Provisioning for ETL Operational Database in the Public Cloud Leverage Cloud for ETL Business Challenge • ETL is a difficult process that consumes a large amount of resources and can create bottlenecks depending on the nature of batch processes or data spikes Cloud-Enabled Solution • Cloudera can leverage the elasticity of resources in the public cloud to help businesses handle large ETL jobs, regardless of whether they are anticipated or not Details • Unexpected surges in traffic, regular batch jobs that are growing in size, and other high-volume data ingestion issues can create bottlenecks that slow insight into the business; standard on- premises ETL may have a difficult time recovering from the surge, resulting in lost data • Public cloud instances of Cloudera enable additional ETL resources to be added temporarily, overcoming the deluge
  • 22. 22© Cloudera, Inc. All rights reserved. Burst Provisioning for ETL Keeping your Operational Database Real-Time in the Cloud Operational DBData Surge 1 Data Sources Cloud StorageData Pushed to Cloud Storage 2 Data to Cloud Instances for Batch Processing 3 AWS EBS Instances 4 Transformed Data Returned to Cloud Storage 5 Transformed Data Sent to HBase Application 6 Data Served to Application
  • 23. 23© Cloudera, Inc. All rights reserved. Backup and Disaster Recovery Operational Database in the Public Cloud Backup and Disaster Recovery Business Challenge • Businesses struggle to create backup of their critical data, including challenges with geographical dispersion, maintenance costs, frequency of backup, etc. Cloud-Enabled Solution • Public clouds offer the ability to take frequent snapshots of the data within your CDH cluster Details • By snapshotting to a remote public cloud datacenter, companies can take advantage of geographically dispersed data copies • Cheap storage enables more frequent backups, enabling a more recent copy of data to be recovered in case of problems • Navigate issues associated with data sovereignty
  • 24. 24© Cloudera, Inc. All rights reserved. Backup and Disaster Recovery Reduce Risk by Backing-Up Operational Database Data in the Public Cloud On-Premises Instance Cloud Object Storage1 Cloud Instance Snapshot of Data 1 Cloud Object Storage1 Restore 2 Snapshot of Data 1 Restore 2 1. S3, Azure Blob Storage, etc.
  • 25. 25© Cloudera, Inc. All rights reserved. CDH HBase on EBS vs. EMR HBase on S3 EMR and S3 options for HBase - Amazon can run HBase on S3 and get consistency via a proprietary EMR-FS connector - Cloudera leverages EBS for HBase cloud deployments Customer aim: performance and price - S3 saves on storage costs relative to EBS, but increases compute costs as you need more EC2 instances to get to the same metrics - So, for S3 storage costs go down, but compute costs go up; few use cases can combine low cost on both axises Qualitative customer concerns - From Cloudera: High availability, automated configuration, manageability (monitoring/alerts), support (from Cloudera’s HBase developers)
  • 26. 26© Cloudera, Inc. All rights reserved. Instance Recommendations Easy Provisioning of always on clusters: Model vCPU Mem(GiB) Storage(GB) d2.xlarge 4 30.5 3 x 2000 HDD d2.2xlarge 8 61 6 x 2000 HDD d2.4xlarge 16 122 12 x 2000 HDD d2.8xlarge 36 244 24 x 2000 HDD Data Nodes: Master Nodes: Model vCPU Mem(GiB) Storage(GB) c3.8xlarge 32 60 2 x 320 SSD Snapshot Backups: S3 Smaller versions are recommended to stagger the impact of full block reports and garbage collection. The master node memory should be sized inline with the cluster size, c3.xlarge supports very large cluster sizes but smaller master nodes are possible.
  • 27. 27© Cloudera, Inc. All rights reserved. Instance Recommendations Transient clusters with permanent storage using Director: Model vCPU Mem(GiB) Storage C4.large 4 30.5 EBS (4000 Mbps dedicated) Data Nodes: Master Nodes: Model vCPU Mem(GiB) Storage (GB) c3.8xlarge 32 60 2 x 320 SSD Storage, throughput workloads (ETL, etc.): Volume Type Volume Size IOPS Throughput st1 500 GiB – 16 TiB 500 800 MiB/s Volume Type Volume Size IOPS Throughput io1 4 GiB – 16 TiB 20,000 800 MiB/s Storage, real time workloads (HBase, etc.): These are default recommendations; they will vary based on the specifics of each use case. We recommend deploying more than the lowest tier, SC1, as the throttling limits are reached quickly, which brings down the database.
  • 28. 28© Cloudera, Inc. All rights reserved. Instance Recommendations Always on Spark Streaming cluster • Spark Clusters have homogenous nodes i.e. no special “master” node Model vCPU Mem(GiB) Storage m4.2xlarge 8 32 EBS (1000 Mbps dedicated) Default: Model vCPU Mem(GiB) Storage m3.2xlarge 8 61 160GB SSD Very Memory Intensive Workloads: Best balance of memory and compute Examples are workloads that cache RDDs/Dataframes or maintain in-memory state via the updateStateByKey(…) function. Model vCPU Mem(GiB) Storage c4.2xlarge 8 15 EBS (1000 Mbps dedicated) Very Compute Intensive Workloads: Examples are workloads that may perform compute intensive machine learning operations to score incoming events.
  • 29. 29© Cloudera, Inc. All rights reserved. • Get Started with Cloudera in the Cloud: • www.cloudera.com/downloads • Learn more about Cloudera’s Operational DB: • https://www.cloudera.com/solutions/operational-database.html • Learn about Data Engineering Workloads in the Cloud: • www.cloudera.com/about-cloudera/events/webinars/cloud-webinar-series.html Next Steps
  • 30. 30© Cloudera, Inc. All rights reserved. Thank You