Page1 © Hortonworks Inc. 2015
A First-Hand Look at What's New in HDP 2.3
Tim E. Hall
VP, Product Management
Hortonworks
June 2015
Page2 © Hortonworks Inc. 2015
Empowering More Organizations to
Drive Transformational Outcomes
Introducing Hortonworks® Data Platform 2.3
Page3 © Hortonworks Inc. 2015
Retailer builds 360° view of its customers
Challenges
• Cost: Data silos led to duplicate storage expenses
• Customer: Data fragmentation (with as many as 15 different records
on the same customer) harmed service quality
• Supply chain: Mismatch between inventory and store-specific
demand led to inefficient carrying costs
Results
• Cost: Data offload and consolidation saved millions
• Customer: Single view of customer personalized promotions
• Supply chain: A single view fed by 12 legacy systems improved
visibility and streamlined inventory management
• Pricing: Optimization added $80 million in top-line revenue
Page4 © Hortonworks Inc. 2015
Security company protects its customers from intrusions
Challenges
• Cost: Redundant storage systems cost many millions annually,
data retention limited to no more than two years
• Multi-tenancy: Unable to support simultaneous users with ad
hoc, data science and predictive analytics tasks
• Speed: Latencies created lags that attackers could exploit
Results
• Cost: Millions saved through elimination of redundant platforms
• Multi-tenancy: Concurrent jobs run in a private cloud
• Ingest: 105 million log events per minute
• Processing time: Time reduced from four hours to two seconds
• High availability: Zero downtime across rolling upgrades
“The recent transformation of
business and consumer
technologies has driven
pervasive mobility and an
explosion of data resulting in the
need for a new approach to
protecting devices, applications,
data and users.”
Company’s 2014 annual report
Page5 © Hortonworks Inc. 2015
New Capabilities in Hortonworks Data Platform 2.3
Breakthrough User
Experience
Dramatic Improvement in the User Experience
HDP 2.3 eliminates much of the complexity of administering
Hadoop and improves developer productivity.
Enhanced Security
and Governance
Enhanced Security and Data Governance
HDP 2.3 delivers new encryption of data at rest, and
extends the data governance initiative with Apache™ Atlas.
Proactive Support
Extending the Value of a Hortonworks Subscription
Hortonworks® SmartSense™ adds proactive cluster monitoring,
enhancing Hortonworks’ award-winning support in key areas.
Apache is a trademark of the Apache Software Foundation.
Page7 © Hortonworks Inc. 2015
Ambari Views Framework
Goal: enable the delivery of custom UI experiences in Ambari Web
Developers can extend the Ambari Web interface
• Views expose custom UI features for Hadoop Services
Ambari Admins can entitle Views to Ambari Web users
• Entitlements framework for controlling access to Views
Page8 © Hortonworks Inc. 2015
Views Framework vs. Views
• Views Framework: core to Ambari
• Views: built by Hortonworks, community and partners
Page10 © Hortonworks Inc. 2015
View Components
• Serve client-side assets (such as HTML + JavaScript)
• Expose server-side resources (such as REST endpoints)
[Diagram: a View's client-side assets (.js, .html) run in Ambari Web; its server-side resources (Java) run in Ambari Server and use REST to reach Hadoop and other systems]
Page11 © Hortonworks Inc. 2015
View Delivery
1. Develop the View (just like you would for a Web App)
2. Package as a View (basically a WAR)
3. Deploy the View into Ambari
4. Ambari Admins create and configure view instance(s) and give
access to users and groups
Develop → Package → Deploy → Create Instance(s)
Page12 © Hortonworks Inc. 2015
Versions and Instances
• Deploy multiple versions and create multiple instances of a view
• Manage accessibility and usage
Page13 © Hortonworks Inc. 2015
Choice of Deployment Model
• For Hadoop Operators:
Deploy Views in an Ambari Server that is managing a Hadoop cluster
• For Data Workers:
Run Views in a “standalone” Ambari Server
[Diagram: two Ambari Servers alongside Hadoop (Store & Process)]
• Operators manage the cluster and may have Views deployed
• Data Workers use the cluster and use a “standalone” Ambari Server for Views
Page14 © Hortonworks Inc. 2015
Improved Ease of Use for the Hadoop Operator
Responsibilities include:
• Deploying Hadoop® clusters
• Managing cluster health
• Troubleshooting and resolving issues
Hadoop Operator
Simpler administration speeds
time to value
Easy Setup and Installation
Streamlined configuration experience
Customizable Dashboards
Track cluster health with KPIs and drill downs
Easier Provisioning and Faster Cluster
Formation
Cloudbreak simplifies provisioning. Ambari speeds
cluster formation with automated host discovery.
Page15 © Hortonworks Inc. 2015
Hadoop Operator
New guided configurations
ease cluster setup
Page16 © Hortonworks Inc. 2015
Ease installation and
configuration for HDFS,
YARN, Hive and HBase
Makes Key Configs Visible
Clearly displays the set of options
Recommends Settings
Suggests optimal ranges
Highlights Dependencies
Lets you visualize any impact on
dependent services
Hadoop Operator
Page17 © Hortonworks Inc. 2015
Hadoop Operator
Fully customizable
dashboard shows cluster
KPIs
Page18 © Hortonworks Inc. 2015
System Administrator
Hadoop operators can
configure dashboards to
show KPIs
Out-of-box Templates
Based on common best practices
Personalized Experience
Create new display widgets built from
Hadoop metrics. Add or remove
existing widgets.
Reusable and Shareable
Widget library allows other operators
to re-use community widgets
Page19 © Hortonworks Inc. 2015
Demo: Operations
Page20 © Hortonworks Inc. 2015
Host discovery makes cluster
expansion automatic, fast,
orderly and predictable
Faster
Expand clusters incrementally and automatically
as each new node becomes available
Easier
Pre-plan automatic expansion paths
Flexible for Cloud or On-premises
Discover hosts wherever they are
Ambari
Hadoop Operator
Host Discovery Eases Cluster Formation
Page21 © Hortonworks Inc. 2015
Learn More about Ambari
Thursday, 3:10-3:50 – What’s New in Apache Ambari
with Sumit Mohanty & Yusaku Sako
Page22 © Hortonworks Inc. 2015
Launch HDP on Leading Cloud Platforms
BI / Analytics
(Hive)
IoT Apps
(Storm, HBase, Hive)
Dev / Test
(all HDP services)
Data Science
(Spark)
Cloudbreak
1. Pick a Blueprint
2. Choose a Cloud
3. Launch HDP!
Example Ambari Blueprints:
IoT Apps, BI / Analytics, Data Science,
Dev / Test
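As a rough illustration of what "pick a Blueprint" involves, the sketch below builds a document in the general shape of an Ambari Blueprint: a `Blueprints` header naming the stack, plus a list of host groups with their components. The blueprint name, the host-group layout and the chosen components are assumptions made for this example, not the blueprints shipped with Cloudbreak.

```python
import json


def make_blueprint(name, stack_version, host_groups):
    """Build a dict in the general shape of an Ambari Blueprint document.

    host_groups maps a group name to (cardinality, [component names]).
    """
    return {
        "Blueprints": {
            "blueprint_name": name,
            "stack_name": "HDP",
            "stack_version": stack_version,
        },
        "host_groups": [
            {
                "name": group_name,
                "cardinality": str(cardinality),
                "components": [{"name": c} for c in components],
            }
            for group_name, (cardinality, components) in host_groups.items()
        ],
    }


# Hypothetical "Data Science" layout: one master host, three workers.
data_science = make_blueprint(
    "data-science-example",
    "2.3",
    {
        "master": (1, ["NAMENODE", "RESOURCEMANAGER", "SPARK_JOBHISTORYSERVER"]),
        "worker": (3, ["DATANODE", "NODEMANAGER", "SPARK_CLIENT"]),
    },
)

print(json.dumps(data_science, indent=2))
```

Because the blueprint only describes host groups and components, not concrete machines, the same document can in principle be reused across cloud providers.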
Page24 © Hortonworks Inc. 2015
Cloudbreak automates provisioning and
scaling clusters in the cloud
Hadoop Operator
Page25 © Hortonworks Inc. 2015
Hadoop Operator
Cloudbreak automates cluster
provisioning and scaling for the cloud in
only 3 steps
Page26 © Hortonworks Inc. 2015
Step 1: Choose your cloud provider –
Microsoft Azure, Amazon AWS, Google
Cloud Platform or OpenStack
Page27 © Hortonworks Inc. 2015
Step 2: Enter your cloud credentials
Page28 © Hortonworks Inc. 2015
Step 3: Pick Your Ambari Blueprint
Page29 © Hortonworks Inc. 2015
Cloudbreak provides feedback while
cluster creation is in progress
Page30 © Hortonworks Inc. 2015
Turn on auto-scaling and
set SLA policies
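The slide does not show how a scaling policy is evaluated, so the following is only a minimal sketch of the general idea: a policy watches a metric, and when a threshold is crossed the node count is adjusted within fixed bounds. The `ScalingPolicy` class, its fields and the `pending_containers` metric are hypothetical names for illustration and are not Cloudbreak's API.

```python
from dataclasses import dataclass


@dataclass
class ScalingPolicy:
    """Hypothetical policy: adjust node count when a metric crosses a threshold."""
    metric: str
    threshold: float
    adjustment: int   # positive = scale out, negative = scale in
    min_nodes: int
    max_nodes: int


def desired_node_count(current_nodes, metrics, policies):
    """Return the node count after applying the first policy that is breached."""
    for p in policies:
        value = metrics.get(p.metric)
        if value is None:
            continue
        # Scale-out policies fire above the threshold, scale-in policies below it.
        breached = value > p.threshold if p.adjustment > 0 else value < p.threshold
        if breached:
            return max(p.min_nodes, min(p.max_nodes, current_nodes + p.adjustment))
    return current_nodes


policies = [
    ScalingPolicy("pending_containers", 50, +2, min_nodes=3, max_nodes=10),
    ScalingPolicy("pending_containers", 5, -1, min_nodes=3, max_nodes=10),
]

print(desired_node_count(4, {"pending_containers": 80}, policies))  # scale out
print(desired_node_count(4, {"pending_containers": 2}, policies))   # scale in
```

Clamping to `min_nodes` and `max_nodes` is one simple way to keep automatic scaling within a budget; a real policy engine would also need cooldown periods to avoid oscillation.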
Page31 © Hortonworks Inc. 2015
Hadoop Operator
Leverage re-usable
blueprints to provision
HDP in any environment
Public or Private Clouds
Dynamically set up public or private
cloud clusters from the web console
Automated Scaling
Manage elasticity requirements as
cluster demands grow
Choice of Many Clouds
Supports Microsoft Azure, AWS,
Google and OpenStack clouds
Page32 © Hortonworks Inc. 2015
Learn More about Cloudbreak
Wednesday, 2:35-3:15 – One-click Hadoop Clusters - anywhere (using Docker)
with Janos Matyas
Page33 © Hortonworks Inc. 2015
Preview URL: launch.hortonworks.com
Launch an HDP cluster
with only a few clicks
Easy Setup
With the leading public cloud
platforms: Microsoft Azure, AWS and
Google Cloud
Easy Exploration
Try out the latest features in HDP
Your Data
Use the newest cluster technologies
with your own familiar dataset
Page34 © Hortonworks Inc. 2015
Advances for the Developer
Responsibilities include:
• Developing SQL queries
• Developing new Spark applications
• Implementing streaming data analytics
Developer
Develop Hadoop applications with
ease and speed
Visualization of SQL Queries
Streamlined user interface for Apache Hive
Improvements to Apache Spark on YARN
Machine Learning, Data Frame API, New SQL (Preview)
Enterprise Enhancements for Streaming
Fault tolerance, security, and rolling upgrades for
Apache Kafka and Apache Storm
Page35 © Hortonworks Inc. 2015
Enhanced SQL Semantics and New SQL User View
The rich developer experience includes enhanced
SQL semantics and a new user interface
Enhanced SQL Semantics
Supports interval types in expressions and adds UNION
SQL User View in Ambari
Write, debug and run Hive SQL queries
Performance Improvements
2.5x performance gain
Query Scheduling
Dynamically share resources for Hive queries
[Graphic: HDP architecture layers: Governance, Security, Operations, Resource Management (YARN: Data Operating System), Storage]
Page36 © Hortonworks Inc. 2015
Developer
New user interface enables fast &
easy SQL definition and execution.
Page37 © Hortonworks Inc. 2015
New capabilities add dynamic access methods
to feature-rich Spark applications
Data Frame API
Enables common and easy interchange between Spark
components for data imports and exports
Machine Learning
Introduces multiclass classification, clustering,
frequent pattern-mining algorithms
Enterprise-Ready
Consistent operations, comprehensive security,
deployable anywhere
Spark SQL
[Tech Preview] A new module for structured data processing
in Spark
Improvements for Apache Spark on YARN
Page38 © Hortonworks Inc. 2015
Stream analysis, scalable across the cluster
Nimbus High Availability
No single point of failure for stream processing job
management
Ease of Deployment
Quickly create stream processing pipelines
Rolling Upgrades
Update Storm to newer versions, with zero downtime
Enhanced Security for Kafka
Authorization via Ranger and authentication via Kerberos
Streaming Analysis Ready for Mainstream Adoption
Page39 © Hortonworks Inc. 2015
Demo: Developer
Page41 © Hortonworks Inc. 2015
HDP Security: Comprehensive, Complete, Extensible
Security in HDP is the most comprehensive, complete and extensible for Hadoop
Administration
Central management and consistent security
Only HDP delivers a single administrative console to
set policy across the entire cluster
Authentication
Authenticate users and systems
Authentication for perimeter and cluster; integrates with existing
ActiveDirectory and LDAP solutions
Authorization
Provision access to data
Provides consistent authorization controls across all
Apache components within HDP
Audit
Maintain a record of data access
Maintains a record of data access events across all
components that is consistent and accessible
Data Protection
Protect data at rest and in motion
Encrypts data in motion and data at rest; refers to partner
encryption solutions for broader needs
Page42 © Hortonworks Inc. 2015
Enhanced Security Capabilities in HDP 2.3
Administration: central management and consistent security (project: Ranger)
• Administer Kafka, Solr and multi-tenant YARN queues
• Support for custom plugins via Ranger and Knox stacks
Authentication: authenticate users and systems (project: Knox)
• Bi-directional SSL support establishes trust between clients and servers
• LDAP data caching reduces server load, improves performance
Authorization: provision access to data (project: Ranger)
• Authorization for Kafka, Solr and multi-tenant YARN queues
• Hooks for dynamic policy rules (e.g., by geo-location)
Audit: maintain a record of data access (project: Atlas)
• Scalable metadata service
• Hive integration leverages existing metadata
• UI: Hive table lineage and domain-specific search
Data Protection: protect data at rest and in motion (projects: HDFS, Ranger)
• HDFS transparent data encryption (for data at rest)
• Key management store (KMS) that’s robust and highly available
Page43 © Hortonworks Inc. 2015
Demo: Security
Page44 © Hortonworks Inc. 2015
Learn More about Ranger
Thursday, 3:10-3:50 – Securing Hadoop with Apache Ranger: Strategies and
Best Practices
with Selvamohan Neethiraj & Velmurugan Periasamy
Page45 © Hortonworks Inc. 2015
Extending Data Governance to Hadoop
[Diagram labels: ETL / DQ, MDM, Archive; Traditional Data Systems]
Data Governance Requirements
Transparent
Governance standards and
protocols must be clearly defined
and available to all
Reproducible
Recreate the relevant data
landscape at a given point in time
Auditable
Trace all relevant events and assets
with appropriate historical lineage
Consistent
Compliance practices must be
consistent
Hadoop Data
Platform
Must snap into existing
data governance
frameworks and openly
exchange metadata
A group of companies dedicated to
meeting these requirements in the open
[Diagram labels: SCM, CRM, ERP; Holistic Data Governance; Business Analytics; Visualization & Dashboards]
Page46 © Hortonworks Inc. 2015
Apache Atlas Is Now Included in HDP
[Diagram: Apache Atlas components: Knowledge Store (Type-System, Models, Taxonomies, Policy Rules), Audit Store, Tag-based Policies, Data Lifecycle Management, Real-time Tag-based Access Control, REST API, and Services (Search, Lineage, Exchange)]
[Diagram: industry vocabularies: Healthcare (HIPAA, HL7), Financial (SOX, Dodd-Frank), Energy (PPDM), Retail (PCI, PII), Other (CWM)]
Scalable Metadata Service
Agile Centralized Taxonomy – Enterprise/Business unit level
modeling with industry-specific vocabulary
Operational Metadata – Extend visibility into HDFS Path,
Hive DB, table, columns
REST API – Modern, flexible access to Atlas services
Hive Integration
Hive Metadata – Leverage existing metadata with import /
export capability and capture SQL runtime metrics directly
User Interface
Hive Table Lineage and Search DSL – Support for key word,
faceted and free text searches
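To make the idea of tag-based policies more concrete, here is a minimal sketch of how access might be decided from tags attached to data assets rather than from the assets' names. The function, the tag-to-group mapping and the sample column names are all hypothetical; this is not the Atlas or Ranger API, and the "untagged assets are allowed" default is an assumption of the sketch.

```python
def is_allowed(user_groups, asset_tags, tag_policies):
    """Allow access only if the user satisfies the policy of every tag on the asset.

    tag_policies maps a tag to the set of groups permitted to read assets
    carrying that tag. Tags without a policy impose no restriction here.
    """
    for tag in asset_tags:
        allowed_groups = tag_policies.get(tag)
        if allowed_groups is not None and not (set(user_groups) & allowed_groups):
            return False
    return True


# Hypothetical policies and assets, using the retail tags from the slide.
tag_policies = {
    "PII": {"compliance"},
    "PCI": {"payments", "compliance"},
}
assets = {
    "sales.customers.email": {"PII"},
    "sales.orders.card_number": {"PCI", "PII"},
    "sales.orders.total": set(),
}

for column, tags in assets.items():
    print(column, is_allowed({"payments"}, tags, tag_policies))
```

The appeal of this design is that a policy written once for a tag such as PII applies to every asset that later receives the tag, with no per-table rule to maintain.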
Page48 © Hortonworks Inc. 2015
HDP Subscriptions Deliver
Global support coverage, 24x7x365
Hortonworks University self-paced learning
Premier Support: designated support engineer
Influence on the direction of the technology
The Hadoop Industry’s Best Subscription Value
[Chart: number of Hortonworks Support tickets across project phases (Architecture & Development, Implementation, Production, Expansion), repeated for Project 2, Project 3, ... Project N]
From Architecture to Expansion
“Hortonworks loves
and lives open-source
innovation”
Page49 © Hortonworks Inc. 2015
Hortonworks® SmartSense™ provides
comprehensive visibility into cluster issues
Hadoop Operator
Page50 © Hortonworks Inc. 2015
Hortonworks® SmartSense™ makes
tailored recommendations based on
analysis of operational data
Hadoop Operator
Page51 © Hortonworks Inc. 2015
Hortonworks® SmartSense™ solicits
feedback from Hadoop Operators to
optimize its recommendations
Hadoop Operator
Page52 © Hortonworks Inc. 2015
Hadoop Operator
Hortonworks®
SmartSense™ enhances
the support subscription
Faster Case Resolution
Easily capture log files and metrics for
insight and resolution
Proactive Configuration
Via intelligent stream of cluster
analytics and data-driven
recommendations
Capacity Planning
Through proactive view into
customer’s cluster utilization
Page53 © Hortonworks Inc. 2015
Hortonworks® SmartSense™ Resolves Issues Proactively
Integrated Customer Portal
Knowledge Base
On-Demand
Training
Customer Environment
• Any cloud
• Hybrid environment
• Multi-tenant
“5 out of 5” Enterprise Hadoop Support
Connection to the customer’s environment
via telephone or web support
Page55 © Hortonworks Inc. 2015
In Summary: New in HDP
Breakthrough User
Experience
Enhanced Security
and Governance
Proactive Support
HDP 2.3 is a Major Step Forward for
Open Enterprise Hadoop®
Page56 © Hortonworks Inc. 2015
Thank You. Questions?

Contenu connexe

Tendances

Hortonworks Technical Workshop: Apache Ambari
Hortonworks Technical Workshop:   Apache AmbariHortonworks Technical Workshop:   Apache Ambari
Hortonworks Technical Workshop: Apache AmbariHortonworks
 
What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4DataWorks Summit
 
Apache Ambari - What's New in 2.2
 Apache Ambari - What's New in 2.2 Apache Ambari - What's New in 2.2
Apache Ambari - What's New in 2.2Hortonworks
 
Hortonworks Technical Workshop: HBase For Mission Critical Applications
Hortonworks Technical Workshop: HBase For Mission Critical ApplicationsHortonworks Technical Workshop: HBase For Mission Critical Applications
Hortonworks Technical Workshop: HBase For Mission Critical ApplicationsHortonworks
 
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters Hortonworks
 
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseDouble Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseHortonworks
 
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...DataWorks Summit
 
Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureApache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureHortonworks
 
Deploying Docker applications on YARN via Slider
Deploying Docker applications on YARN via SliderDeploying Docker applications on YARN via Slider
Deploying Docker applications on YARN via SliderHortonworks
 
Accelerating query processing
Accelerating query processingAccelerating query processing
Accelerating query processingDataWorks Summit
 
Discover HDP2.1: Apache Storm for Stream Data Processing in Hadoop
Discover HDP2.1: Apache Storm for Stream Data Processing in HadoopDiscover HDP2.1: Apache Storm for Stream Data Processing in Hadoop
Discover HDP2.1: Apache Storm for Stream Data Processing in HadoopHortonworks
 
Running a container cloud on YARN
Running a container cloud on YARNRunning a container cloud on YARN
Running a container cloud on YARNDataWorks Summit
 
How to Achieve a Self-Service and Secure Multitenant Data Lake in a Large Com...
How to Achieve a Self-Service and Secure Multitenant Data Lake in a Large Com...How to Achieve a Self-Service and Secure Multitenant Data Lake in a Large Com...
How to Achieve a Self-Service and Secure Multitenant Data Lake in a Large Com...DataWorks Summit
 
Apache NiFi Toronto Meetup
Apache NiFi Toronto MeetupApache NiFi Toronto Meetup
Apache NiFi Toronto MeetupHortonworks
 
Improvements in Hadoop Security
Improvements in Hadoop SecurityImprovements in Hadoop Security
Improvements in Hadoop SecurityDataWorks Summit
 
Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4 Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4 Hortonworks
 
Curb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure ClusterCurb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure Clusterahortonworks
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...DataWorks Summit
 
Hadoop crashcourse v3
Hadoop crashcourse v3Hadoop crashcourse v3
Hadoop crashcourse v3Hortonworks
 

Tendances (20)

Hortonworks Technical Workshop: Apache Ambari
Hortonworks Technical Workshop:   Apache AmbariHortonworks Technical Workshop:   Apache Ambari
Hortonworks Technical Workshop: Apache Ambari
 
What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4
 
Apache Ambari - What's New in 2.2
 Apache Ambari - What's New in 2.2 Apache Ambari - What's New in 2.2
Apache Ambari - What's New in 2.2
 
Hortonworks Technical Workshop: HBase For Mission Critical Applications
Hortonworks Technical Workshop: HBase For Mission Critical ApplicationsHortonworks Technical Workshop: HBase For Mission Critical Applications
Hortonworks Technical Workshop: HBase For Mission Critical Applications
 
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
 
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseDouble Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSense
 
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
 
Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureApache Ambari: Past, Present, Future
Apache Ambari: Past, Present, Future
 
Deploying Docker applications on YARN via Slider
Deploying Docker applications on YARN via SliderDeploying Docker applications on YARN via Slider
Deploying Docker applications on YARN via Slider
 
Accelerating query processing
Accelerating query processingAccelerating query processing
Accelerating query processing
 
Discover HDP2.1: Apache Storm for Stream Data Processing in Hadoop
Discover HDP2.1: Apache Storm for Stream Data Processing in HadoopDiscover HDP2.1: Apache Storm for Stream Data Processing in Hadoop
Discover HDP2.1: Apache Storm for Stream Data Processing in Hadoop
 
Running a container cloud on YARN
Running a container cloud on YARNRunning a container cloud on YARN
Running a container cloud on YARN
 
How to Achieve a Self-Service and Secure Multitenant Data Lake in a Large Com...
How to Achieve a Self-Service and Secure Multitenant Data Lake in a Large Com...How to Achieve a Self-Service and Secure Multitenant Data Lake in a Large Com...
How to Achieve a Self-Service and Secure Multitenant Data Lake in a Large Com...
 
Apache NiFi Toronto Meetup
Apache NiFi Toronto MeetupApache NiFi Toronto Meetup
Apache NiFi Toronto Meetup
 
Kafka Security
Kafka SecurityKafka Security
Kafka Security
 
Improvements in Hadoop Security
Improvements in Hadoop SecurityImprovements in Hadoop Security
Improvements in Hadoop Security
 
Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4 Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4
 
Curb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure ClusterCurb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure Cluster
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...
 
Hadoop crashcourse v3
Hadoop crashcourse v3Hadoop crashcourse v3
Hadoop crashcourse v3
 

En vedette

Using Data to Predict and Shape Consumer Behavior with Brian Reich
Using Data to Predict and Shape Consumer Behavior with Brian ReichUsing Data to Predict and Shape Consumer Behavior with Brian Reich
Using Data to Predict and Shape Consumer Behavior with Brian ReichCDS Global, Inc.
 
Google Webmaster Tools Webinar
Google Webmaster Tools WebinarGoogle Webmaster Tools Webinar
Google Webmaster Tools WebinarFluid
 
Using an Evidence-Based Learning Strategy to Win Hearts and Minds
Using an Evidence-Based Learning Strategy to Win Hearts and MindsUsing an Evidence-Based Learning Strategy to Win Hearts and Minds
Using an Evidence-Based Learning Strategy to Win Hearts and MindsLaura Overton
 
Conquering The Digital Shift With Data
Conquering The Digital Shift With DataConquering The Digital Shift With Data
Conquering The Digital Shift With DataAT Internet
 
Driving A Data-Centric Culture: The Leadership Challenge
Driving A Data-Centric Culture: The Leadership ChallengeDriving A Data-Centric Culture: The Leadership Challenge
Driving A Data-Centric Culture: The Leadership ChallengePlatfora
 
Living Leadership: Designing flexible leadership development experiences for ...
Living Leadership: Designing flexible leadership development experiences for ...Living Leadership: Designing flexible leadership development experiences for ...
Living Leadership: Designing flexible leadership development experiences for ...College of DuPage
 
The State of Payments 2015 by PayFort - ArabNet Digital Summit 2015
The State of Payments 2015 by PayFort - ArabNet Digital Summit 2015The State of Payments 2015 by PayFort - ArabNet Digital Summit 2015
The State of Payments 2015 by PayFort - ArabNet Digital Summit 2015ArabNet ME
 
Big Data in Banking (White paper)
Big Data in Banking (White paper)Big Data in Banking (White paper)
Big Data in Banking (White paper)InData Labs
 
Definitions for Real World of Big Data Marketing
Definitions for Real World of Big Data MarketingDefinitions for Real World of Big Data Marketing
Definitions for Real World of Big Data MarketingWiley
 
Gartner MQ for Web App Firewall Webinar
Gartner MQ for Web App Firewall WebinarGartner MQ for Web App Firewall Webinar
Gartner MQ for Web App Firewall WebinarImperva
 
Marketing Analytics to Prove Your ROI
Marketing Analytics to Prove Your ROIMarketing Analytics to Prove Your ROI
Marketing Analytics to Prove Your ROIMarketo
 
Presenting Data Webinar
Presenting Data WebinarPresenting Data Webinar
Presenting Data WebinarGavin McMahon
 
Investor Pitches That Win - Sol Marketing, Austin, TX
Investor Pitches That Win - Sol Marketing, Austin, TX Investor Pitches That Win - Sol Marketing, Austin, TX
Investor Pitches That Win - Sol Marketing, Austin, TX Deb Gabor
 
Big Data Marketing - What You Need To Know
Big Data Marketing - What You Need To KnowBig Data Marketing - What You Need To Know
Big Data Marketing - What You Need To KnowMBA & Company
 
State of Payments 2016
State of Payments 2016State of Payments 2016
State of Payments 2016PAYFORT
 
Growing Your Business Through Experimentation
Growing Your Business Through ExperimentationGrowing Your Business Through Experimentation
Growing Your Business Through ExperimentationHiten Shah
 
Q4 2016 Investor Presentation
 Q4 2016 Investor Presentation Q4 2016 Investor Presentation
Q4 2016 Investor PresentationNEXONIR
 
A NEW ERA OF NONTRADITIONAL NURSING ROLES
A NEW ERA OF NONTRADITIONAL NURSING ROLESA NEW ERA OF NONTRADITIONAL NURSING ROLES
A NEW ERA OF NONTRADITIONAL NURSING ROLESKelly Services
 
Always Be Testing - Learn from Every A/B Test (Hiten Shah)
Always Be Testing - Learn from Every A/B Test (Hiten Shah)Always Be Testing - Learn from Every A/B Test (Hiten Shah)
Always Be Testing - Learn from Every A/B Test (Hiten Shah)Future Insights
 

En vedette (20)

Using Data to Predict and Shape Consumer Behavior with Brian Reich
A First-Hand Look at What's New in HDP 2.3

  • 1. A First-Hand Look at What's New in HDP 2.3
    Tim E. Hall, VP, Product Management, Hortonworks
    June 2015
    © Hortonworks Inc. 2015
  • 2. Empowering More Organizations to Drive Transformational Outcomes
    Introducing Hortonworks® Data Platform 2.3
  • 3. Retailer builds 360° view of its customers
    Challenges
    • Cost: Data silos led to duplicate storage expenses
    • Customer: Data fragmentation (with as many as 15 different records on the same customer) harmed service quality
    • Supply chain: Mismatch between inventory and store-specific demand led to inefficient carrying costs
    Results
    • Cost: Data offload and consolidation saved millions
    • Customer: Single view of customer personalized promotions
    • Supply chain: A single view fed by 12 legacy systems improved visibility and streamlined inventory management
    • Pricing: Optimization added $80 million in top-line revenue
  • 4. Security company protects its customers from intrusions
    Challenges
    • Cost: Redundant storage systems cost many millions annually, data retention limited to no more than two years
    • Multi-tenancy: Unable to support simultaneous users with ad hoc, data science and predictive analytics tasks
    • Speed: Latencies created lags that attackers could exploit
    Results
    • Cost: Millions saved through elimination of redundant platforms
    • Multi-tenancy: Concurrent jobs run in a private cloud
    • Ingest: 105 million log events per minute
    • Processing time: Time reduced from four hours to two seconds
    • High availability: Zero downtime across rolling upgrades
    “The recent transformation of business and consumer technologies has driven pervasive mobility and an explosion of data resulting in the need for a new approach to protecting devices, applications, data and users.” (Company’s 2014 annual report)
  • 5. New Capabilities in Hortonworks Data Platform 2.3
    • Breakthrough User Experience: Dramatic Improvement in the User Experience. HDP 2.3 eliminates much of the complexity of administering Hadoop and improves developer productivity.
    • Enhanced Security and Governance: Enhanced Security and Data Governance. HDP 2.3 delivers new encryption of data at rest, and extends the data governance initiative with Apache™ Atlas.
    • Proactive Support: Extending the Value of a Hortonworks Subscription. Hortonworks® SmartSense™ adds proactive cluster monitoring, enhancing Hortonworks’ award-winning support in key areas.
    Apache is a trademark of the Apache Software Foundation.
  • 6. New Capabilities in HDP 2.3
    • Breakthrough User Experience: Dramatic Improvement in the User Experience. HDP 2.3 eliminates much of the complexity of administering Hadoop and improves developer productivity.
    • Enhanced Security and Governance: Enhanced Security and Data Governance. HDP 2.3 delivers new encryption of data at rest, and extends the data governance initiative with Apache™ Atlas.
    • Proactive Support: Extending the Value of a Hortonworks Subscription. Hortonworks® SmartSense™ adds proactive cluster monitoring, enhancing Hortonworks’ award-winning support in key areas.
  • 7. Page7 © Hortonworks Inc. 2015 Ambari Views Framework Goal: enable the delivery of custom UI experiences in Ambari Web Developers can extend the Ambari Web interface • Views expose custom UI features for Hadoop Services Ambari Admins can entitle Views to Ambari Web users • Entitlements framework for controlling access to Views
  • 8. Page8 © Hortonworks Inc. 2015 Views Framework Views Framework vs. Views Views Core to Ambari Built by Hortonworks, Community, Partners
  • 9. Page9 © Hortonworks Inc. 2015 Views Framework Views Framework vs. Views Views Core to Ambari Built by Hortonworks, Community, Partners
  • 10. Page10 © Hortonworks Inc. 2015 View Components • Serve client-side assets (such as HTML + JavaScript) • Expose server-side resources (such as REST endpoints) VIEW Client-side assets (.js, html) AMBARI WEB VIEW Server-side resources (java) AMBARI SERVER {rest} Hadoop and other systems
  • 11. Page11 © Hortonworks Inc. 2015 View Delivery 1. Develop the View (just like you would for a Web App) 2. Package as a View (basically a WAR) 3. Deploy the View into Ambari 4. Ambari Admins create + configure view instance(s) and give access to users + groups Develop, Package, Deploy, Create Instance(s)
  • 12. Page12 © Hortonworks Inc. 2015 Versions and Instances • Deploy multiple versions and create multiple instances of a view • Manage accessibility and usage
  • 13. Page13 © Hortonworks Inc. 2015 Choice of Deployment Model • For Hadoop Operators: Deploy Views in an Ambari Server that is managing a Hadoop cluster • For Data Workers: Run Views in a “standalone” Ambari Server Ambari Server HADOOP Store & Process Ambari Server Operators manage the cluster, may have Views deployed Data Workers use the cluster and use a “standalone” Ambari Server for Views
  • 14. Page14 © Hortonworks Inc. 2015 Improved Ease of Use for the Hadoop Operator Responsibilities include: • Deploying Hadoop® clusters • Managing cluster health • Troubleshooting and resolving issues Hadoop Operator Simpler administration speeds time to value Easy Setup and Installation Streamlined configuration experience Customizable Dashboards Track cluster health with KPIs and drill downs Easier Provisioning and Faster Cluster Formation Cloudbreak simplifies provisioning. Ambari speeds cluster formation with automated host discovery.
  • 15. Page15 © Hortonworks Inc. 2015 Hadoop Operator New guided configurations ease cluster setup
  • 16. Page16 © Hortonworks Inc. 2015 Ease installation and configuration for HDFS, YARN, Hive and HBase Makes Key Configs Visible Clearly displays the set of options Recommends Settings Suggests optimal ranges Highlights Dependencies Lets you visualize any impact on dependent services Hadoop Operator
  • 17. Page17 © Hortonworks Inc. 2015 Hadoop Operator Fully customizable dashboard shows cluster KPIs
  • 18. Page18 © Hortonworks Inc. 2015 System Administrator Hadoop operators can configure dashboards to show KPIs Out-of-box Templates Based on common best practices Personalized Experience Create new display widgets built from Hadoop metrics. Add or remove existing widgets. Reusable and Shareable Widget library allows other operators to re-use community widgets
  • 19. Page19 © Hortonworks Inc. 2015 Demo: Operations
  • 20. Page20 © Hortonworks Inc. 2015 Host discovery makes cluster expansion automatic, fast, orderly and predictable Faster Expand clusters incrementally and automatically as each new node becomes available Easier Pre-plan automatic expansion paths Flexible for Cloud or On-premises Discover hosts wherever they are Ambari Hadoop Operator Host Discovery Eases Cluster Formation
  • 21. Page21 © Hortonworks Inc. 2015 Learn More about Ambari Thursday, 3:10-3:50 – What’s New in Apache Ambari with Sumit Mohanty & Yusaku Sako
  • 22. Page22 © Hortonworks Inc. 2015 Launch HDP on Leading Cloud Platforms BI / Analytics (Hive) IoT Apps (Storm, HBase, Hive) Dev / Test (all HDP services) Data Science (Spark) Cloudbreak 1. Pick a Blueprint 2. Choose a Cloud 3. Launch HDP! Example Ambari Blueprints: IoT Apps, BI / Analytics, Data Science, Dev / Test
  • 23. Page23 © Hortonworks Inc. 2015 BI / Analytics (Hive) IoT Apps (Storm, HBase, Hive) Launch HDP on Any Cloud for Any Application Dev / Test (all HDP services) Data Science (Spark) Cloudbreak 1. Pick a Blueprint 2. Choose a Cloud 3. Launch HDP! Example Ambari Blueprints: IoT Apps, BI / Analytics, Data Science, Dev / Test
  • 24. Page24 © Hortonworks Inc. 2015 Cloudbreak automates provisioning and scaling clusters in the cloud Hadoop Operator
  • 25. Page25 © Hortonworks Inc. 2015 Hadoop Operator Cloudbreak automates cluster provisioning and scaling for the cloud in only 3 steps
  • 26. Page26 © Hortonworks Inc. 2015 Step 1: Choose your cloud provider – Microsoft Azure, Amazon AWS, Google Cloud Platform or OpenStack
  • 27. Page27 © Hortonworks Inc. 2015 Step 2: Enter your cloud credentials
  • 28. Page28 © Hortonworks Inc. 2015 Step 3: Pick Your Ambari Blueprint
  • 29. Page29 © Hortonworks Inc. 2015 Cloudbreak provides feedback while cluster creation is in progress
  • 30. Page30 © Hortonworks Inc. 2015 Turn on auto-scaling and set SLA policies
  • 31. Page31 © Hortonworks Inc. 2015 Hadoop Operator Leverage re-usable blueprints to provision HDP in any environment Public or Private Clouds Dynamically set up public or private cloud clusters from the web console Automated Scaling Manage elasticity requirements as cluster demands grow Choice of Many Clouds Supports Microsoft Azure, AWS, Google and OpenStack clouds
  • 32. Page32 © Hortonworks Inc. 2015 Learn More about Cloudbreak Wednesday, 2:35-3:15 – One-click Hadoop Clusters - anywhere (using Docker) with Janos Matyas
  • 33. Page33 © Hortonworks Inc. 2015 Preview URL: launch.hortonworks.com Launch an HDP cluster with only a few clicks Easy Setup With the leading public cloud platforms: Microsoft Azure, AWS and Google Cloud Easy Exploration Try out the latest features in HDP Your Data Use the newest cluster technologies with your own familiar dataset
  • 34. Page34 © Hortonworks Inc. 2015 Advances for the Developer Responsibilities include: • Developing SQL queries • Developing new Spark applications • Implementing streaming data analytics Developer Develop Hadoop applications with ease and speed Visualization of SQL Queries Streamlined user interface for Apache Hive Improvements to Apache Spark on YARN Machine Learning, Data Frame API, New SQL (Preview) Enterprise Enhancements for Streaming Fault tolerance, security, and rolling upgrades for Apache Kafka and Apache Storm
  • 35. Page35 © Hortonworks Inc. 2015 Enhanced SQL Semantics and New SQL User View The rich developer experience includes enhanced SQL semantics and a new user interface Enhanced SQL Semantics Include interval types in expressions and added UNION SQL User View in Ambari Write, debug and run Hive SQL queries Performance Improvements 2.5x performance gain Query Scheduling Dynamically share resources for Hive queries Storage YARN: Data Operating System Governance Security Operations Resource Management
  • 36. Page36 © Hortonworks Inc. 2015 Developer New user interface enables fast & easy SQL definition and execution.
  • 37. Page37 © Hortonworks Inc. 2015 New capabilities add dynamic access methods to feature-rich Spark applications Data Frame API Enables common and easy interchange between Spark components for data imports and exports Machine Learning Introduces multiclass classification, clustering, frequent pattern-mining algorithms Enterprise-Ready Consistent operations, comprehensive security, deployable anywhere Spark SQL [Tech Preview] A new module for structured data processing in Spark Improvements for Apache Spark on YARN Storage YARN: Data Operating System Governance Security Operations Resource Management
  • 38. Page38 © Hortonworks Inc. 2015 Stream analysis, scalable across the cluster Nimbus High Availability No single point of failure for stream processing job management Ease of Deployment Quickly create stream processing pipelines Rolling Upgrades Update Storm to newer versions, with zero downtime Enhanced Security for Kafka Authorization via Ranger and authentication via Kerberos Streaming Analysis Ready for Mainstream Adoption Storage YARN: Data Operating System Governance Security Operations Resource Management
  • 39. Page39 © Hortonworks Inc. 2015 Demo: Developer
  • 40. Page40 © Hortonworks Inc. 2015 New Capabilities in HDP 2.3 Breakthrough User Experience Dramatic Improvement in the User Experience HDP 2.3 eliminates much of the complexity of administering Hadoop and improves developer productivity. Enhanced Security and Governance Enhanced Security and Data Governance HDP 2.3 delivers new encryption of data at rest, and extends the data governance initiative with Apache™ Atlas. Proactive Support Extending the Value of a Hortonworks Subscription Hortonworks® SmartSense™ adds proactive cluster monitoring, enhancing Hortonworks’ award-winning support in key areas.
  • 41. Page41 © Hortonworks Inc. 2015 HDP Security: Comprehensive, Complete, Extensible Security in HDP is the most comprehensive, complete and extensible for Hadoop Administration Central management and consistent security Only HDP delivers a single administrative console to set policy across the entire cluster Authentication Authenticate users and systems Authentication for perimeter and cluster; integrates with existing Active Directory and LDAP solutions Authorization Provision access to data Provides consistent authorization controls across all Apache components within HDP Audit Maintain a record of data access Maintains a record of data access events across all components that is consistent and accessible Data Protection Protect data at rest and in motion Encrypts data in motion and data at rest; refer to partner encryption solutions for broader needs
  • 42. Page42 © Hortonworks Inc. 2015 Enhanced Security Capabilities in HDP 2.3 Project New Features Administration Central management and consistent security Ranger • Administer Kafka, Solr and multi-tenant YARN queues • Support for custom plugins via Ranger and Knox stacks Authentication Authenticate users and systems Knox • Bi-directional SSL support: trust between clients and servers • LDAP data caching reduces server load, improves performance Authorization Provision access to data Ranger • Authorization for Kafka, Solr and multi-tenant YARN queues • Hooks for dynamic policy rules (e.g., by geo-location) Audit Maintain a record of data access Atlas • Scalable metadata service • Hive integration leverages existing metadata • UI: Hive table lineage and domain-specific search Data Protection Protect data at rest and in motion HDFS, Ranger • HDFS transparent data encryption (for data at rest) • Key management store (KMS) that’s robust and highly available
  • 43. Page43 © Hortonworks Inc. 2015 Demo: Security
  • 44. Page44 © Hortonworks Inc. 2015 Learn More about Ranger Thursday, 3:10-3:50 – Securing Hadoop with Apache Ranger: Strategies and Best Practices with Selvamohan Neethiraj & Velmurugan Periasamy
  • 45. Page45 © Hortonworks Inc. 2015 Extending Data Governance to Hadoop ETL / DQ MDM ARCHIVE Traditional Data Systems Data Governance Requirements Transparent Governance standards and protocols must be clearly defined and available to all Reproducible Recreate the relevant data landscape at a given point in time Auditable Trace all relevant events and assets with appropriate historical lineage Consistent Compliance practices must be consistent Hadoop Data Platform Must snap into existing data governance frameworks and openly exchange metadata A group of companies dedicated to meeting these requirements in the open SCM CRM ERP Holistic Data Governance Business Analytics Visualization & Dashboards
  • 46. Page46 © Hortonworks Inc. 2015 Apache Atlas Is Now Included in HDP Apache Atlas Knowledge Store Audit Store Models Type-System Policy Rules Taxonomies Tag-based Policies Data Lifecycle Management Real-time Tag-based Access Control REST API Services Search Lineage Exchange Healthcare HIPAA HL7 Financial SOX Dodd-Frank Energy PPDM Retail PCI PII Other CWM Scalable Metadata Service Agile Centralized Taxonomy – Enterprise/Business unit level modeling with industry-specific vocabulary Operational Metadata – Extend visibility into HDFS Path, Hive DB, table, columns REST API – Modern, flexible access to Atlas services Hive Integration Hive Metadata – Leverage existing metadata with import / export capability and capture SQL runtime metrics directly User Interface Hive Table Lineage and Search DSL – Support for key word, faceted and free text searches
  • 47. Page47 © Hortonworks Inc. 2015 New Capabilities in HDP 2.3 Breakthrough User Experience Dramatic Improvement in the User Experience HDP 2.3 eliminates much of the complexity of administering Hadoop and improves developer productivity. Enhanced Security and Governance Enhanced Security and Data Governance HDP 2.3 delivers new encryption of data at rest, and extends the data governance initiative with Apache™ Atlas. Proactive Support Extending the Value of a Hortonworks Subscription Hortonworks® SmartSense™ adds proactive cluster monitoring, enhancing Hortonworks’ award-winning support in key areas.
  • 48. Page48 © Hortonworks Inc. 2015 HDP Subscriptions Deliver Global support coverage, 24x7x365 Hortonworks University self-paced learning Premier Support: designated support engineer Influence on the direction of the technology The Hadoop Industry’s Best Subscription Value Expansion Architecture & Development Production Implementation Hortonworks Support # tickets Project 2 Project 3 Project N . . . From Architecture to Expansion “Hortonworks loves and lives open-source innovation”
  • 49. Page49 © Hortonworks Inc. 2015 Hortonworks® SmartSense™ provides comprehensive visibility into cluster issues Hadoop Operator
  • 50. Page50 © Hortonworks Inc. 2015 Hortonworks® SmartSense™ makes tailored recommendations based on analysis of operational data Hadoop Operator
  • 51. Page51 © Hortonworks Inc. 2015 Hortonworks® SmartSense™ solicits feedback from Hadoop Operators to optimize its recommendations Hadoop Operator
  • 52. Page52 © Hortonworks Inc. 2015 Hadoop Operator Hortonworks® SmartSense™ enhances the support subscription Faster Case Resolution Easily capture log files and metrics for insight and resolution Proactive Configuration Via intelligent stream of cluster analytics and data-driven recommendations Capacity Planning Through proactive view into customer’s cluster utilization
  • 53. Page53 © Hortonworks Inc. 2015 Hortonworks® SmartSense™ Resolves Issues Proactively Integrated Customer Portal Knowledge Base On-Demand Training Customer Environment • Any cloud • Hybrid environment • Multi-tenant “5 out of 5” Enterprise Hadoop Support Connection to the customer’s environment via telephone or web support
  • 54. Page54 © Hortonworks Inc. 2015 Hortonworks SmartSense Hortonworks® SmartSense™ Resolves Issues Proactively Integrated Customer Portal Knowledge Base On-Demand Training Customer Environment • Any cloud • Hybrid environment • Multi-tenant “5 out of 5” Enterprise Hadoop Support
  • 55. Page55 © Hortonworks Inc. 2015 In Summary: New in HDP Breakthrough User Experience Enhanced Security and Governance Proactive Support HDP 2.3 is a Major Step Forward for Open Enterprise Hadoop®
  • 56. Page56 © Hortonworks Inc. 2015 Thank You. Questions?
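
Slides 22 and 28 refer to picking an Ambari Blueprint before launching a cluster. As a rough illustration of what such a blueprint contains, here is a minimal Python sketch that assembles one as JSON. The helper name `make_blueprint`, the blueprint name and the host-group layout are assumptions made for this example and do not come from the deck; the result is deliberately incomplete (it omits services such as ZooKeeper) and is not a deployable cluster definition.

```python
import json


def make_blueprint(name, stack_version, host_groups):
    """Build a minimal Ambari Blueprint document as a plain dict.

    host_groups maps a group name to (cardinality, [component names]).
    This is a hypothetical helper written for illustration only.
    """
    return {
        "Blueprints": {
            "blueprint_name": name,
            "stack_name": "HDP",
            "stack_version": stack_version,
        },
        "host_groups": [
            {
                "name": group,
                "cardinality": str(cardinality),
                "components": [{"name": c} for c in components],
            }
            for group, (cardinality, components) in host_groups.items()
        ],
    }


# Illustrative "Data Science (Spark)" layout; the component split is an assumption.
blueprint = make_blueprint(
    "data-science",
    "2.3",
    {
        "master": (1, ["NAMENODE", "RESOURCEMANAGER", "SPARK_JOBHISTORYSERVER"]),
        "worker": (3, ["DATANODE", "NODEMANAGER", "SPARK_CLIENT"]),
    },
)
print(json.dumps(blueprint, indent=2))
```

Keeping the blueprint as data rather than as a sequence of manual steps is what makes it reusable: the same document can, in principle, be submitted for different clouds or environments, with only the host mapping changing.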

Editor's notes

  1. TYPE OF ANALYSIS: SQL QUERIES WITH HIVE + A MAJOR HOME IMPROVEMENT RETAILER + SINGLE VIEW OF ITS CUSTOMERS – “THE GOLDEN RECORD” + SINGLE VIEW OF INVENTORY FOR SUPPLY CHAIN OPTIMIZATION + AND ALSO A SINGLE VIEW OF ITS OWN AND COMPETITORS’ PRICES + LOW COST OF STORAGE = MORE DATA RETAINED FOR LONGER + LONGER RETENTION POWER = MULTIPLE “SINGLE VIEWS”
  2. TYPE OF ANALYSIS: STREAM ANALYSIS A MAJOR PROVIDER OF DIGITAL SECURITY SOLUTIONS CUT ITS TIME PROCESSING THE THREAT LANDSCAPE FROM FOUR HOURS TO 2 SECONDS, WHICH DRAMATICALLY REDUCED THEIR CLIENTS’ WINDOW OF VULNERABILITY STATS + PROCESSES 105 MILLION LOG EVENTS PER MINUTE
  3. Dynamic availability
  4. A data governance framework in any organization comprises a combination of people, process and technology that are in place to establish decision rights and accountabilities for information. A governance policy defines who can take what actions with what information, and when, under what circumstances, using what methods. The technology goals for a data governance framework are to provide a platform for a common approach across all systems and data within the organization. Explicitly, they need to be: - Transparent: Governance standards & protocols must be clearly defined and available to all - Reproducible: Recreate the relevant data landscape at a point in time - Auditable: All relevant events and assets must be traceable with appropriate historical lineage - Consistent: Compliance practices must be consistent
  5. Apache Atlas is the only open source project created to solve the governance challenge in the open. The founding members of the project include all the members of the data governance initiative and others from the Hadoop community. The core functionality defined by the project includes the following: Data Classification – create an understanding of the data within Hadoop and provide a classification of this data to external and internal sources Centralized Auditing – provide a framework to capture and report on access to and modifications of data within Hadoop Search & Lineage – allow pre-defined and ad hoc exploration of data and metadata while maintaining a history of how a data source or explicit data was constructed. Security and Policy Engine – implement engines to protect and rationalize data access according to compliance policy.
  6. You should be hiring people focused on your unique data and application needs, not support engineers focused on the complicated internals of the data platform. Many users who started out self-supporting have ultimately come to us to support the platform so they can focus on their application and business needs. We enable HDP in the market through three types of offerings: 1) software support subscriptions, 2) expert consulting services, and 3) training. Our primary focus is on our annual support subscriptions for HDP which provide the 24x7 support enterprises expect along with patches, updates, hot fixes, etc. that help keep their mission critical workloads running. Since we have the most committers working on the dozens of open source projects, we’re uniquely able to: -- Define and deliver an enterprise-focused roadmap for Enterprise Hadoop -- Provide customers and partners a direct way to engage with the community to affect that roadmap (you can think of us as the product management function for these projects) -- And finally, we ensure the patches and updates we make available to our customers are applied to the corresponding open source projects so there are no regressions in future releases of those open source components. To net out: we enable customer success by listening to their needs and driving innovation into HDP. And our open source model provides the leverage to evolve the technology faster than any single vendor could accomplish alone.