SlideShare une entreprise Scribd logo
1  sur  54
© 2014 MapR Techno©lo 2g0ie1s4 MapR Technologies 
Getting Real With Hadoop 
Jim Scott, Director, Enterprise Strategy & Architecture 
@kingmesal #BigDataEverywhere #Chicago - October 1st, 2014
© 2014 MapR Technologies
© 2014 MapR Technologies
© 2014 MapR Technologies
© 2014 MapR Technologies
© 2014 MapR Technologies
© 2014 MapR Technologies 6 
Can’t We All Just Get Along?
© 2014 MapR Technologies 7 
We Have All Contributed…
The Reality is 
Architecture Matters 
8
© 2014 MapR Technologies 9 
High Availability (HA) Everywhere 
No NameNode architecture 
MapReduce/YARN HA 
NFS HA 
Instant recovery 
Rolling upgrades 
HA is built in 
• Distributed metadata can self-heal 
• No practical limit on # of files 
• Jobs are not impacted by failures 
• Meet your data processing SLAs 
• High throughput and resilience for NFS-based data 
ingestion, import/export and multi-client access 
• Files and tables are accessible within seconds of a node 
failure or cluster restart 
• Upgrade the software with no downtime 
• No special configuration to enable HA 
• All MapR customers operate with HA
© 2014 MapR Technologies
RDBMS Hammer 
© 2014 MapR Technologies 11
© 2014 MapR Technologies 12
Hadoop Hammer 
© 2014 MapR Technologies 13
© 2014 MapR Technologies 
Data Everywhere! 
Social Media 
Messages 
Audio 
Sensors 
Mobile Data 
Email 
Clickstream
Friends don’t let friends 
© 2014 MapR Technologies 
run name nodes.
© 2014 MapR Technologies 16 
Too Many Files!
Friends don’t let friends 
© 2014 MapR Technologies 
run name nodes.
© 2014 MapR Technologies 18 
Volumes 
100K volumes are OK, 
create as many as needed 
Volumes dramatically simplify 
management: 
• Replication factor 
• Scheduled mirroring 
• Scheduled snapshots 
• Data placement control 
• User access and tracking 
• Administrative permissions 
/projects 
/tahoe 
/yosemite 
/user 
/msmith 
/bjohnson
© 2014 MapR Technologies 19 
MapR M7: The Best In-Hadoop Database 
MapR-DB 
 NoSQL Columnar Store 
 Apache HBase API 
 Integrated with Hadoop 
HBase 
JVM 
HDFS 
JVM 
ext3/ext4 
Disks 
Other Distros 
Tables/Files 
Disks 
MapR Enterprise Database Edition (M7) 
The most scalable, enterprise-grade, 
NoSQL database that supports online applications and analytics
Easy Administration 
© 2014 MapR Technologies 20 
Tradeoffs with Other NoSQL Solutions 
Reliability 
24x7 applications with strong 
data consistency 
Performance 
Continuous low latency with 
horizontal scaling 
Easy day-to-day management 
with minimal learning curve
© 2014 MapR Technologies 21 
Consistent, Low Read Latency 
--- M7 Read Latency --- Others Read Latency
MapR Integrates Security into Hadoop 
© 2014 MapR Technologies 
MapR Integrates Security into Hadoop
© 2014 MapR Technologies 23 
Hadoop Security 
Authorization to 
ensure the right 
access to files 
and databases 
Authentication 
for users and 
user-created job 
requests 
Encryption to 
ensure user 
credentials and 
data are always 
secure 
Integration with 
existing security 
infrastructure
© 2014 MapR Technologies 24 
Fine-Grained Access Control 
Full POSIX permissions on files and directories 
ACLs on tables, column families and columns 
ACLs on MapReduce jobs and queues 
Administration ACLs on cluster and volumes 
ACLs for Apache Hive, Apache Drill and Impala
Seamless Integration with Direct Access NFS 
© 2014 MapR Technologies 25 
• MapR is POSIX compliant 
– Random reads/writes 
– Simultaneous reading and writing to a file 
– Compression is automatic and transparent
Seamless Integration with Direct Access NFS 
© 2014 MapR Technologies 26 
• MapR is POSIX compliant 
– Random reads/writes 
– Simultaneous reading and writing to a file 
– Compression is automatic and transparent 
• Industry-standard NFS interface (in 
addition to HDFS API) 
– Stream data into the cluster 
– Leverage thousands of tools and 
applications 
– Easier to use non-Java programming 
languages 
– No need for most proprietary Hadoop 
connectors
© 2014 MapR Technologies 27 
Disaster Recovery: Mirroring 
• Flexible 
– Choose the volumes/directories to mirror 
– You don’t need to mirror the entire cluster 
– Active/active 
• Fast 
– No performance impact 
– Block-level (8KB) deltas 
– Automatic compression 
Production Research 
Production 
WAN 
Datacenter 1 Datacenter 2 
WAN EC2
© 2014 MapR Technologies 28 
Disaster Recovery: Mirroring 
• Flexible 
– Choose the volumes/directories to mirror 
– You don’t need to mirror the entire cluster 
– Active/active 
• Fast 
– No performance impact 
– Block-level (8KB) deltas 
– Automatic compression 
• Safe 
– Point-in-time consistency 
– End-to-end checksums 
• Easy 
– Graceful handling of network issues 
– No third-party software 
– Takes less than two minutes to configure! 
Production Research 
Production 
WAN 
Datacenter 1 Datacenter 2 
WAN EC2
MapR Advantages 
MapR-DB Others 
99.999% uptime ✓ X 
Instant recovery from failures ✓ X 
Continuous low latency (no compactions) ✓ X 
© 2014 MapR Technologies 29 
Zero administration 
(no processes to manage, self-tuning) 
✓ X 
Online data protection (snapshots, mirroring) ✓ X 
Scalability (number of tables supported) Trillion Hundreds
Packages Supported by various distributions 
Red – lacking 
Blue - leading 
© 2014 MapR Technologies 30 
MapR 4.0.1 
(Sep 2014) 
Cloudera 5.1.2 
(Aug 2014) 
Hortonworks 2.1.5 
(Aug 2014) 
Apache Versions 
(Sep 12th, 2014) 
Core Hadoop Hadoop Core, YARN 2.4.1 2.3.0 2.4.0 2.5.1 
Batch Map Reduce MRv1 and MRv2 MRv1 or MRv2 MRv2 MRv2 
Hive 0.12, 0.13 0.12 0.13 0.13 
Tez 0.4 (Dev Preview Only) X 0.4 0.5 
Pig 0.12 0.12 0.12 0.12 
Cascading 2.1.6 X X 2.5 
Spark 0.9.2, 1.0.2 1.0.0 1.0.1 (Tech Preview only) 1.1 
Interactive SQL Impala 1.2.3 1.4 X 1.4 
Drill 0.5 X X 0.5 
SparkSQL 1.0.2 X 1.0.1 (Tech Preview only) 1.1 
NoSQL and Search HBase/NoSQL 0.94.2, 0.98.4, MapR-DB 0.98 0.98, Accumulo 1.5.1 HBase 0.98 
Phoenix X X 4.0.0 4.1.0 
AsyncHBase 1.5 X X 1.5 
Search LW (Solr) 2.6.1 , 2.7 Cloudera Search 1.5 X NA 
Machine Learning and 
Graph 
Mahout 0.9 0.9 0.9 0.9 
MLLib/MLBase 0.9.2, 1.0.2 1.0.0 1.0.1 (Tech Preview only) 1.1 
GraphX 0.9.2, 1.0.2 1.0.0 1.0.1 (Tech Preview only) 1.1 
Streaming/Messaging Spark Streaming 0.9.2, 1.0.2 1.0.0 1.0.1 (Tech Preview only) 1.1 
Storm 0.9, 0.9.2 (Certified) X 0.9.1 0.9.2 
Kafka X X 0.8.1.1 (Tech Preview) 0.8.1.1 
Data Integration Sqoop, Sqoop2 1.4.4, 1.99.3 1.4.4, 1.99.3 1.4.4 1.4.5 
Flume 1.5.0 1.5.0 1.4.0 1.5.0 
Knox X X 0.4 0.4 
Coordination Oozie 4.0.1 4.0.0 4.0.0 4.0.1 
Zookeeper 3.4.5 3.4.5 3.4.5 3.4.5 
GUI, Configuration, 
Monitoring 
Management MCS CM Ambari Ambari 
Hue 3.5 3.6 2.5.1 3.6 
http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH5/latest/CDH-Version-and-Packaging-Information/cdhvd_cdh_package_tarball.html?scroll=topic_3_unique_8 
http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1.5/bk_releasenotes_hdp_2.1/content/ch_relnotes-hdp-2.1.5-product.html
© 2014 MapR Technologies 
Pick the 
Right Tool 
for the Job
Provisioning 
& 
coordination 
Savannah* 
Workflow 
& Data 
Governance 
MapR Distribution for Apache Hadoop 
Data 
Integration 
& Access 
Hue 
HttpFS 
Flume Knox* Falcon* Whirr 
© 2014 MapR Technologies 32 
APACHE HADOOP AND OSS ECOSYSTEM 
Security 
SQL 
Drill 
SparkSQL 
Impala 
YARN 
Batch 
Spark 
Cascading 
Pig 
Streaming 
Storm* 
Spark 
Streaming 
NoSQL & 
Search 
Solr 
HBase 
Juju 
ML, Graph 
GraphX 
MLLib 
Mahout 
MapReduce 
v1 & v2 
EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS 
Tez* 
Accumulo* 
Hive 
Sqoop Sentry* Oozie ZooKeeper 
* Certification/support planned for 2014 
Management 
MapR Data Platform
Provisioning 
& 
coordination 
Savannah* 
Workflow 
& Data 
Governance 
Data 
Integration 
& Access 
Hue 
HttpFS 
Flume Knox* Falcon* Whirr 
NFS HDFS API HBase API JSON API 
© 2014 MapR Technologies 33 
APACHE HADOOP AND OSS ECOSYSTEM 
Security 
SQL 
Drill 
SparkSQL 
Impala 
YARN 
Batch 
Spark 
Cascading 
Pig 
Streaming 
Storm* 
Spark 
Streaming 
NoSQL & 
Search 
Solr 
HBase 
Juju 
ML, Graph 
GraphX 
MLLib 
Mahout 
MapReduce 
v1 & v2 
EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS 
Tez* 
Accumulo* 
Hive 
Sqoop Sentry* Oozie ZooKeeper 
MapR Control System 
(Management and Monitoring) 
* Certification/support planned for 2014 
CLI REST API GUI 
MapR Distribution for Apache Hadoop
© 2014 MapR Technologies 
1.65TB 
WITH 298 SERVERS
© 2014 MapR Technologies 35 
1/7th the Hardware Footprint
Forrester Wave™: Big Data Hadoop Solutions, Q1‘14 
February 2014 “The Forrester Wave™: Big Data Hadoop Solutions, Q1 2014” 
© 2014 MapR Technologies 36
© 2014 MapR Technologies
• Pioneering Data Agility for Hadoop 
• Apache open source project 
• Scale-out execution engine for low-latency queries 
• Unified SQL-based API for analytics & operational applications 
© 2014 MapR Technologies 38 
APACHE DRILL 
40+ contributors 
150+ years of experience building 
databases and distributed systems
Drill Supports Schema Discovery On-The-Fly 
Schema Declared In Advance Schema Discovered On-The-Fly 
Schema Schema2 The-Fly 
© 2014 MapR Technologies 39 
• Fixed schema 
• Leverage schema in centralized 
repository (Hive Metastore) 
• Fixed schema, evolving schema or 
schema-less 
• Leverage schema in centralized 
repository or self-describing data 
SCHEMA ON 
WRITE 
SCHEMA 
BEFORE READ 
SCHEMA ON THE 
FLY
© 2014 © 201 M4 aMpaRp RTe Tcehcnhonloogloiegsies 40 
Operational Analytics
© 2014 MapR Technologies 41 
Must Be Able to Scale
© 2014 MapR Technologies 42 
Mobile 
application server 
Real-time ad 
targeting 
Data exploration 
(SQL) 
Real-time and Operational 
Actionable 
Analytics 
Hadoop (MapR M7) 
•User profiles and state 
•User interactions 
•Real-time location data 
•Web and mobile session state 
•Comments/rankings 
Web 
application server 
Customer 360 
dashboard 
Churn analysis 
(predictive analytics) 
Product/service 
optimization and 
personalization
© 2014 MapR Technologies 43 
General Application Monitoring
© 2014 MapR Technologies 44 
Hard Drive Failure Rates
© 2014 MapR Technologies 45 
Recommendation Engines
© 2014 MapR Technologies 46 
20M 
SONGS 
Media Content Recommendation Engine
© 2014 MapR Technologies 
Fraud Detection
© 2014 MapR Technologies 48 
104M 
CARD MEMBERS 
Offer Serving, Credit Risk & Fraud 
More than $600B+
100M 
Data Points 
per second 
Fastest Data Ingest Rates 
© 2014 PEOPLE MapR Technologies 49
© 2014 MapR Technologies 50 
Speed and Intelligence…
Forrester Wave™: NoSQL Key-Value Databases, Q3‘14 
September 2014 “The Forrester Wave™: NoSQL Key-Value Databases, Q3 2014” 
© 2014 MapR Technologies 51
© 2014 MapR Technologies 52 
MapR Editions 
 Control System 
 NFS Access 
 Performance 
 Unlimited Nodes 
 Free 
 All the Features of M5 
 Simplified Administration 
for HBase 
 Increased Performance 
 Consistent Low Latency 
 Unified Snapshots, 
Mirroring 
 Control System 
 NFS Access 
 Performance 
 High Availability 
 Snapshots & Mirroring 
 24 X 7 Support 
 Annual Subscription 
Fastest On-Ramp: 
MapR Sandbox for Hadoop
© 2014 MapR Technologies 
Engage with us! 
@mapr maprtech 
jscott@mapr.com 
MapR 
maprtech 
mapr-technologies

Contenu connexe

Tendances

Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016Adam Doyle
 
Best Practices for Protecting Sensitive Data Across the Big Data Platform
Best Practices for Protecting Sensitive Data Across the Big Data PlatformBest Practices for Protecting Sensitive Data Across the Big Data Platform
Best Practices for Protecting Sensitive Data Across the Big Data PlatformMapR Technologies
 
Hadoop 2 - More than MapReduce
Hadoop 2 - More than MapReduceHadoop 2 - More than MapReduce
Hadoop 2 - More than MapReduceUwe Printz
 
Dchug m7-30 apr2013
Dchug m7-30 apr2013Dchug m7-30 apr2013
Dchug m7-30 apr2013jdfiori
 
20150314 sahara intro and the future plan for open stack meetup
20150314 sahara intro and the future plan for open stack meetup20150314 sahara intro and the future plan for open stack meetup
20150314 sahara intro and the future plan for open stack meetupWei Ting Chen
 
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And CloudYARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And CloudDataWorks Summit
 
Rich placement constraints: Who said YARN cannot schedule services?
Rich placement constraints: Who said YARN cannot schedule services?Rich placement constraints: Who said YARN cannot schedule services?
Rich placement constraints: Who said YARN cannot schedule services?DataWorks Summit
 
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARNDeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARNDataWorks Summit
 
How YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopHow YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopPOSSCON
 
Hadoop 2 - Going beyond MapReduce
Hadoop 2 - Going beyond MapReduceHadoop 2 - Going beyond MapReduce
Hadoop 2 - Going beyond MapReduceUwe Printz
 
NoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DBNoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DBMapR Technologies
 
Hadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise HadoopHadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise HadoopYifeng Jiang
 
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...StampedeCon
 
Dawn of YARN @ Rocket Fuel
Dawn of YARN @ Rocket FuelDawn of YARN @ Rocket Fuel
Dawn of YARN @ Rocket FuelDataWorks Summit
 
Architecting a Fraud Detection Application with Hadoop
Architecting a Fraud Detection Application with HadoopArchitecting a Fraud Detection Application with Hadoop
Architecting a Fraud Detection Application with HadoopDataWorks Summit
 
Hadoop: Past, Present and Future - v2.2 - SQLSaturday #326 - Tampa BA Edition
Hadoop: Past, Present and Future - v2.2 - SQLSaturday #326 - Tampa BA EditionHadoop: Past, Present and Future - v2.2 - SQLSaturday #326 - Tampa BA Edition
Hadoop: Past, Present and Future - v2.2 - SQLSaturday #326 - Tampa BA EditionBig Data Joe™ Rossi
 

Tendances (20)

Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016
 
MapR 5.2 Product Update
MapR 5.2 Product UpdateMapR 5.2 Product Update
MapR 5.2 Product Update
 
Hive Now Sparks
Hive Now SparksHive Now Sparks
Hive Now Sparks
 
Best Practices for Protecting Sensitive Data Across the Big Data Platform
Best Practices for Protecting Sensitive Data Across the Big Data PlatformBest Practices for Protecting Sensitive Data Across the Big Data Platform
Best Practices for Protecting Sensitive Data Across the Big Data Platform
 
Hadoop 2 - More than MapReduce
Hadoop 2 - More than MapReduceHadoop 2 - More than MapReduce
Hadoop 2 - More than MapReduce
 
Dchug m7-30 apr2013
Dchug m7-30 apr2013Dchug m7-30 apr2013
Dchug m7-30 apr2013
 
20150314 sahara intro and the future plan for open stack meetup
20150314 sahara intro and the future plan for open stack meetup20150314 sahara intro and the future plan for open stack meetup
20150314 sahara intro and the future plan for open stack meetup
 
Apache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduceApache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduce
 
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And CloudYARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
 
Rich placement constraints: Who said YARN cannot schedule services?
Rich placement constraints: Who said YARN cannot schedule services?Rich placement constraints: Who said YARN cannot schedule services?
Rich placement constraints: Who said YARN cannot schedule services?
 
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARNDeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
 
How YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopHow YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in Hadoop
 
Hadoop 2 - Going beyond MapReduce
Hadoop 2 - Going beyond MapReduceHadoop 2 - Going beyond MapReduce
Hadoop 2 - Going beyond MapReduce
 
NoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DBNoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DB
 
Hadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise HadoopHadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise Hadoop
 
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
 
Dawn of YARN @ Rocket Fuel
Dawn of YARN @ Rocket FuelDawn of YARN @ Rocket Fuel
Dawn of YARN @ Rocket Fuel
 
Architecting a Fraud Detection Application with Hadoop
Architecting a Fraud Detection Application with HadoopArchitecting a Fraud Detection Application with Hadoop
Architecting a Fraud Detection Application with Hadoop
 
IoT:what about data storage?
IoT:what about data storage?IoT:what about data storage?
IoT:what about data storage?
 
Hadoop: Past, Present and Future - v2.2 - SQLSaturday #326 - Tampa BA Edition
Hadoop: Past, Present and Future - v2.2 - SQLSaturday #326 - Tampa BA EditionHadoop: Past, Present and Future - v2.2 - SQLSaturday #326 - Tampa BA Edition
Hadoop: Past, Present and Future - v2.2 - SQLSaturday #326 - Tampa BA Edition
 

En vedette

Mc kinsey big_data_full_report
Mc kinsey big_data_full_reportMc kinsey big_data_full_report
Mc kinsey big_data_full_reportJyrki Määttä
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...BigDataEverywhere
 
The Evolution of Data and New Opportunities for Analytics
The Evolution of Data and New Opportunities for AnalyticsThe Evolution of Data and New Opportunities for Analytics
The Evolution of Data and New Opportunities for AnalyticsSAS Canada
 
Open Source Engineering V2
Open Source Engineering V2Open Source Engineering V2
Open Source Engineering V2YoungSu Son
 
Data: Open for Good and Secure by Default | Eddie Garcia
Data: Open for Good and Secure by Default | Eddie GarciaData: Open for Good and Secure by Default | Eddie Garcia
Data: Open for Good and Secure by Default | Eddie GarciaCloudera, Inc.
 
Data, data, everywhere… - SEE UK - 2016
Data, data, everywhere… - SEE UK - 2016Data, data, everywhere… - SEE UK - 2016
Data, data, everywhere… - SEE UK - 2016TOPdesk
 
Big Data Insights & Opportunities
Big Data Insights & OpportunitiesBig Data Insights & Opportunities
Big Data Insights & OpportunitiesCompTIA
 
101 Marketing Charts
101 Marketing Charts101 Marketing Charts
101 Marketing ChartsHubSpot
 
Big Data Day LA 2016/ Use Case Driven track - Reliable Media Reporting in an ...
Big Data Day LA 2016/ Use Case Driven track - Reliable Media Reporting in an ...Big Data Day LA 2016/ Use Case Driven track - Reliable Media Reporting in an ...
Big Data Day LA 2016/ Use Case Driven track - Reliable Media Reporting in an ...Data Con LA
 
MixTaiwan 20170104-趨勢-陳昇瑋-從資料科學到人工智慧
MixTaiwan 20170104-趨勢-陳昇瑋-從資料科學到人工智慧MixTaiwan 20170104-趨勢-陳昇瑋-從資料科學到人工智慧
MixTaiwan 20170104-趨勢-陳昇瑋-從資料科學到人工智慧Mix Taiwan
 
Big Data Industry Insights 2015
Big Data Industry Insights 2015 Big Data Industry Insights 2015
Big Data Industry Insights 2015 Den Reymer
 
2016 CIO Agenda
2016 CIO Agenda2016 CIO Agenda
2016 CIO AgendaDen Reymer
 
Gartner: Top 10 Strategic Technology Trends 2016
Gartner: Top 10 Strategic Technology Trends 2016Gartner: Top 10 Strategic Technology Trends 2016
Gartner: Top 10 Strategic Technology Trends 2016Den Reymer
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBernard Marr
 
Gartner TOP 10 Strategic Technology Trends 2017
Gartner TOP 10 Strategic Technology Trends 2017Gartner TOP 10 Strategic Technology Trends 2017
Gartner TOP 10 Strategic Technology Trends 2017Den Reymer
 

En vedette (20)

Mc kinsey big_data_full_report
Mc kinsey big_data_full_reportMc kinsey big_data_full_report
Mc kinsey big_data_full_report
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
 
The Evolution of Data and New Opportunities for Analytics
The Evolution of Data and New Opportunities for AnalyticsThe Evolution of Data and New Opportunities for Analytics
The Evolution of Data and New Opportunities for Analytics
 
Open Source Engineering V2
Open Source Engineering V2Open Source Engineering V2
Open Source Engineering V2
 
Analytics3.0 e book
Analytics3.0 e bookAnalytics3.0 e book
Analytics3.0 e book
 
Data: Open for Good and Secure by Default | Eddie Garcia
Data: Open for Good and Secure by Default | Eddie GarciaData: Open for Good and Secure by Default | Eddie Garcia
Data: Open for Good and Secure by Default | Eddie Garcia
 
Data, data, everywhere… - SEE UK - 2016
Data, data, everywhere… - SEE UK - 2016Data, data, everywhere… - SEE UK - 2016
Data, data, everywhere… - SEE UK - 2016
 
Big Data Insights & Opportunities
Big Data Insights & OpportunitiesBig Data Insights & Opportunities
Big Data Insights & Opportunities
 
101 Marketing Charts
101 Marketing Charts101 Marketing Charts
101 Marketing Charts
 
Big Data Day LA 2016/ Use Case Driven track - Reliable Media Reporting in an ...
Big Data Day LA 2016/ Use Case Driven track - Reliable Media Reporting in an ...Big Data Day LA 2016/ Use Case Driven track - Reliable Media Reporting in an ...
Big Data Day LA 2016/ Use Case Driven track - Reliable Media Reporting in an ...
 
Banking Operations
Banking Operations Banking Operations
Banking Operations
 
MixTaiwan 20170104-趨勢-陳昇瑋-從資料科學到人工智慧
MixTaiwan 20170104-趨勢-陳昇瑋-從資料科學到人工智慧MixTaiwan 20170104-趨勢-陳昇瑋-從資料科學到人工智慧
MixTaiwan 20170104-趨勢-陳昇瑋-從資料科學到人工智慧
 
Big Data Industry Insights 2015
Big Data Industry Insights 2015 Big Data Industry Insights 2015
Big Data Industry Insights 2015
 
2016 CIO Agenda
2016 CIO Agenda2016 CIO Agenda
2016 CIO Agenda
 
Gartner: Top 10 Strategic Technology Trends 2016
Gartner: Top 10 Strategic Technology Trends 2016Gartner: Top 10 Strategic Technology Trends 2016
Gartner: Top 10 Strategic Technology Trends 2016
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 
Gartner TOP 10 Strategic Technology Trends 2017
Gartner TOP 10 Strategic Technology Trends 2017Gartner TOP 10 Strategic Technology Trends 2017
Gartner TOP 10 Strategic Technology Trends 2017
 

Similaire à Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)

Hadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapRHadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapRData Con LA
 
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...Amazon Web Services
 
Applications on Hadoop
Applications on HadoopApplications on Hadoop
Applications on Hadoopmarkgrover
 
Cloudera Impala - Las Vegas Big Data Meetup Nov 5th 2014
Cloudera Impala - Las Vegas Big Data Meetup Nov 5th 2014Cloudera Impala - Las Vegas Big Data Meetup Nov 5th 2014
Cloudera Impala - Las Vegas Big Data Meetup Nov 5th 2014cdmaxime
 
Spark One Platform Webinar
Spark One Platform WebinarSpark One Platform Webinar
Spark One Platform WebinarCloudera, Inc.
 
Bay Area Impala User Group Meetup (Sept 16 2014)
Bay Area Impala User Group Meetup (Sept 16 2014)Bay Area Impala User Group Meetup (Sept 16 2014)
Bay Area Impala User Group Meetup (Sept 16 2014)Cloudera, Inc.
 
Webinar: Selecting the Right SQL-on-Hadoop Solution
Webinar: Selecting the Right SQL-on-Hadoop SolutionWebinar: Selecting the Right SQL-on-Hadoop Solution
Webinar: Selecting the Right SQL-on-Hadoop SolutionMapR Technologies
 
Meruvian - Introduction to MapR
Meruvian - Introduction to MapRMeruvian - Introduction to MapR
Meruvian - Introduction to MapRThe World Bank
 
Drill into Drill – How Providing Flexibility and Performance is Possible
Drill into Drill – How Providing Flexibility and Performance is PossibleDrill into Drill – How Providing Flexibility and Performance is Possible
Drill into Drill – How Providing Flexibility and Performance is PossibleMapR Technologies
 
Introduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystemIntroduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystemShivaji Dutta
 
Hadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big DataHadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big DataSenturus
 
Virtual Hadoop Introduction In Chinese
Virtual Hadoop Introduction In ChineseVirtual Hadoop Introduction In Chinese
Virtual Hadoop Introduction In Chinese天青 王
 
Real Time and Big Data – It’s About Time
Real Time and Big Data – It’s About TimeReal Time and Big Data – It’s About Time
Real Time and Big Data – It’s About TimeMapR Technologies
 
Real Time and Big Data – It’s About Time
Real Time and Big Data – It’s About TimeReal Time and Big Data – It’s About Time
Real Time and Big Data – It’s About TimeDataWorks Summit
 
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340Big Data Joe™ Rossi
 

Similaire à Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR) (20)

Hadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapRHadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapR
 
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
 
Applications on Hadoop
Applications on HadoopApplications on Hadoop
Applications on Hadoop
 
Cloudera Impala - Las Vegas Big Data Meetup Nov 5th 2014
Cloudera Impala - Las Vegas Big Data Meetup Nov 5th 2014Cloudera Impala - Las Vegas Big Data Meetup Nov 5th 2014
Cloudera Impala - Las Vegas Big Data Meetup Nov 5th 2014
 
MapR Unique features
MapR Unique featuresMapR Unique features
MapR Unique features
 
Spark One Platform Webinar
Spark One Platform WebinarSpark One Platform Webinar
Spark One Platform Webinar
 
Bay Area Impala User Group Meetup (Sept 16 2014)
Bay Area Impala User Group Meetup (Sept 16 2014)Bay Area Impala User Group Meetup (Sept 16 2014)
Bay Area Impala User Group Meetup (Sept 16 2014)
 
Webinar: Selecting the Right SQL-on-Hadoop Solution
Webinar: Selecting the Right SQL-on-Hadoop SolutionWebinar: Selecting the Right SQL-on-Hadoop Solution
Webinar: Selecting the Right SQL-on-Hadoop Solution
 
2014 08-20-pit-hug
2014 08-20-pit-hug2014 08-20-pit-hug
2014 08-20-pit-hug
 
Meruvian - Introduction to MapR
Meruvian - Introduction to MapRMeruvian - Introduction to MapR
Meruvian - Introduction to MapR
 
Apache Spark Overview
Apache Spark OverviewApache Spark Overview
Apache Spark Overview
 
Drill into Drill – How Providing Flexibility and Performance is Possible
Drill into Drill – How Providing Flexibility and Performance is PossibleDrill into Drill – How Providing Flexibility and Performance is Possible
Drill into Drill – How Providing Flexibility and Performance is Possible
 
Introduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystemIntroduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystem
 
Hadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big DataHadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big Data
 
Virtual Hadoop Introduction In Chinese
Virtual Hadoop Introduction In ChineseVirtual Hadoop Introduction In Chinese
Virtual Hadoop Introduction In Chinese
 
Real Time and Big Data – It’s About Time
Real Time and Big Data – It’s About TimeReal Time and Big Data – It’s About Time
Real Time and Big Data – It’s About Time
 
Real Time and Big Data – It’s About Time
Real Time and Big Data – It’s About TimeReal Time and Big Data – It’s About Time
Real Time and Big Data – It’s About Time
 
Introduction to Spark
Introduction to SparkIntroduction to Spark
Introduction to Spark
 
Getting started big data
Getting started big dataGetting started big data
Getting started big data
 
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
 

Plus de BigDataEverywhere

Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...BigDataEverywhere
 
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...BigDataEverywhere
 
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...BigDataEverywhere
 
Big Data Everywhere Chicago: Unleash the Power of HBase Shell (Conversant)
Big Data Everywhere Chicago: Unleash the Power of HBase Shell (Conversant) Big Data Everywhere Chicago: Unleash the Power of HBase Shell (Conversant)
Big Data Everywhere Chicago: Unleash the Power of HBase Shell (Conversant) BigDataEverywhere
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...BigDataEverywhere
 
Big Data Everywhere Chicago: SQL on Hadoop
Big Data Everywhere Chicago: SQL on Hadoop Big Data Everywhere Chicago: SQL on Hadoop
Big Data Everywhere Chicago: SQL on Hadoop BigDataEverywhere
 

Plus de BigDataEverywhere (6)

Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
 
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
 
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
 
Big Data Everywhere Chicago: Unleash the Power of HBase Shell (Conversant)
Big Data Everywhere Chicago: Unleash the Power of HBase Shell (Conversant) Big Data Everywhere Chicago: Unleash the Power of HBase Shell (Conversant)
Big Data Everywhere Chicago: Unleash the Power of HBase Shell (Conversant)
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
 
Big Data Everywhere Chicago: SQL on Hadoop
Big Data Everywhere Chicago: SQL on Hadoop Big Data Everywhere Chicago: SQL on Hadoop
Big Data Everywhere Chicago: SQL on Hadoop
 

Dernier

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 

Dernier (20)

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 

Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)

  • 1. © 2014 MapR Techno©lo 2g0ie1s4 MapR Technologies Getting Real With Hadoop Jim Scott, Director, Enterprise Strategy & Architecture @kingmesal #BigDataEverywhere #Chicago - October 1st, 2014
  • 2. © 2014 MapR Technologies
  • 3. © 2014 MapR Technologies
  • 4. © 2014 MapR Technologies
  • 5. © 2014 MapR Technologies
  • 6. © 2014 MapR Technologies
  • 7. © 2014 MapR Technologies 6 Can’t We All Just Get Along?
  • 8. © 2014 MapR Technologies 7 We Have All Contributed…
  • 9. The Reality is Architecture Matters 8
  • 10. © 2014 MapR Technologies 9 High Availability (HA) Everywhere No NameNode architecture MapReduce/YARN HA NFS HA Instant recovery Rolling upgrades HA is built in • Distributed metadata can self-heal • No practical limit on # of files • Jobs are not impacted by failures • Meet your data processing SLAs • High throughput and resilience for NFS-based data ingestion, import/export and multi-client access • Files and tables are accessible within seconds of a node failure or cluster restart • Upgrade the software with no downtime • No special configuration to enable HA • All MapR customers operate with HA
  • 11. © 2014 MapR Technologies
  • 12. RDBMS Hammer © 2014 MapR Technologies 11
  • 13. © 2014 MapR Technologies 12
  • 14. Hadoop Hammer © 2014 MapR Technologies 13
  • 15. © 2014 MapR Technologies Data Everywhere! Social Media Messages Audio Sensors Mobile Data Email Clickstream
  • 16. Friends don’t let friends © 2014 MapR Technologies run name nodes.
  • 17. © 2014 MapR Technologies 16 Too Many Files!
  • 18. Friends don’t let friends © 2014 MapR Technologies run name nodes.
  • 19. © 2014 MapR Technologies 18 Volumes 100K volumes are OK, create as many as needed Volumes dramatically simplify management: • Replication factor • Scheduled mirroring • Scheduled snapshots • Data placement control • User access and tracking • Administrative permissions /projects /tahoe /yosemite /user /msmith /bjohnson
  • 20. © 2014 MapR Technologies 19 MapR M7: The Best In-Hadoop Database MapR-DB  NoSQL Columnar Store  Apache HBase API  Integrated with Hadoop HBase JVM HDFS JVM ext3/ext4 Disks Other Distros Tables/Files Disks MapR Enterprise Database Edition (M7) The most scalable, enterprise-grade, NoSQL database that supports online applications and analytics
  • 21. Easy Administration © 2014 MapR Technologies 20 Tradeoffs with Other NoSQL Solutions Reliability 24x7 applications with strong data consistency Performance Continuous low latency with horizontal scaling Easy day-to-day management with minimal learning curve
  • 22. © 2014 MapR Technologies 21 Consistent, Low Read Latency --- M7 Read Latency --- Others Read Latency
  • 23. MapR Integrates Security into Hadoop © 2014 MapR Technologies MapR Integrates Security into Hadoop
  • 24. © 2014 MapR Technologies 23 Hadoop Security Authorization to ensure the right access to files and databases Authentication for users and user-created job requests Encryption to ensure user credentials and data are always secure Integration with existing security infrastructure
  • 25. © 2014 MapR Technologies 24 Fine-Grained Access Control Full POSIX permissions on files and directories ACLs on tables, column families and columns ACLs on MapReduce jobs and queues Administration ACLs on cluster and volumes ACLs for Apache Hive, Apache Drill and Impala
  • 26. Seamless Integration with Direct Access NFS © 2014 MapR Technologies 25 • MapR is POSIX compliant – Random reads/writes – Simultaneous reading and writing to a file – Compression is automatic and transparent
  • 27. Seamless Integration with Direct Access NFS © 2014 MapR Technologies 26 • MapR is POSIX compliant – Random reads/writes – Simultaneous reading and writing to a file – Compression is automatic and transparent • Industry-standard NFS interface (in addition to HDFS API) – Stream data into the cluster – Leverage thousands of tools and applications – Easier to use non-Java programming languages – No need for most proprietary Hadoop connectors
  • 28. © 2014 MapR Technologies 27 Disaster Recovery: Mirroring • Flexible – Choose the volumes/directories to mirror – You don’t need to mirror the entire cluster – Active/active • Fast – No performance impact – Block-level (8KB) deltas – Automatic compression Production Research Production WAN Datacenter 1 Datacenter 2 WAN EC2
  • 29. © 2014 MapR Technologies 28 Disaster Recovery: Mirroring • Flexible – Choose the volumes/directories to mirror – You don’t need to mirror the entire cluster – Active/active • Fast – No performance impact – Block-level (8KB) deltas – Automatic compression • Safe – Point-in-time consistency – End-to-end checksums • Easy – Graceful handling of network issues – No third-party software – Takes less than two minutes to configure! Production Research Production WAN Datacenter 1 Datacenter 2 WAN EC2
  • 30. MapR Advantages MapR-DB Others 99.999% uptime ✓ X Instant recovery from failures ✓ X Continuous low latency (no compactions) ✓ X © 2014 MapR Technologies 29 Zero administration (no processes to manage, self-tuning) ✓ X Online data protection (snapshots, mirroring) ✓ X Scalability (number of tables supported) Trillion Hundreds
  • 31. Packages Supported by various distributions Red – lacking Blue - leading © 2014 MapR Technologies 30 MapR 4.0.1 (Sep 2014) Cloudera 5.1.2 (Aug 2014) Hortonworks 2.1.5 (Aug 2014) Apache Versions (Sep 12th, 2014) Core Hadoop Hadoop Core, YARN 2.4.1 2.3.0 2.4.0 2.5.1 Batch Map Reduce MRv1 and MRv2 MRv1 or MRv2 MRv2 MRv2 Hive 0.12, 0.13 0.12 0.13 0.13 Tez 0.4 (Dev Preview Only) X 0.4 0.5 Pig 0.12 0.12 0.12 0.12 Cascading 2.1.6 X X 2.5 Spark 0.9.2, 1.0.2 1.0.0 1.0.1 (Tech Preview only) 1.1 Interactive SQL Impala 1.2.3 1.4 X 1.4 Drill 0.5 X X 0.5 SparkSQL 1.0.2 X 1.0.1 (Tech Preview only) 1.1 NoSQL and Search HBase/NoSQL 0.94.2, 0.98.4, MapR-DB 0.98 0.98, Accumulo 1.5.1 HBase 0.98 Phoenix X X 4.0.0 4.1.0 AsyncHBase 1.5 X X 1.5 Search LW (Solr) 2.6.1 , 2.7 Cloudera Search 1.5 X NA Machine Learning and Graph Mahout 0.9 0.9 0.9 0.9 MLLib/MLBase 0.9.2, 1.0.2 1.0.0 1.0.1 (Tech Preview only) 1.1 GraphX 0.9.2, 1.0.2 1.0.0 1.0.1 (Tech Preview only) 1.1 Streaming/Messaging Spark Streaming 0.9.2, 1.0.2 1.0.0 1.0.1 (Tech Preview only) 1.1 Storm 0.9, 0.9.2 (Certified) X 0.9.1 0.9.2 Kafka X X 0.8.1.1 (Tech Preview) 0.8.1.1 Data Integration Sqoop, Sqoop2 1.4.4, 1.99.3 1.4.4, 1.99.3 1.4.4 1.4.5 Flume 1.5.0 1.5.0 1.4.0 1.5.0 Knox X X 0.4 0.4 Coordination Oozie 4.0.1 4.0.0 4.0.0 4.0.1 Zookeeper 3.4.5 3.4.5 3.4.5 3.4.5 GUI, Configuration, Monitoring Management MCS CM Ambari Ambari Hue 3.5 3.6 2.5.1 3.6 http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH5/latest/CDH-Version-and-Packaging-Information/cdhvd_cdh_package_tarball.html?scroll=topic_3_unique_8 http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1.5/bk_releasenotes_hdp_2.1/content/ch_relnotes-hdp-2.1.5-product.html
  • 32. © 2014 MapR Technologies Pick the Right Tool for the Job
  • 33. Provisioning & coordination Savannah* Workflow & Data Governance MapR Distribution for Apache Hadoop Data Integration & Access Hue HttpFS Flume Knox* Falcon* Whirr © 2014 MapR Technologies 32 APACHE HADOOP AND OSS ECOSYSTEM Security SQL Drill SparkSQL Impala YARN Batch Spark Cascading Pig Streaming Storm* Spark Streaming NoSQL & Search Solr HBase Juju ML, Graph GraphX MLLib Mahout MapReduce v1 & v2 EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Tez* Accumulo* Hive Sqoop Sentry* Oozie ZooKeeper * Certification/support planned for 2014 Management MapR Data Platform
  • 34. Provisioning & coordination Savannah* Workflow & Data Governance Data Integration & Access Hue HttpFS Flume Knox* Falcon* Whirr NFS HDFS API HBase API JSON API © 2014 MapR Technologies 33 APACHE HADOOP AND OSS ECOSYSTEM Security SQL Drill SparkSQL Impala YARN Batch Spark Cascading Pig Streaming Storm* Spark Streaming NoSQL & Search Solr HBase Juju ML, Graph GraphX MLLib Mahout MapReduce v1 & v2 EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Tez* Accumulo* Hive Sqoop Sentry* Oozie ZooKeeper MapR Control System (Management and Monitoring) * Certification/support planned for 2014 CLI REST API GUI MapR Distribution for Apache Hadoop
  • 35. © 2014 MapR Technologies 1.65TB WITH 298 SERVERS
  • 36. © 2014 MapR Technologies 35 1/7th the Hardware Footprint
  • 37. Forrester Wave™: Big Data Hadoop Solutions, Q1‘14 February 2014 “The Forrester Wave™: Big Data Hadoop Solutions, Q1 2014” © 2014 MapR Technologies 36
  • 38. © 2014 MapR Technologies
  • 39. • Pioneering Data Agility for Hadoop • Apache open source project • Scale-out execution engine for low-latency queries • Unified SQL-based API for analytics & operational applications © 2014 MapR Technologies 38 APACHE DRILL 40+ contributors 150+ years of experience building databases and distributed systems
  • 40. Drill Supports Schema Discovery On-The-Fly Schema Declared In Advance Schema Discovered On-The-Fly Schema Schema2 The-Fly © 2014 MapR Technologies 39 • Fixed schema • Leverage schema in centralized repository (Hive Metastore) • Fixed schema, evolving schema or schema-less • Leverage schema in centralized repository or self-describing data SCHEMA ON WRITE SCHEMA BEFORE READ SCHEMA ON THE FLY
  • 41. © 2014 © 201 M4 aMpaRp RTe Tcehcnhonloogloiegsies 40 Operational Analytics
  • 42. © 2014 MapR Technologies 41 Must Be Able to Scale
  • 43. © 2014 MapR Technologies 42 Mobile application server Real-time ad targeting Data exploration (SQL) Real-time and Operational Actionable Analytics Hadoop (MapR M7) •User profiles and state •User interactions •Real-time location data •Web and mobile session state •Comments/rankings Web application server Customer 360 dashboard Churn analysis (predictive analytics) Product/service optimization and personalization
  • 44. © 2014 MapR Technologies 43 General Application Monitoring
  • 45. © 2014 MapR Technologies 44 Hard Drive Failure Rates
  • 46. © 2014 MapR Technologies 45 Recommendation Engines
  • 47. © 2014 MapR Technologies 46 20M SONGS Media Content Recommendation Engine
  • 48. © 2014 MapR Technologies Fraud Detection
  • 49. © 2014 MapR Technologies 48 104M CARD MEMBERS Offer Serving, Credit Risk & Fraud More than $600B+
  • 50. 100M Data Points per second Fastest Data Ingest Rates © 2014 PEOPLE MapR Technologies 49
  • 51. © 2014 MapR Technologies 50 Speed and Intelligence…
  • 52. Forrester Wave™: NoSQL Key-Value Databases, Q3‘14 September 2014 “The Forrester Wave™: NoSQL Key-Value Databases, Q3 2014” © 2014 MapR Technologies 51
  • 53. © 2014 MapR Technologies 52 MapR Editions  Control System  NFS Access  Performance  Unlimited Nodes  Free  All the Features of M5  Simplified Administration for HBase  Increased Performance  Consistent Low Latency  Unified Snapshots, Mirroring  Control System  NFS Access  Performance  High Availability  Snapshots & Mirroring  24 X 7 Support  Annual Subscription Fastest On-Ramp: MapR Sandbox for Hadoop
  • 54. © 2014 MapR Technologies Engage with us! @mapr maprtech jscott@mapr.com MapR maprtech mapr-technologies