SlideShare une entreprise Scribd logo
1  sur  32
Télécharger pour lire hors ligne
BIG
                                                          DATA
                                                         IS CHANGING THE
                                                              WORLD


© Copyright 2010 EMC Corporation. All rights reserved.                     1
IN THIS DECADE THE DIGITAL UNIVERSE

    WILL GROW 44X
      FROM 0.9 ZETTABYTES TO 35.2 ZETTABYTES



     Source : 2010 IDC Digital Universe Study



© Copyright 2010 EMC Corporation. All rights reserved.   2
90% OF THE
      DIGITAL UNIVERSE IS
      UNSTRUCTURED

      Source: 2011 IDC Digital Universe Study



© Copyright 2010 EMC Corporation. All rights reserved.   3
Big Data Has Arrived
                                                                       Electronic
                                                                       Payments                    Video Rendering



                                                                                       Video
       Mobile Sensors                         Social Media                          Surveillance

                                                                             Medical Imaging
                                                                                                         Gene
                                                                                                         Sequencing

                                                         Geophysical
             Smart Grids
                                                         Exploration


© Copyright 2010 EMC Corporation. All rights reserved.                                                                4
Deliver Better Healthcare With Big Data
      Billion Dollar Specialty Care Service Provider
                                                        Legacy System &                                     New System &
                                                                                            International
                                                        Traditional Data                       Results            Big Data
                              Quality Of Patient Care




                                                                            Treatment
                                                                           Pathways On
                                                            Treatment      All The Data
                                                           Pathways On                                          Individual
                                                          Summary Data                                        Patient History

                                                                                           Social &
                                                                                          Economic
                                                                                           Factors




© Copyright 2010 EMC Corporation. All rights reserved.                                                                          5
Increase Profit Margins With Big Data
      Retail Banking Firm Aligns Offers To Customers
                                                Legacy System &                                        New System &
                                                Traditional Data                                             Big Data

                                                                      Profit-Based
                              Customer Profit




                                                                   Recommendations

                                                                                                            Identify
                                                     Agent
                                                                                                           “At-Risk”
                                                  “Best Guess”
                                                                                                          Customers
                                                                                        User Based
                                                                                     Recommendations




© Copyright 2010 EMC Corporation. All rights reserved.                                                                  6
Classifying and segmenting Big Data
          • Rich content stores—original intellectual property or value-added
                   – Media, VOD, content creation, special effects, satellite imagery, GIS data

          • Generated from workflow—must be managed/processed quickly & cheaply
                   – Manufacturing, simulation, electronic design

          • Develop new intellectual property based on big data
                   – Pharmaceutical companies doing customised drug development

          • Companies, public sector, utilities mining data for business advantage
          • Some mine consumer data—higher-volume and potentially higher-value



© Copyright 2010 EMC Corporation. All rights reserved.                                            7
Big Data is File & Unstructured Data
                            90
                            80
                            70
                            60
                EXABYTES




                            50
                            40
                            30
                            20
                            10
                             0
                                        2009             2010           2011        2012          2013      2014
                                                         File Based: 60.7% CAGR   Block Based: 21.8% CAGR

                           By 2012, 80% of all storage capacity sold will be for file-based data

         Source: IDC
© Copyright 2010 EMC Corporation. All rights reserved.                                                             8
Why is Big Data appearing now?




                                                         Source: IDC



© Copyright 2010 EMC Corporation. All rights reserved.                 9
Gartner’s 3 V’s of Big Data




© Copyright 2010 EMC Corporation. All rights reserved.   10
“The Internet of Things”
          • Massive explosion of smart devices, all sending, receiving, storing data
                   – handhelds, tablets, cameras
                   – Human-oriented devices
          • Non-human-oriented devices
                   – sensors, embedded CPUs
          • Social networking messages & data grow exponentially
                   – Twitter feeds, Facebook updates, LinkedIn messages
          • Increasingly, business is conducted digitally – or digitized
          • Big Data is global – any source to any target


© Copyright 2010 EMC Corporation. All rights reserved.                                 11
Source:
                                                         GoGlobe



© Copyright 2010 EMC Corporation. All rights reserved.             12
Companies want to store big data—Why?
          • Google – Originally thought of as “search engine”
                   – Now: Storing the Internet, storing every search query
          • Facebook, Twitter – Just social media?
                   – Storing every message you send, monitoring every
                     market trend
          • Amazon – your every purchase, forever
          • Carriers – Storing location-based data on everyone

© Copyright 2010 EMC Corporation. All rights reserved.                       13
Social Networking Analysis
                                                         Courtesy of NSF Workshop on Social Modeling


© Copyright 2010 EMC Corporation. All rights reserved.                                                 14
The race is on
      • Big Data leads to the Optimised Organisation
      • Takes a long time to build a functioning data
        warehouse, analytics tools, connect to business
      • Many companies have a head start
      • Every CIO needs to consider Big Data in their
        strategy to stay ahead
               – How to manage, how to leverage

© Copyright 2010 EMC Corporation. All rights reserved.    15
A little retailer I once knew
      • Why can Amazon beat everyone on price?
      • Purchase information used to adjust supply chain
      • Shipping and logistics adjusted according to conditions on
        the ground and supply chain
      • Other customers’ information used to provide
        recommendations, improve experience
      • Not just Amazon: Tesco, Carrefour, Metro, etc all taking
        advantage

© Copyright 2010 EMC Corporation. All rights reserved.               16
How do we make decisions?
      • Good data is hard to get—so often on no data at all
      • Often on information from peers, colleagues,
        reports, or because it’s always been done that way
      • Many companies fail because they fail to detect
           shifts in consumer demand
      • Internet has made customers more segmented, and
        causes customer choice to change faster

© Copyright 2010 EMC Corporation. All rights reserved.        17
Moving to a Data-Driven Model
                                                         • Managing with the facts
                                                         • Making a science out of data!
                                                         • Experimental model—different
                                                           than BI
                                                         • Moving from “gut feel” to
                                                           rational, scentific decisions


© Copyright 2010 EMC Corporation. All rights reserved.                                     18
Big-Data-based Decisions
      • Unlock value by making information transparent
        and useable at higher frequency
      • More accurate information (e.g. inventories, trends)
      • Tailor products more precisely
      • Sophisticated analytics makes for better decisions
      • Better products (via web feedback, sensors, etc)
                                                         Source: McKinsey



© Copyright 2010 EMC Corporation. All rights reserved.                      19
What holds back big data?
      • Not ICT—compute & storage getting
        bigger, cheaper, easier
      • Not the quantity of data (see slide 1)
      • Not the value—large-scale Big Data
        projects generally have great ROI
      • Real problems are organisational
        change and talent acquisition


© Copyright 2010 EMC Corporation. All rights reserved.   20
© Copyright 2010 EMC Corporation. All rights reserved.   21
How are people doing it?
         • Enterprises ingesting > 1PB data per day within 5 yrs
         • Big data is often largely unstructured
         • Hadoop is an application written to analyze big data
                   –     open source, Java-based

         • Big data can mean billions to trillions of files
                   –     Each file can be gigabytes to terabytes in size
         •    Directed graph analysis, Collaborative Filtering, A/B testing, Associative Rule Learning, Classification, Natural Language
              processing, Data Mining, Pattern Matching, Sentiment Analysis, Comparative Effectiveness, Clinical Decision Support are
              examples of big data techniques

         • This means petabytes to exabytes of data




© Copyright 2010 EMC Corporation. All rights reserved.                                                                                     22
How do you manage and design for Big Data?
          • Scale and parallelism are the keys
                   – Big data is far too big to process sequentially
                   – Too much coming in too quickly
                   – Example: Banks seeking to process market data
                     more quickly, reducing decision making time from
                     days to minutes
          • Answer: Scale-out storage and scale-out processing


© Copyright 2010 EMC Corporation. All rights reserved.                  23
Cramming big data onto traditional models
       Server




                                                         Scalability
       Network




                                                         Performance
                                                         Management
                                                         Availability
                                                         Cost
       Storage




© Copyright 2010 EMC Corporation. All rights reserved.                  24
A different idea – scale-out
       Server




                                                         Scalability
       Network




                                                         Performance
                                                         Management
                                                         Availability
                                                         Cost
       Storage




© Copyright 2010 EMC Corporation. All rights reserved.                  25
Enterprise Hadoop: Greenplum & Isilon
      • Easier and more reliable
               – Packaged Hadoop distribution with Isilon storage
      • Purpose-built Hadoop infrastructure
               – Faster, less risk
      • Sharing expertise to address the talent gap
               – Architecture, data science, and roadmap services
      • Proven at scale with worldwide support
               – 24x7 one call Hadoop support from EMC
               – Key component of Greenplum UAP
               – Unstructured data processing


© Copyright 2010 EMC Corporation. All rights reserved.              26
Increasing Demand for Advanced Analytics
      • Complex
               – Deep, rich analysis of big data sets
               – Ad hoc, interactive analysis, not structured reports
      • Timely
               – On-going, frequent analysis (e.g. daily, weekly)
               – Insights delivered in minutes/seconds
      • Actionable
               – Forward looking, predictive insight
               – Create new business value


© Copyright 2010 EMC Corporation. All rights reserved.                  27
EMC Greenplum: Purpose-built for Big Data
      • EMC Greenplum is a shared nothing, massively parallel
        processing (MPP) data warehouse system
      • Core principle of data computing is to move the processing
        dramatically closer to the data and to the people

                          Fast Data
                           Loading
                                                         Extreme Performance        Unified
                                                          & Elastic Scalability   Data Access


© Copyright 2010 EMC Corporation. All rights reserved.                                          28
MPP Shared-Nothing Architecture
    Greenplum’s Massively                                                                                         MapReduce
    Parallel Processing (MPP)
    Database has extreme
    scalability on general purpose                         Master
    systems                                                Servers                              ...                           ...
                                                         Query planning
    Automatic parallelization                             and dispatch

         – Load and query like any                         Network
           database                                      Interconnect
    Scan and process in parallel                          Segment                                                                                     ...
         – Extremely scalable and I/O                     Servers ...
           optimized                                      Storage and
                                                             query             ...      ...    ...    ...   ...      ...      ...   ...   ...   ...
    Linear scalability by adding                          processing

    nodes                                                  External
         – Each adds storage, query                        Sources
           performance and loading                        MPP loading,
                                                         streaming, etc.
           performance



© Copyright 2011 EMC Corporation. All rights reserved.       EMC Confidential – NDA Required                                                                29
EMC Hadoop.
                                                         Open Source.
                                                         Fully Supported By
                                                         EMC.
© Copyright 2010 EMC Corporation. All rights reserved.                        30
The EMC Big Data “Stack”
4       Collaborative                                          Act
                                                         Documentum xCP
                                                                             ?

3          Real Time                                        Analyze
                                                         Greenplum, Hadoop

2       Structured &
        Unstructured



                                                              Store
1          Petabyte
            Scale
                                                          Isilon and Atmos



© Copyright 2010 EMC Corporation. All rights reserved.                           31
THANK
                                                          YOU
                                                         HAVE A GREAT
                                                         CONFERENCE!
© Copyright 2010 EMC Corporation. All rights reserved.                  32

Contenu connexe

En vedette

NAG December 2012
NAG December 2012NAG December 2012
NAG December 2012Eduserv
 
Licensing challenges under distributed education - Martyn Jansen
Licensing challenges under distributed education - Martyn JansenLicensing challenges under distributed education - Martyn Jansen
Licensing challenges under distributed education - Martyn JansenEduserv
 
Electronic resources and student attainment - Phil Adams
Electronic resources and student attainment - Phil AdamsElectronic resources and student attainment - Phil Adams
Electronic resources and student attainment - Phil AdamsEduserv
 
Single sign-on to online subscriptions with OpenAthens
Single sign-on to online subscriptions with OpenAthensSingle sign-on to online subscriptions with OpenAthens
Single sign-on to online subscriptions with OpenAthensEduserv
 
Traditional outsourcing is dead! What should your new ICT partner look like?
Traditional outsourcing is dead! What should your new ICT partner look like?Traditional outsourcing is dead! What should your new ICT partner look like?
Traditional outsourcing is dead! What should your new ICT partner look like?Eduserv
 
Usability workshop: the good, the bad and the ugly
Usability workshop: the good, the bad and the uglyUsability workshop: the good, the bad and the ugly
Usability workshop: the good, the bad and the uglyEduserv
 
NAG Presentation to Eduserv Maths and Stats Software Dec2014
NAG Presentation to Eduserv Maths and Stats Software Dec2014NAG Presentation to Eduserv Maths and Stats Software Dec2014
NAG Presentation to Eduserv Maths and Stats Software Dec2014Eduserv
 
IWMW11: A2 working against the silo
IWMW11: A2 working against the siloIWMW11: A2 working against the silo
IWMW11: A2 working against the siloEduserv
 
Cloud Computing - a legal view from Bird & Bird
Cloud Computing - a legal view from Bird & BirdCloud Computing - a legal view from Bird & Bird
Cloud Computing - a legal view from Bird & BirdEduserv
 
Trust me – I’m a journalist - Frank Walker
Trust me – I’m a journalist - Frank WalkerTrust me – I’m a journalist - Frank Walker
Trust me – I’m a journalist - Frank WalkerEduserv
 
Security in the cloud - making it a safe prospect
Security in the cloud - making it a safe prospectSecurity in the cloud - making it a safe prospect
Security in the cloud - making it a safe prospectEduserv
 
Extending Access Management to Business & Community Engagement - John Paschoud
Extending Access Managementto Business & Community Engagement - John PaschoudExtending Access Managementto Business & Community Engagement - John Paschoud
Extending Access Management to Business & Community Engagement - John PaschoudEduserv
 
Logos, Labels and Login - Rod Widdowson
Logos, Labels and Login - Rod WiddowsonLogos, Labels and Login - Rod Widdowson
Logos, Labels and Login - Rod WiddowsonEduserv
 
The potential of DevOps for cloud
The potential of DevOps for cloudThe potential of DevOps for cloud
The potential of DevOps for cloudEduserv
 
Making federations work together more effectively - Nicole Harris, JISC Adva...
Making federations work together more effectively -  Nicole Harris, JISC Adva...Making federations work together more effectively -  Nicole Harris, JISC Adva...
Making federations work together more effectively - Nicole Harris, JISC Adva...Eduserv
 

En vedette (15)

NAG December 2012
NAG December 2012NAG December 2012
NAG December 2012
 
Licensing challenges under distributed education - Martyn Jansen
Licensing challenges under distributed education - Martyn JansenLicensing challenges under distributed education - Martyn Jansen
Licensing challenges under distributed education - Martyn Jansen
 
Electronic resources and student attainment - Phil Adams
Electronic resources and student attainment - Phil AdamsElectronic resources and student attainment - Phil Adams
Electronic resources and student attainment - Phil Adams
 
Single sign-on to online subscriptions with OpenAthens
Single sign-on to online subscriptions with OpenAthensSingle sign-on to online subscriptions with OpenAthens
Single sign-on to online subscriptions with OpenAthens
 
Traditional outsourcing is dead! What should your new ICT partner look like?
Traditional outsourcing is dead! What should your new ICT partner look like?Traditional outsourcing is dead! What should your new ICT partner look like?
Traditional outsourcing is dead! What should your new ICT partner look like?
 
Usability workshop: the good, the bad and the ugly
Usability workshop: the good, the bad and the uglyUsability workshop: the good, the bad and the ugly
Usability workshop: the good, the bad and the ugly
 
NAG Presentation to Eduserv Maths and Stats Software Dec2014
NAG Presentation to Eduserv Maths and Stats Software Dec2014NAG Presentation to Eduserv Maths and Stats Software Dec2014
NAG Presentation to Eduserv Maths and Stats Software Dec2014
 
IWMW11: A2 working against the silo
IWMW11: A2 working against the siloIWMW11: A2 working against the silo
IWMW11: A2 working against the silo
 
Cloud Computing - a legal view from Bird & Bird
Cloud Computing - a legal view from Bird & BirdCloud Computing - a legal view from Bird & Bird
Cloud Computing - a legal view from Bird & Bird
 
Trust me – I’m a journalist - Frank Walker
Trust me – I’m a journalist - Frank WalkerTrust me – I’m a journalist - Frank Walker
Trust me – I’m a journalist - Frank Walker
 
Security in the cloud - making it a safe prospect
Security in the cloud - making it a safe prospectSecurity in the cloud - making it a safe prospect
Security in the cloud - making it a safe prospect
 
Extending Access Management to Business & Community Engagement - John Paschoud
Extending Access Managementto Business & Community Engagement - John PaschoudExtending Access Managementto Business & Community Engagement - John Paschoud
Extending Access Management to Business & Community Engagement - John Paschoud
 
Logos, Labels and Login - Rod Widdowson
Logos, Labels and Login - Rod WiddowsonLogos, Labels and Login - Rod Widdowson
Logos, Labels and Login - Rod Widdowson
 
The potential of DevOps for cloud
The potential of DevOps for cloudThe potential of DevOps for cloud
The potential of DevOps for cloud
 
Making federations work together more effectively - Nicole Harris, JISC Adva...
Making federations work together more effectively -  Nicole Harris, JISC Adva...Making federations work together more effectively -  Nicole Harris, JISC Adva...
Making federations work together more effectively - Nicole Harris, JISC Adva...
 

Similaire à Rob anderson

Big Data: A Big Trap for Product Development
Big Data: A Big Trap for Product DevelopmentBig Data: A Big Trap for Product Development
Big Data: A Big Trap for Product DevelopmentStrategy 2 Market, Inc,
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data AnalyticsEMC
 
Big data cloud cloud circle keynote_final laura colvine 8th november 2012
Big data cloud cloud circle keynote_final laura colvine 8th november 2012Big data cloud cloud circle keynote_final laura colvine 8th november 2012
Big data cloud cloud circle keynote_final laura colvine 8th november 2012IBM
 
Big Data & the Cloud
Big Data & the CloudBig Data & the Cloud
Big Data & the CloudDATAVERSITY
 
TSB_IoT_Presentations_27June2012
TSB_IoT_Presentations_27June2012TSB_IoT_Presentations_27June2012
TSB_IoT_Presentations_27June2012100%Open
 
Demonstrating the Future of Data Science
Demonstrating the Future of Data ScienceDemonstrating the Future of Data Science
Demonstrating the Future of Data Sciencegreenplum
 
48 benot-long
48 benot-long48 benot-long
48 benot-longKBIZEAU
 
DiabetesManagement mHIseminar.Peeples
DiabetesManagement mHIseminar.PeeplesDiabetesManagement mHIseminar.Peeples
DiabetesManagement mHIseminar.PeeplesmHealth Initiative
 
Peter Schelkens - Future Media and Imaging
Peter Schelkens - Future Media and ImagingPeter Schelkens - Future Media and Imaging
Peter Schelkens - Future Media and Imagingimec.archive
 
Keynote - Randy Newell of IBM
Keynote - Randy Newell of IBMKeynote - Randy Newell of IBM
Keynote - Randy Newell of IBMjowen_evansdata
 
Low Hon Chau
Low Hon ChauLow Hon Chau
Low Hon ChauNone None
 
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big DataDr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big DataGlobal Business Events
 
Smarter Planet: How Big Data changes our world
Smarter Planet: How Big Data changes our worldSmarter Planet: How Big Data changes our world
Smarter Planet: How Big Data changes our worldKim Escherich
 
Powering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache HadoopPowering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache HadoopHortonworks
 

Similaire à Rob anderson (20)

Greenplum hadoop
Greenplum hadoopGreenplum hadoop
Greenplum hadoop
 
Greenplum hadoop
Greenplum hadoopGreenplum hadoop
Greenplum hadoop
 
Big Data: A Big Trap for Product Development
Big Data: A Big Trap for Product DevelopmentBig Data: A Big Trap for Product Development
Big Data: A Big Trap for Product Development
 
KMWorld Presentation
KMWorld PresentationKMWorld Presentation
KMWorld Presentation
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big data cloud cloud circle keynote_final laura colvine 8th november 2012
Big data cloud cloud circle keynote_final laura colvine 8th november 2012Big data cloud cloud circle keynote_final laura colvine 8th november 2012
Big data cloud cloud circle keynote_final laura colvine 8th november 2012
 
Big Data & the Cloud
Big Data & the CloudBig Data & the Cloud
Big Data & the Cloud
 
IBM Big Data Platform Nov 2012
IBM Big Data Platform Nov 2012IBM Big Data Platform Nov 2012
IBM Big Data Platform Nov 2012
 
TSB_IoT_Presentations_27June2012
TSB_IoT_Presentations_27June2012TSB_IoT_Presentations_27June2012
TSB_IoT_Presentations_27June2012
 
Demonstrating the Future of Data Science
Demonstrating the Future of Data ScienceDemonstrating the Future of Data Science
Demonstrating the Future of Data Science
 
101 ab 1415-1445
101 ab 1415-1445101 ab 1415-1445
101 ab 1415-1445
 
48 benot-long
48 benot-long48 benot-long
48 benot-long
 
DiabetesManagement mHIseminar.Peeples
DiabetesManagement mHIseminar.PeeplesDiabetesManagement mHIseminar.Peeples
DiabetesManagement mHIseminar.Peeples
 
Peter Schelkens - Future Media and Imaging
Peter Schelkens - Future Media and ImagingPeter Schelkens - Future Media and Imaging
Peter Schelkens - Future Media and Imaging
 
Keynote - Randy Newell of IBM
Keynote - Randy Newell of IBMKeynote - Randy Newell of IBM
Keynote - Randy Newell of IBM
 
On Demand Cloud Services Coury
On Demand Cloud Services   CouryOn Demand Cloud Services   Coury
On Demand Cloud Services Coury
 
Low Hon Chau
Low Hon ChauLow Hon Chau
Low Hon Chau
 
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big DataDr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
 
Smarter Planet: How Big Data changes our world
Smarter Planet: How Big Data changes our worldSmarter Planet: How Big Data changes our world
Smarter Planet: How Big Data changes our world
 
Powering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache HadoopPowering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache Hadoop
 

Plus de Eduserv

Phase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect optionPhase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect optionEduserv
 
Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources Eduserv
 
Lightning talk - EBSCO
Lightning talk - EBSCOLightning talk - EBSCO
Lightning talk - EBSCOEduserv
 
Lightning talk - Boopsie
Lightning talk - BoopsieLightning talk - Boopsie
Lightning talk - BoopsieEduserv
 
Lightning talk - Softlink
Lightning talk - SoftlinkLightning talk - Softlink
Lightning talk - SoftlinkEduserv
 
Lightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZineLightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZineEduserv
 
Lightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest AgreementsLightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest AgreementsEduserv
 
Phase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolutionPhase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolutionEduserv
 
Key considerations when mapping your end user experience
Key considerations when mapping your end user experienceKey considerations when mapping your end user experience
Key considerations when mapping your end user experienceEduserv
 
Our product development methodology
Our product development methodologyOur product development methodology
Our product development methodologyEduserv
 
How Readers Discover Content
How Readers Discover ContentHow Readers Discover Content
How Readers Discover ContentEduserv
 
OpenAthens product update
OpenAthens product updateOpenAthens product update
OpenAthens product updateEduserv
 
OpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome addressOpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome addressEduserv
 
Generating leads with content marketing
Generating leads with content marketingGenerating leads with content marketing
Generating leads with content marketingEduserv
 
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016Eduserv
 
Mobius from Maplesoft
Mobius from MaplesoftMobius from Maplesoft
Mobius from MaplesoftEduserv
 
QSR NVivo
QSR NVivo QSR NVivo
QSR NVivo Eduserv
 
How Eduserv are helping local government organisations
How Eduserv are helping local government organisationsHow Eduserv are helping local government organisations
How Eduserv are helping local government organisationsEduserv
 
Is cloud the right fit for your needs?
Is cloud the right fit for your needs?Is cloud the right fit for your needs?
Is cloud the right fit for your needs?Eduserv
 
Planning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing CouncilsPlanning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing CouncilsEduserv
 

Plus de Eduserv (20)

Phase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect optionPhase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect option
 
Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources
 
Lightning talk - EBSCO
Lightning talk - EBSCOLightning talk - EBSCO
Lightning talk - EBSCO
 
Lightning talk - Boopsie
Lightning talk - BoopsieLightning talk - Boopsie
Lightning talk - Boopsie
 
Lightning talk - Softlink
Lightning talk - SoftlinkLightning talk - Softlink
Lightning talk - Softlink
 
Lightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZineLightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZine
 
Lightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest AgreementsLightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest Agreements
 
Phase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolutionPhase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolution
 
Key considerations when mapping your end user experience
Key considerations when mapping your end user experienceKey considerations when mapping your end user experience
Key considerations when mapping your end user experience
 
Our product development methodology
Our product development methodologyOur product development methodology
Our product development methodology
 
How Readers Discover Content
How Readers Discover ContentHow Readers Discover Content
How Readers Discover Content
 
OpenAthens product update
OpenAthens product updateOpenAthens product update
OpenAthens product update
 
OpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome addressOpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome address
 
Generating leads with content marketing
Generating leads with content marketingGenerating leads with content marketing
Generating leads with content marketing
 
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
 
Mobius from Maplesoft
Mobius from MaplesoftMobius from Maplesoft
Mobius from Maplesoft
 
QSR NVivo
QSR NVivo QSR NVivo
QSR NVivo
 
How Eduserv are helping local government organisations
How Eduserv are helping local government organisationsHow Eduserv are helping local government organisations
How Eduserv are helping local government organisations
 
Is cloud the right fit for your needs?
Is cloud the right fit for your needs?Is cloud the right fit for your needs?
Is cloud the right fit for your needs?
 
Planning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing CouncilsPlanning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing Councils
 

Dernier

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 

Dernier (20)

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 

Rob anderson

  • 1. BIG DATA IS CHANGING THE WORLD © Copyright 2010 EMC Corporation. All rights reserved. 1
  • 2. IN THIS DECADE THE DIGITAL UNIVERSE WILL GROW 44X FROM 0.9 ZETTABYTES TO 35.2 ZETTABYTES Source : 2010 IDC Digital Universe Study © Copyright 2010 EMC Corporation. All rights reserved. 2
  • 3. 90% OF THE DIGITAL UNIVERSE IS UNSTRUCTURED Source: 2011 IDC Digital Universe Study © Copyright 2010 EMC Corporation. All rights reserved. 3
  • 4. Big Data Has Arrived Electronic Payments Video Rendering Video Mobile Sensors Social Media Surveillance Medical Imaging Gene Sequencing Geophysical Smart Grids Exploration © Copyright 2010 EMC Corporation. All rights reserved. 4
  • 5. Deliver Better Healthcare With Big Data Billion Dollar Specialty Care Service Provider Legacy System & New System & International Traditional Data Results Big Data Quality Of Patient Care Treatment Pathways On Treatment All The Data Pathways On Individual Summary Data Patient History Social & Economic Factors © Copyright 2010 EMC Corporation. All rights reserved. 5
  • 6. Increase Profit Margins With Big Data Retail Banking Firm Aligns Offers To Customers Legacy System & New System & Traditional Data Big Data Profit-Based Customer Profit Recommendations Identify Agent “At-Risk” “Best Guess” Customers User Based Recommendations © Copyright 2010 EMC Corporation. All rights reserved. 6
  • 7. Classifying and segmenting Big Data • Rich content stores—original intellectual property or value-added – Media, VOD, content creation, special effects, satellite imagery, GIS data • Generated from workflow—must be managed/processed quickly & cheaply – Manufacturing, simulation, electronic design • Develop new intellectual property based on big data – Pharmaceutical companies doing customised drug development • Companies, public sector, utilities mining data for business advantage • Some mine consumer data—higher-volume and potentially higher-value © Copyright 2010 EMC Corporation. All rights reserved. 7
  • 8. Big Data is File & Unstructured Data 90 80 70 60 EXABYTES 50 40 30 20 10 0 2009 2010 2011 2012 2013 2014 File Based: 60.7% CAGR Block Based: 21.8% CAGR By 2012, 80% of all storage capacity sold will be for file-based data Source: IDC © Copyright 2010 EMC Corporation. All rights reserved. 8
  • 9. Why is Big Data appearing now? Source: IDC © Copyright 2010 EMC Corporation. All rights reserved. 9
  • 10. Gartner’s 3 V’s of Big Data © Copyright 2010 EMC Corporation. All rights reserved. 10
  • 11. “The Internet of Things” • Massive explosion of smart devices, all sending, receiving, storing data – handhelds, tablets, cameras – Human-oriented devices • Non-human-oriented devices – sensors, embedded CPUs • Social networking messages & data grow exponentially – Twitter feeds, Facebook updates, LinkedIn messages • Increasingly, business is conducted digitally – or digitized • Big Data is global – any source to any target © Copyright 2010 EMC Corporation. All rights reserved. 11
  • 12. Source: GoGlobe © Copyright 2010 EMC Corporation. All rights reserved. 12
  • 13. Companies want to store big data—Why? • Google – Originally thought of as “search engine” – Now: Storing the Internet, storing every search query • Facebook, Twitter – Just social media? – Storing every message you send, monitoring every market trend • Amazon – your every purchase, forever • Carriers – Storing location-based data on everyone © Copyright 2010 EMC Corporation. All rights reserved. 13
  • 14. Social Networking Analysis Courtesy of NSF Workshop on Social Modeling © Copyright 2010 EMC Corporation. All rights reserved. 14
  • 15. The race is on • Big Data leads to the Optimised Organisation • Takes a long time to build a functioning data warehouse, analytics tools, connect to business • Many companies have a head start • Every CIO needs to consider Big Data in their strategy to stay ahead – How to manage, how to leverage © Copyright 2010 EMC Corporation. All rights reserved. 15
  • 16. A little retailer I once knew • Why can Amazon beat everyone on price? • Purchase information used to adjust supply chain • Shipping and logistics adjusted according to conditions on the ground and supply chain • Other customers’ information used to provide recommendations, improve experience • Not just Amazon: Tesco, Carrefour, Metro, etc all taking advantage © Copyright 2010 EMC Corporation. All rights reserved. 16
  • 17. How do we make decisions? • Good data is hard to get—so often on no data at all • Often on information from peers, colleagues, reports, or because it’s always been done that way • Many companies fail because they fail to detect shifts in consumer demand • Internet has made customers more segmented, and causes customer choice to change faster © Copyright 2010 EMC Corporation. All rights reserved. 17
  • 18. Moving to a Data-Driven Model • Managing with the facts • Making a science out of data! • Experimental model—different than BI • Moving from “gut feel” to rational, scentific decisions © Copyright 2010 EMC Corporation. All rights reserved. 18
  • 19. Big-Data-based Decisions • Unlock value by making information transparent and useable at higher frequency • More accurate information (e.g. inventories, trends) • Tailor products more precisely • Sophisticated analytics makes for better decisions • Better products (via web feedback, sensors, etc) Source: McKinsey © Copyright 2010 EMC Corporation. All rights reserved. 19
  • 20. What holds back big data? • Not ICT—compute & storage getting bigger, cheaper, easier • Not the quantity of data (see slide 1) • Not the value—large-scale Big Data projects generally have great ROI • Real problems are organisational change and talent acquisition © Copyright 2010 EMC Corporation. All rights reserved. 20
  • 21. © Copyright 2010 EMC Corporation. All rights reserved. 21
  • 22. How are people doing it? • Enterprises ingesting > 1PB data per day within 5 yrs • Big data is often largely unstructured • Hadoop is an application written to analyze big data – open source, Java-based • Big data can mean billions to trillions of files – Each file can be gigabytes to terabytes in size • Directed graph analysis, Collaborative Filtering, A/B testing, Associative Rule Learning, Classification, Natural Language processing, Data Mining, Pattern Matching, Sentiment Analysis, Comparative Effectiveness, Clinical Decision Support are examples of big data techniques • This means petabytes to exabytes of data © Copyright 2010 EMC Corporation. All rights reserved. 22
  • 23. How do you manage and design for Big Data? • Scale and parallelism are the keys – Big data is far too big to process sequentially – Too much coming in too quickly – Example: Banks seeking to process market data more quickly, reducing decision making time from days to minutes • Answer: Scale-out storage and scale-out processing © Copyright 2010 EMC Corporation. All rights reserved. 23
  • 24. Cramming big data onto traditional models Server Scalability Network Performance Management Availability Cost Storage © Copyright 2010 EMC Corporation. All rights reserved. 24
  • 25. A different idea – scale-out Server Scalability Network Performance Management Availability Cost Storage © Copyright 2010 EMC Corporation. All rights reserved. 25
  • 26. Enterprise Hadoop: Greenplum & Isilon • Easier and more reliable – Packaged Hadoop distribution with Isilon storage • Purpose-built Hadoop infrastructure – Faster, less risk • Sharing expertise to address the talent gap – Architecture, data science, and roadmap services • Proven at scale with worldwide support – 24x7 one call Hadoop support from EMC – Key component of Greenplum UAP – Unstructured data processing © Copyright 2010 EMC Corporation. All rights reserved. 26
  • 27. Increasing Demand for Advanced Analytics • Complex – Deep, rich analysis of big data sets – Ad hoc, interactive analysis, not structured reports • Timely – On-going, frequent analysis (e.g. daily, weekly) – Insights delivered in minutes/seconds • Actionable – Forward looking, predictive insight – Create new business value © Copyright 2010 EMC Corporation. All rights reserved. 27
  • 28. EMC Greenplum: Purpose-built for Big Data • EMC Greenplum is a shared nothing, massively parallel processing (MPP) data warehouse system • Core principle of data computing is to move the processing dramatically closer to the data and to the people Fast Data Loading Extreme Performance Unified & Elastic Scalability Data Access © Copyright 2010 EMC Corporation. All rights reserved. 28
  • 29. MPP Shared-Nothing Architecture Greenplum’s Massively MapReduce Parallel Processing (MPP) Database has extreme scalability on general purpose Master systems Servers ... ... Query planning Automatic parallelization and dispatch – Load and query like any Network database Interconnect Scan and process in parallel Segment ... – Extremely scalable and I/O Servers ... optimized Storage and query ... ... ... ... ... ... ... ... ... ... Linear scalability by adding processing nodes External – Each adds storage, query Sources performance and loading MPP loading, streaming, etc. performance © Copyright 2011 EMC Corporation. All rights reserved. EMC Confidential – NDA Required 29
  • 30. EMC Hadoop. Open Source. Fully Supported By EMC. © Copyright 2010 EMC Corporation. All rights reserved. 30
  • 31. The EMC Big Data “Stack” 4 Collaborative Act Documentum xCP ? 3 Real Time Analyze Greenplum, Hadoop 2 Structured & Unstructured Store 1 Petabyte Scale Isilon and Atmos © Copyright 2010 EMC Corporation. All rights reserved. 31
  • 32. THANK YOU HAVE A GREAT CONFERENCE! © Copyright 2010 EMC Corporation. All rights reserved. 32