SlideShare une entreprise Scribd logo
1  sur  50
Analytics Education
in the Era of Big Data
    Gregory Piatetsky
       KDnuggets


         © KDnuggets 2013   1
Outline
• Analytics, Data Mining, Data Science - What
  do we call it?
• Big Data Trends
• Jobs and Skills
• Analytics Education overview




                    © KDnuggets 2013            2
What do we call it?

   Buzzwords and Trends
    Useful for Marketing
“Analytics” has staying power

           (c) KDnuggets 2013   3
What do we call it?
• Statistics             Same Core Idea:
• Data mining            Finding Useful
• Knowledge Discovery in
  Data (KDD)             Patterns in Data
•   Predictive Analytics
•   Business Analytics
•   Data Science                       Different
•   Data Analytics                     Emphasis
•   …?

                           © KDnuggets 2013        4
Pre-history (1900-2000): Statistics




  statistics is the biggest term in 20th century,
  Analytics is used increasingly thru 20th century
  data mining appears in late 1990s
          From Google Ngram viewer – English language books
          Search case sensitive – used most popular spelling.

                                 © KDnuggets 2012               5
20th Century
     Analytics vs Data Mining


                                                     data mining




                                                     Data Mining
                                     Analytics

                                      analytics

??
         Google N-grams search is case sensitive;
         Note: “data mining” > “Data Mining” usage
         While “analytics” < “Analytics”
                  © KDnuggets 2012                                 6
Recent History: 1980-2008
                                                                    data mining




                         Analytics

                     analytics                                Knowledge Discovery

“analytics” has been used since 1900, but started to rise in 2005
“data mining” surges around 1995 (soon after first KDD conference) but slowly
declines after 2003 (TIA controversy, associated with Govt invasion of privacy).
“Knowledge Discovery” appears in 1989, rises in 1996, and plateaus in 2000
(Google N-grams, smoothing =1)
                                  © KDnuggets 2012                                 7
Google Trends:
After 2006, Analytics > Data Mining




             Global – all regions
               (c) KDnuggets 2012     8
>50% of “Analytics” searches are for
        “Google Analytics”




Google Analytics introduced,
Dec 2005
                               (c) KDnuggets 2012   9
Google Trends observations
         (as of Jan 2013) Decline in analytics in
                                                  2012?
data mining: 16   analytics -google: 54




        Competing on Analytics book,
        Tom Davenport, Apr 2007                 Vacation drops
                           (c) KDnuggets 2013
Global View: searches for
data mining, analytics -google




        Google Trends

               (c) KDnuggets 2013   11
Google Trends: USA, 2012
  For “analytics – google –web –adsense”




                  © KDnuggets 2013
                                           12
Google Trends:
USA, Regional Interest




        © KDnuggets 2013   13
Google Trends:
USA, Analytics-related terms




           © KDnuggets 2013    14
Analytics:
Business > Data> Predictive > Text




       Google Insights, Jan 2007- Sep 2012, Global
                     (c) KDnuggets 2012              15
Big Data > Data Mining >
* Analytics > Data Science




                                      “Big Data” Surge




   Google Trends search, Jan 2007- Dec 2012, USA

                 (c) KDnuggets 2013                      16
Big Data Trends




    (c) KDnuggets 2013   17
3 Vs of Big Data
• Volume
  – Gigabytes to Terabytes to Petabytes …
• Velocity
  – online streaming
• Variety
  – numbers, text, links, images, audio, video, …




                       (c) KDnuggets 2013           18
Volume + Velocity => No consistency
• CAP Theorem (Eric Brewer, 2000)
  For highly scalable distributed systems, you can only
    have two of following:
  – 1) consistency,
  – 2) high availability, and
  – 3) (network) partition tolerance (network failure tolerance)
   http://www.julianbrowne.com/article/viewer/brewers-cap-
     theorem

  Implication: Big data solutions must stop worrying
    about consistency if they want high availability

                           (c) KDnuggets 2013                      19
Big Data
• 2nd Industrial Revolution

• Do old activities better

• Create new activities/businesses




                     (c) KDnuggets 2013   20
Doing Old Things Better
“Classical” Analytics Application areas
  – Churn prediction
  – Direct marketing/Customer modeling
  – Recommendations
  – Fraud detection
  – Security/Intelligence
  –…
• Competition will level companies

                     (c) KDnuggets 2013   21
Limit to Predicting Human Behavior?
• There is randomness in human behavior and
  once we find first-level effects, there are
  diminishing returns in prediction on individual
  level
• Many examples: Netflix Prize, Customer
  modeling…

Gregory Piatetsky-Shapiro, Big Data Hype and Reality,
  Harvard Business Review blog, Oct 2012

                       (c) KDnuggets 2013               22
Netflix Prize Progress
The most advanced algorithms were only a few
  percentages better than basic algorithms




                   © KDnuggets 2013            23
Many Customer Modeling Tasks have
            Similar Lift
                    Actual lift(T)    Est. lift(T)                Lift(T) ~ T -0.5 = sqrt (1/T)
       14
       12
                                                                  See G. Piatetsky-Shapiro,
       10                                                         B. Masand, Estimating
        8                                                         Campaign Benefits and
Lift




        6                                                         Modeling Lift,
        4                                                         Proceedings of KDD-99
        2
                                                                  Conference
        0
            0   5       10           15           20         25
                          100*T%




                                          © KDnuggets 2013                                 24
Big Data Bubble?                  Gartner VP says Big Data
                                  is Falling into the Trough
                                  of Disillusionment, Jan
                                  2013


   Big Data




   Gartner Hype Cycle

                                                          25
               © 2013 KDnuggets
Big Data Enables New Things !
– Google – first big success of big data
– Social networks (Facebook, Twitter, LinkedIn, …)
  success depends on network size, i.e. big data

– Location analytics
– Health-care
   • Personalized medicine
– Semantics and AI ?
   • Imagine IBM Watson, Siri in 2020 ?
– Beware of Loss of privacy

                       (c) KDnuggets 2012            26
Largest Dataset Analyzed?
                                                    Big Data Miners – elite
                                                    group

                                                    2012 median dataset size
                                                    ~20-40 GB,
                                                    vs 10-20 GB in 2011.




www.KDnuggets.com/polls/2012/largest-dataset-analyzed-data-mined.html
                             © KDnuggets 2012                             27
Digital Universe in 2020

                                            40,000 exabytes by 2020

                                            1 Exabyte = 1018 bytes
                                            = 1,000,000 TB




   Source : IDC Study The Digital Universe in 2020
                  © KDnuggets 2013                               28
Where in the World is Big Data?
                                Most of Big Data
                                is outside the US




             © KDnuggets 2012                       29
Opportunities for Big Data
• Social networks
• Surveillance video
• Embedded and medical devices
  – M2M data, logs
• Entertainment and social media
• Consumer images – image recognition,
  labeling, …


                     © KDnuggets 2012    30
Where did you apply Analytics/Data Mining?

                                                  Avg. Number of Industries 2.6

                                                  Most Popular:
                                                  - CRM/Consumer analytics
                                                  - Health care/ HR,
                                                  - Retail
                                                  - Banking
                                                  - Education

                                                  Highest growth in:
                                                  1. Advertising, 89.0%
                                                  2. Search / Web content
                                                      mining, 55.1%
                                                  3. Retail, 40.6%
                                                  4. Other, 36.9%
                                                  5. Manufacturing, 35.7%

www.KDnuggets.com/polls/2012/where-applied-analytics-data-mining.html
                               © KDnuggets 2013                              31
Untapped Big Data Gap




 Big limitation is lack of Analytic Talent
              © KDnuggets 2013               32
JOBS AND SKILLS



    (c) KDnuggets 2011   33
Shortage of Skills
• McKinsey: shortage by 2018 in the US of
  – 140-190,000 people with deep analytical skills

  – 1.5 M managers/analysts with the know-how to
    use the analysis of big data to make effective
    decisions.

  Source:
    www.mckinsey.com/mgi/publications/big_data/

                      (c) KDnuggets 2012             34
Indeed.com fastest growing jobs
Top 10 skills:
• HTML5                             Hadoop
• MongoDB
• iOS
• Android                                    MongoDB
• Mobile app
• Puppet
• Hadoop
• jQuery
• PaaS
• Social Media

                 © KDnuggets 2012               35
“Big Data” grows faster than MongoDB

                                  Big Data

                                  Hadoop




                                  MongoDB




               © KDnuggets 2012        36
Data Mining >> Hadoop




        (c) KDnuggets 2013   37
Demand for Data Scientists surging

                                                           “Data Scientist”

                                                           Fastest growing term on
                                                           www.kdnuggets.com/jobs

                                                           1% of jobs in 2010

                                                           4% of jobs in 2011

                                                           19% of jobs in 2012



 Data Scientist – sexiest job of the 21st Century (???)
 say Thomas H. Davenport and D.J. Patil, (HBR, Oct 2012)

                               © KDnuggets 2013                                  38
What is a Data Scientist
My definition: A combination of MBA, a Statistician, and a Hacker




                                                         Drew Conway:
                                                         http://www.drewconway.co
                                                         m/zia/?p=2378


                                  © KDnuggets 2013                            39
Rebranding from
“Data Mining” to “Big Data”


                  Data Mining




                                                  Big Data

                                                             Data Scientist


  “Data mining” jobs are much more common, but
  “Big Data” jobs are surging much faster than “Data Scientist”

                    (c) KDnuggets 2011                                40
LinkedIn Analytics/Data Mining Skills
                                    “Ground” analytics
                                    skills most common

                                    “Cloud” analytics
                                    skills grow
                                    fastest

                                    Text Analytics skills
                                    less common
                                    Sentiment Analysis
                                    – fastest growing




               (c) KDnuggets 2012                       41
Analytics Education




      © KDnuggets 2013   42
Analytics Education: USA/Canada
US: Northeast | US: South | US: Midwest | US: West | Canada
     Northeast
•    Connecticut: Central Connecticut State University (CCSU), exploring cutting-edge data mining techniques
     and Applications. New Britain, CT.
•     U. Conn. MS in Business Analytics and Project Management, designed to meet the growing demand for
     professionals who can harness advanced business analytics and project management skills. Hartford, CT.
•    Maryland: U. of Maryland MS in Business for Marketing Analytics, will help you learn how to harness and
     process massive amounts of data to help design products, predict the effects of marketing campaigns, and
     better understand your customers. Fall 2013. College Park, MD.
•    Massachusetts: Bentley Master of Science in Marketing Analytics, teaches students how to become more
     engaged with consumers, how to design and deliver robust statistical analysis, and how to effectively
     communicate the resulting insights. Waltham, MA.
•     Harvard Masters of Science in Computational Science and Engineering, including a major focus on
     machine learning and analyzing and visualizing very large data sets. Cambridge, MA.
•    New Jersey: Rutgers Master of Business and Science (MBS) in Analytics, prepares students for data-driven
     decision making; brings together fields of data management, statistics, machine learning and
     computation. New Brunswick, NJ.
•    Stevens Institute of Technology Master of Science - Business Intelligence & Analytics, Hoboken, NJ.
•    New York: Columbia MS in Computer Science, concentration in Machine Learning, New York, NY.
•     NYU Master of Science in Business Analytics, starting May 2013, New York, NY.
•     NYU MBA with specialization in Business Analytics, New York, NY.
•    ….
Full list at http://www.kdnuggets.com/education/usa-canada.html


                                              © KDnuggets 2013                                             43
www.kdnuggets.com/education/usa-canada.html
Analytics Education: Online
•   Big Data University, offering online classes on Hadoop and DB2.
•   Caltech Learning from Data course, free, broadcast online Apr-May 2012.
•   CMU Open Learning Initiative, including on-line courses in statistics, math & logic.
•   Coursera, offering online classes from Stanford and other top universities.
    Check especially Statistics, Data Analysis, and Scientific Computing courses.
•   Data Mining Tools Tutorials, covering Data Mining, Probability, Weka, R, and numerous commercial data mining tools.
•   EMC Data Science and Big Data Analytics open course.
•   LearnAnalytics India, delivering SAS and Advanced Analytics trainings online and offline.
•   Northwestern University Online Master of Science in Predictive Analytics, skills for leadership in a growing
    Field.
•   Oxford Advanced Diploma in Data and Systems Analysis, a one-year online course.
•    Stanford Center for Professional Education, offers certificate programs for managers and professionals in Data
    Mining and Applications and many related areas.
•   Statistics.com, offering on-line short courses in statistics and data mining.
•   UCI: U. of California, Irvine Extension, Predictive Analytics Certificate Program, a comprehensive online
    program.
•   UC San Diego Data Mining Courses, part of Data Analysis study area.
•   Udacity, online university founded by David Evans and Sebastian Thrun.
•   Video lectures from conferences, workshops and the scientific lectures in the areas of machine learning, data and text
    mining, and semantic web.


Full list at http://www.kdnuggets.com/education/online.html




                                                     © KDnuggets 2013                                                        44
Analytics Certificates
•   Business Analytics Certificates from BeyeUniversity, sponsored by Sybase.
•   Business Analytics Certificate from Statistics.com, become your company's expert on forecasting, customer segmentation, consumer
    behavior, and risk analysis.
•   Central Michigan University Graduate certificate in Data Mining, with SAS. Mount Pleasant, MI, USA
•   Data Mining Certificate from Statistics.com, master the secrets of teasing powerful information from large data sets to predict customer
    behavior, identify likely high value customers, visualize high dimensional data, and convert text to minable data.
•   EMC Data Science and Big Data Analytics open curriculum-based Education and Certification.
•    Indiana U. Kelley School of Business Business Analytics Certificate Program, Bloomington, IN, USA.
•   INFORMS Analytics Certification, coming April 2013.
•   NJIT (New Jersey Institute of Technology) The Graduate Certificate in Data Mining, online and in class.
•   Nova Southeastern University Graduate Certificate In Business Intelligence / Analytics, Fort Lauderdale, FL.
•   Stanford Center for Professional Education, offers certificate programs for managers and professionals in Data Mining and Applications and
    Quantitative Methods in Finance and Risk Management.
•   Statistics.com Institute for Statistics Education, offers certificates in Data Analytics, Biostatistics, Social Science, and Using R.
•   UCI: U. of California, Irvine Extension, Predictive Analytics Certificate Program, a comprehensive online program.
•   UCSD Data Mining certificate program, San Diego, CA.
•    University of Delaware Certificate in Analytics: Optimizing Big Data, Newark, DE, USA.
•   U. of Washington Certificate in Data Science, Seattle, WA, USA.
•   SAS Certificate Programs


Full list http://www.kdnuggets.com/education/analytics-data-mining-certificates.html



                                                            © KDnuggets 2013                                                                 45
Analytics Education by Doing
• Competitions – learn by doing
  – Kaggle and more
• Kaggle beginner competitions
• Kaggle in class : free to Instructors from any
  course dealing with data analysis




                      © KDnuggets 2013             46
Online Education – Free Courses
Data Science pseudo degree from
• Lower-Division Courses
   –   Data Science 101 – Statistics One
   –   Data Science 102 – Computing for Data Analysis (R)
   –   Data Science 103 – Data Analysis
   –   Data Science 104 – Introduction to Data Science
• Upper-Division Courses
   – Data Science 201 – Machine Learning I
   – Data Science 202 – Machine Learning II
   – Data Science 203 – Neural Networks for Machine Learning
• Graduate Courses
   – Data Science 301 – Learning from Data (Caltech course CS101)
   – Data Science 302 – Machine Learning III (MIT course 6.867)


                                © KDnuggets 2013                    47
Analytics Education:
 Practical Knowledge > Degree Prestige
Paco Nathan - School prestige matters some to hiring managers. Several top schools
   are known to have excellent programs and track records: Stanford, CMU, U
   Washington, UC Berkeley, Harvard, Johns Hopkins, etc. However, keep in mind that
   that list is not entirely representative. For example, there are nearly a dozen
   relevant programs at Stanford, which has produced Google, Yahoo, etc., while the
   list only mentions one program.

More to the point, about half of my peers in this field have backgrounds in Physics or
  physical science/physical engineering -- and I tend to hire from those programs
  more so than from CS programs, because the grad students tend to have both the
  math/stats depth plus practice with real-world frameworks like R, Matlab, etc.
  Having a really solid background in applying statistics at scale, some knack for data
  visualization, plus good programming chops -- those skills will trump a PhD in
  Machine Learning.

http://www.linkedin.com/groups/Graduate-Programs-in-Big-Data-
    2013423.S.203868026?view=&srchtype=discussedNews&gid=2013423&item=203
    868026&type=member&trk=eml-anet_dig-b_pd-ttl-cn&ut=2Xc36Ur0k0zBA1


                                     © KDnuggets 2013                                 48
Analytics Education Boom




2012 was a peak year for starting Analytics Degree Programs in the US


                           © KDnuggets 2013                             49
Questions?
Analytics Education Overview
www.kdnuggets.com/education/

Subscribe to KDnuggets News email at
  www.KDnuggets.com/subscribe.html

• Follow @kdnuggets on Twitter

• Email to editor1@kdnuggets.com
                  © KDnuggets 2013     50

Contenu connexe

Tendances

Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
Introduction to Big Data and AI for Business Analytics and Prediction
Introduction to Big Data and AI for Business Analytics and PredictionIntroduction to Big Data and AI for Business Analytics and Prediction
Introduction to Big Data and AI for Business Analytics and PredictionJongwook Woo
 
A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)Prof. Dr. Diego Kuonen
 
Introduction to Big Data and its Trends
Introduction to Big Data and its TrendsIntroduction to Big Data and its Trends
Introduction to Big Data and its TrendsJongwook Woo
 
Traffic Data Analysis and Prediction using Big Data
Traffic Data Analysis and Prediction using Big DataTraffic Data Analysis and Prediction using Big Data
Traffic Data Analysis and Prediction using Big DataJongwook Woo
 
History and Trend of Big Data and Deep Learning
History and Trend of Big Data and Deep LearningHistory and Trend of Big Data and Deep Learning
History and Trend of Big Data and Deep LearningJongwook Woo
 
Rating Prediction using Deep Learning and Spark
Rating Prediction using Deep Learning and SparkRating Prediction using Deep Learning and Spark
Rating Prediction using Deep Learning and SparkJongwook Woo
 
Introduction to Big Data: Smart Factory
Introduction to Big Data: Smart FactoryIntroduction to Big Data: Smart Factory
Introduction to Big Data: Smart FactoryJongwook Woo
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learningGiuseppe Manco
 
Predictive Analysis of Financial Fraud Detection using Azure and Spark ML
Predictive Analysis of Financial Fraud Detection using Azure and Spark MLPredictive Analysis of Financial Fraud Detection using Azure and Spark ML
Predictive Analysis of Financial Fraud Detection using Azure and Spark MLJongwook Woo
 
Scalable Predictive Analysis and The Trend with Big Data & AI
Scalable Predictive Analysis and The Trend with Big Data & AIScalable Predictive Analysis and The Trend with Big Data & AI
Scalable Predictive Analysis and The Trend with Big Data & AIJongwook Woo
 
The Importance of Open Innovation in AI era
The Importance of Open Innovation in AI eraThe Importance of Open Innovation in AI era
The Importance of Open Innovation in AI eraJongwook Woo
 
Big Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsBig Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsChandan Rajah
 
Full-Stack Data Science: How to be a One-person Data Team
Full-Stack Data Science: How to be a One-person Data TeamFull-Stack Data Science: How to be a One-person Data Team
Full-Stack Data Science: How to be a One-person Data TeamGreg Goltsov
 
Data Scientist 101 BI Dutch
Data Scientist 101 BI DutchData Scientist 101 BI Dutch
Data Scientist 101 BI DutchJos van Dongen
 
Big Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our LivesBig Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our LivesRukshan Batuwita
 
8 minute intro to data science
8 minute intro to data science 8 minute intro to data science
8 minute intro to data science Mahesh Kumar CV
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data ScienceEdureka!
 
(Big) Data as the Fuel and Analytics as the Engine of the Digital Transformation
(Big) Data as the Fuel and Analytics as the Engine of the Digital Transformation(Big) Data as the Fuel and Analytics as the Engine of the Digital Transformation
(Big) Data as the Fuel and Analytics as the Engine of the Digital TransformationProf. Dr. Diego Kuonen
 

Tendances (20)

Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Introduction to Big Data and AI for Business Analytics and Prediction
Introduction to Big Data and AI for Business Analytics and PredictionIntroduction to Big Data and AI for Business Analytics and Prediction
Introduction to Big Data and AI for Business Analytics and Prediction
 
A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)
 
Introduction to Big Data and its Trends
Introduction to Big Data and its TrendsIntroduction to Big Data and its Trends
Introduction to Big Data and its Trends
 
Traffic Data Analysis and Prediction using Big Data
Traffic Data Analysis and Prediction using Big DataTraffic Data Analysis and Prediction using Big Data
Traffic Data Analysis and Prediction using Big Data
 
History and Trend of Big Data and Deep Learning
History and Trend of Big Data and Deep LearningHistory and Trend of Big Data and Deep Learning
History and Trend of Big Data and Deep Learning
 
Rating Prediction using Deep Learning and Spark
Rating Prediction using Deep Learning and SparkRating Prediction using Deep Learning and Spark
Rating Prediction using Deep Learning and Spark
 
Introduction to Big Data: Smart Factory
Introduction to Big Data: Smart FactoryIntroduction to Big Data: Smart Factory
Introduction to Big Data: Smart Factory
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learning
 
Predictive Analysis of Financial Fraud Detection using Azure and Spark ML
Predictive Analysis of Financial Fraud Detection using Azure and Spark MLPredictive Analysis of Financial Fraud Detection using Azure and Spark ML
Predictive Analysis of Financial Fraud Detection using Azure and Spark ML
 
Intro to Data Science Concepts
Intro to Data Science ConceptsIntro to Data Science Concepts
Intro to Data Science Concepts
 
Scalable Predictive Analysis and The Trend with Big Data & AI
Scalable Predictive Analysis and The Trend with Big Data & AIScalable Predictive Analysis and The Trend with Big Data & AI
Scalable Predictive Analysis and The Trend with Big Data & AI
 
The Importance of Open Innovation in AI era
The Importance of Open Innovation in AI eraThe Importance of Open Innovation in AI era
The Importance of Open Innovation in AI era
 
Big Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsBig Data Science: Intro and Benefits
Big Data Science: Intro and Benefits
 
Full-Stack Data Science: How to be a One-person Data Team
Full-Stack Data Science: How to be a One-person Data TeamFull-Stack Data Science: How to be a One-person Data Team
Full-Stack Data Science: How to be a One-person Data Team
 
Data Scientist 101 BI Dutch
Data Scientist 101 BI DutchData Scientist 101 BI Dutch
Data Scientist 101 BI Dutch
 
Big Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our LivesBig Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our Lives
 
8 minute intro to data science
8 minute intro to data science 8 minute intro to data science
8 minute intro to data science
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
 
(Big) Data as the Fuel and Analytics as the Engine of the Digital Transformation
(Big) Data as the Fuel and Analytics as the Engine of the Digital Transformation(Big) Data as the Fuel and Analytics as the Engine of the Digital Transformation
(Big) Data as the Fuel and Analytics as the Engine of the Digital Transformation
 

En vedette

Big Data Startups - Top Visualization and Data Analytics Startups
Big Data Startups - Top Visualization and Data Analytics StartupsBig Data Startups - Top Visualization and Data Analytics Startups
Big Data Startups - Top Visualization and Data Analytics Startupswallesplace
 
User and IoT Data Analytics
User and IoT Data AnalyticsUser and IoT Data Analytics
User and IoT Data AnalyticsEricsson
 
Big-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunitiesBig-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunities台灣資料科學年會
 
Why apache Flink is the 4G of Big Data Analytics Frameworks
Why apache Flink is the 4G of Big Data Analytics FrameworksWhy apache Flink is the 4G of Big Data Analytics Frameworks
Why apache Flink is the 4G of Big Data Analytics FrameworksSlim Baltagi
 
Edwards using data to drive instruction
Edwards   using data to drive instructionEdwards   using data to drive instruction
Edwards using data to drive instructionamliterature
 
Big Data in the Online Classroom
Big Data in the Online ClassroomBig Data in the Online Classroom
Big Data in the Online ClassroomJames Howard
 
Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)Marina Santini
 
Student analytics-fact-based-student-services deloitte-nl-data-analyse
Student analytics-fact-based-student-services deloitte-nl-data-analyseStudent analytics-fact-based-student-services deloitte-nl-data-analyse
Student analytics-fact-based-student-services deloitte-nl-data-analyseRoel Palmaers
 
NCME Big Data in Education
NCME Big Data  in EducationNCME Big Data  in Education
NCME Big Data in EducationPhilip Piety
 
How to Use Data to Drive Instruction
How to Use Data to Drive InstructionHow to Use Data to Drive Instruction
How to Use Data to Drive InstructionDes Floyd
 
EDUCAUSE 2016 Analytics and Student Success: Research & Benchmarking
EDUCAUSE 2016 Analytics and Student Success: Research & BenchmarkingEDUCAUSE 2016 Analytics and Student Success: Research & Benchmarking
EDUCAUSE 2016 Analytics and Student Success: Research & BenchmarkingJeffrey Pomerantz, Ph.D.
 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role LabelingMarina Santini
 
Lecture: Semantic Word Clouds
Lecture: Semantic Word CloudsLecture: Semantic Word Clouds
Lecture: Semantic Word CloudsMarina Santini
 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational SemanticsMarina Santini
 
Data Mining and Text Mining in Educational Research
Data Mining and Text Mining in Educational ResearchData Mining and Text Mining in Educational Research
Data Mining and Text Mining in Educational ResearchQiang Hao
 
Lecture 3b: Decision Trees (1 part)
Lecture 3b: Decision Trees (1 part)Lecture 3b: Decision Trees (1 part)
Lecture 3b: Decision Trees (1 part) Marina Santini
 
Data Driven Instruction
Data Driven InstructionData Driven Instruction
Data Driven InstructionTman
 

En vedette (20)

Data mining
Data miningData mining
Data mining
 
Big Data Startups - Top Visualization and Data Analytics Startups
Big Data Startups - Top Visualization and Data Analytics StartupsBig Data Startups - Top Visualization and Data Analytics Startups
Big Data Startups - Top Visualization and Data Analytics Startups
 
User and IoT Data Analytics
User and IoT Data AnalyticsUser and IoT Data Analytics
User and IoT Data Analytics
 
Big-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunitiesBig-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunities
 
Introduction to Data Mining and Big Data Analytics
Introduction to Data Mining and Big Data AnalyticsIntroduction to Data Mining and Big Data Analytics
Introduction to Data Mining and Big Data Analytics
 
Why apache Flink is the 4G of Big Data Analytics Frameworks
Why apache Flink is the 4G of Big Data Analytics FrameworksWhy apache Flink is the 4G of Big Data Analytics Frameworks
Why apache Flink is the 4G of Big Data Analytics Frameworks
 
Edwards using data to drive instruction
Edwards   using data to drive instructionEdwards   using data to drive instruction
Edwards using data to drive instruction
 
Using Data to Drive Instruction
Using Data to Drive InstructionUsing Data to Drive Instruction
Using Data to Drive Instruction
 
Big Data in the Online Classroom
Big Data in the Online ClassroomBig Data in the Online Classroom
Big Data in the Online Classroom
 
Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)
 
Student analytics-fact-based-student-services deloitte-nl-data-analyse
Student analytics-fact-based-student-services deloitte-nl-data-analyseStudent analytics-fact-based-student-services deloitte-nl-data-analyse
Student analytics-fact-based-student-services deloitte-nl-data-analyse
 
NCME Big Data in Education
NCME Big Data  in EducationNCME Big Data  in Education
NCME Big Data in Education
 
How to Use Data to Drive Instruction
How to Use Data to Drive InstructionHow to Use Data to Drive Instruction
How to Use Data to Drive Instruction
 
EDUCAUSE 2016 Analytics and Student Success: Research & Benchmarking
EDUCAUSE 2016 Analytics and Student Success: Research & BenchmarkingEDUCAUSE 2016 Analytics and Student Success: Research & Benchmarking
EDUCAUSE 2016 Analytics and Student Success: Research & Benchmarking
 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role Labeling
 
Lecture: Semantic Word Clouds
Lecture: Semantic Word CloudsLecture: Semantic Word Clouds
Lecture: Semantic Word Clouds
 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational Semantics
 
Data Mining and Text Mining in Educational Research
Data Mining and Text Mining in Educational ResearchData Mining and Text Mining in Educational Research
Data Mining and Text Mining in Educational Research
 
Lecture 3b: Decision Trees (1 part)
Lecture 3b: Decision Trees (1 part)Lecture 3b: Decision Trees (1 part)
Lecture 3b: Decision Trees (1 part)
 
Data Driven Instruction
Data Driven InstructionData Driven Instruction
Data Driven Instruction
 

Similaire à Analytics Education in the era of Big Data

Data Scientists: Your Must-Have Business Investment
Data Scientists: Your Must-Have Business InvestmentData Scientists: Your Must-Have Business Investment
Data Scientists: Your Must-Have Business InvestmentKalido
 
Cognitive/AI: views, perspectives & directions
Cognitive/AI: views, perspectives & directionsCognitive/AI: views, perspectives & directions
Cognitive/AI: views, perspectives & directionsIDC Italy
 
Big Data, Big True
Big Data, Big TrueBig Data, Big True
Big Data, Big TrueIDC Italy
 
Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...Neo4j
 
Big Data – Is it a hype or for real?
 Big Data – Is it a hype or for real?  Big Data – Is it a hype or for real?
Big Data – Is it a hype or for real? Dirk Ortloff
 
Big Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future FoundationBig Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future FoundationForesight Factory
 
An Introduction to Big Data
An Introduction to Big DataAn Introduction to Big Data
An Introduction to Big DataeXascale Infolab
 
How to design ai functions to the cloud native infra
How to design ai functions to the cloud native infraHow to design ai functions to the cloud native infra
How to design ai functions to the cloud native infraChun Myung Kyu
 
The Data Effect: Canadian Big Data & Analytics Update - Dr. Alison Brooks Dir...
The Data Effect: Canadian Big Data & Analytics Update - Dr. Alison Brooks Dir...The Data Effect: Canadian Big Data & Analytics Update - Dr. Alison Brooks Dir...
The Data Effect: Canadian Big Data & Analytics Update - Dr. Alison Brooks Dir...CityAge
 
Telco Big Data Workshop Sample
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop SampleAlan Quayle
 
ML Production Pipelines: A Classification Model
ML Production Pipelines: A Classification ModelML Production Pipelines: A Classification Model
ML Production Pipelines: A Classification ModelDatabricks
 
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Michael Lew
 
From Paris Hilton to Walmart: welcome to the Big Data Revolution
From Paris Hilton to Walmart: welcome to the Big Data RevolutionFrom Paris Hilton to Walmart: welcome to the Big Data Revolution
From Paris Hilton to Walmart: welcome to the Big Data RevolutionWilliam Visterin
 
Big Data in Media
Big Data in MediaBig Data in Media
Big Data in MediaKris Tuttle
 
IDC Perspectives on Big Data Outside of HPC
IDC Perspectives on Big Data Outside of HPCIDC Perspectives on Big Data Outside of HPC
IDC Perspectives on Big Data Outside of HPCinside-BigData.com
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Yaman Hajja, Ph.D.
 
Carlo Colicchio: Big Data for business
Carlo Colicchio: Big Data for businessCarlo Colicchio: Big Data for business
Carlo Colicchio: Big Data for businessCarlo Vaccari
 
Big Data Meets Customer Profitability Analytics
Big Data Meets Customer Profitability AnalyticsBig Data Meets Customer Profitability Analytics
Big Data Meets Customer Profitability AnalyticsDATAVERSITY
 
Disruptive Technology and Data
Disruptive Technology and DataDisruptive Technology and Data
Disruptive Technology and DataJLL
 
Big data driven decisions bnieme nyenrode
Big data driven decisions bnieme nyenrodeBig data driven decisions bnieme nyenrode
Big data driven decisions bnieme nyenrodeBob Nieme
 

Similaire à Analytics Education in the era of Big Data (20)

Data Scientists: Your Must-Have Business Investment
Data Scientists: Your Must-Have Business InvestmentData Scientists: Your Must-Have Business Investment
Data Scientists: Your Must-Have Business Investment
 
Cognitive/AI: views, perspectives & directions
Cognitive/AI: views, perspectives & directionsCognitive/AI: views, perspectives & directions
Cognitive/AI: views, perspectives & directions
 
Big Data, Big True
Big Data, Big TrueBig Data, Big True
Big Data, Big True
 
Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...
 
Big Data – Is it a hype or for real?
 Big Data – Is it a hype or for real?  Big Data – Is it a hype or for real?
Big Data – Is it a hype or for real?
 
Big Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future FoundationBig Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future Foundation
 
An Introduction to Big Data
An Introduction to Big DataAn Introduction to Big Data
An Introduction to Big Data
 
How to design ai functions to the cloud native infra
How to design ai functions to the cloud native infraHow to design ai functions to the cloud native infra
How to design ai functions to the cloud native infra
 
The Data Effect: Canadian Big Data & Analytics Update - Dr. Alison Brooks Dir...
The Data Effect: Canadian Big Data & Analytics Update - Dr. Alison Brooks Dir...The Data Effect: Canadian Big Data & Analytics Update - Dr. Alison Brooks Dir...
The Data Effect: Canadian Big Data & Analytics Update - Dr. Alison Brooks Dir...
 
Telco Big Data Workshop Sample
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop Sample
 
ML Production Pipelines: A Classification Model
ML Production Pipelines: A Classification ModelML Production Pipelines: A Classification Model
ML Production Pipelines: A Classification Model
 
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
 
From Paris Hilton to Walmart: welcome to the Big Data Revolution
From Paris Hilton to Walmart: welcome to the Big Data RevolutionFrom Paris Hilton to Walmart: welcome to the Big Data Revolution
From Paris Hilton to Walmart: welcome to the Big Data Revolution
 
Big Data in Media
Big Data in MediaBig Data in Media
Big Data in Media
 
IDC Perspectives on Big Data Outside of HPC
IDC Perspectives on Big Data Outside of HPCIDC Perspectives on Big Data Outside of HPC
IDC Perspectives on Big Data Outside of HPC
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)
 
Carlo Colicchio: Big Data for business
Carlo Colicchio: Big Data for businessCarlo Colicchio: Big Data for business
Carlo Colicchio: Big Data for business
 
Big Data Meets Customer Profitability Analytics
Big Data Meets Customer Profitability AnalyticsBig Data Meets Customer Profitability Analytics
Big Data Meets Customer Profitability Analytics
 
Disruptive Technology and Data
Disruptive Technology and DataDisruptive Technology and Data
Disruptive Technology and Data
 
Big data driven decisions bnieme nyenrode
Big data driven decisions bnieme nyenrodeBig data driven decisions bnieme nyenrode
Big data driven decisions bnieme nyenrode
 

Dernier

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Dernier (20)

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

Analytics Education in the era of Big Data

  • 1. Analytics Education in the Era of Big Data Gregory Piatetsky KDnuggets © KDnuggets 2013 1
  • 2. Outline • Analytics, Data Mining, Data Science - What do we call it? • Big Data Trends • Jobs and Skills • Analytics Education overview © KDnuggets 2013 2
  • 3. What do we call it? Buzzwords and Trends Useful for Marketing “Analytics” has staying power (c) KDnuggets 2013 3
  • 4. What do we call it? • Statistics Same Core Idea: • Data mining Finding Useful • Knowledge Discovery in Data (KDD) Patterns in Data • Predictive Analytics • Business Analytics • Data Science Different • Data Analytics Emphasis • …? © KDnuggets 2013 4
  • 5. Pre-history (1900-2000): Statistics statistics is the biggest term in 20th century, Analytics is used increasingly thru 20th century data mining appears in late 1990s From Google Ngram viewer – English language books Search case sensitive – used most popular spelling. © KDnuggets 2012 5
  • 6. 20th Century Analytics vs Data Mining data mining Data Mining Analytics analytics ?? Google N-grams search is case sensitive; Note: “data mining” > “Data Mining” usage While “analytics” < “Analytics” © KDnuggets 2012 6
  • 7. Recent History: 1980-2008 data mining Analytics analytics Knowledge Discovery “analytics” has been used since 1900, but started to rise in 2005 “data mining” surges around 1995 (soon after first KDD conference) but slowly declines after 2003 (TIA controversy, associated with Govt invasion of privacy). “Knowledge Discovery” appears in 1989, rises in 1996, and plateaus in 2000 (Google N-grams, smoothing =1) © KDnuggets 2012 7
  • 8. Google Trends: After 2006, Analytics > Data Mining Global – all regions (c) KDnuggets 2012 8
  • 9. >50% of “Analytics” searches are for “Google Analytics” Google Analytics introduced, Dec 2005 (c) KDnuggets 2012 9
  • 10. Google Trends observations (as of Jan 2013) Decline in analytics in 2012? data mining: 16 analytics -google: 54 Competing on Analytics book, Tom Davenport, Apr 2007 Vacation drops (c) KDnuggets 2013
  • 11. Global View: searches for data mining, analytics -google Google Trends (c) KDnuggets 2013 11
  • 12. Google Trends: USA, 2012 For “analytics – google –web –adsense” © KDnuggets 2013 12
  • 13. Google Trends: USA, Regional Interest © KDnuggets 2013 13
  • 14. Google Trends: USA, Analytics-related terms © KDnuggets 2013 14
  • 15. Analytics: Business > Data> Predictive > Text Google Insights, Jan 2007- Sep 2012, Global (c) KDnuggets 2012 15
  • 16. Big Data > Data Mining > * Analytics > Data Science “Big Data” Surge Google Trends search, Jan 2007- Dec 2012, USA (c) KDnuggets 2013 16
  • 17. Big Data Trends (c) KDnuggets 2013 17
  • 18. 3 Vs of Big Data • Volume – Gigabytes to Terabytes to Petabytes … • Velocity – online streaming • Variety – numbers, text, links, images, audio, video, … (c) KDnuggets 2013 18
  • 19. Volume + Velocity => No consistency • CAP Theorem (Eric Brewer, 2000) For highly scalable distributed systems, you can only have two of following: – 1) consistency, – 2) high availability, and – 3) (network) partition tolerance (network failure tolerance) http://www.julianbrowne.com/article/viewer/brewers-cap- theorem Implication: Big data solutions must stop worrying about consistency if they want high availability (c) KDnuggets 2013 19
  • 20. Big Data • 2nd Industrial Revolution • Do old activities better • Create new activities/businesses (c) KDnuggets 2013 20
  • 21. Doing Old Things Better “Classical” Analytics Application areas – Churn prediction – Direct marketing/Customer modeling – Recommendations – Fraud detection – Security/Intelligence –… • Competition will level companies (c) KDnuggets 2013 21
  • 22. Limit to Predicting Human Behavior? • There is randomness in human behavior and once we find first-level effects, there are diminishing returns in prediction on individual level • Many examples: Netflix Prize, Customer modeling… Gregory Piatetsky-Shapiro, Big Data Hype and Reality, Harvard Business Review blog, Oct 2012 (c) KDnuggets 2013 22
  • 23. Netflix Prize Progress The most advanced algorithms were only a few percentages better than basic algorithms © KDnuggets 2013 23
  • 24. Many Customer Modeling Tasks have Similar Lift Actual lift(T) Est. lift(T) Lift(T) ~ T -0.5 = sqrt (1/T) 14 12 See G. Piatetsky-Shapiro, 10 B. Masand, Estimating 8 Campaign Benefits and Lift 6 Modeling Lift, 4 Proceedings of KDD-99 2 Conference 0 0 5 10 15 20 25 100*T% © KDnuggets 2013 24
  • 25. Big Data Bubble? Gartner VP says Big Data is Falling into the Trough of Disillusionment, Jan 2013 Big Data Gartner Hype Cycle 25 © 2013 KDnuggets
  • 26. Big Data Enables New Things ! – Google – first big success of big data – Social networks (Facebook, Twitter, LinkedIn, …) success depends on network size, i.e. big data – Location analytics – Health-care • Personalized medicine – Semantics and AI ? • Imagine IBM Watson, Siri in 2020 ? – Beware of Loss of privacy (c) KDnuggets 2012 26
  • 27. Largest Dataset Analyzed? Big Data Miners – elite group 2012 median dataset size ~20-40 GB, vs 10-20 GB in 2011. www.KDnuggets.com/polls/2012/largest-dataset-analyzed-data-mined.html © KDnuggets 2012 27
  • 28. Digital Universe in 2020 40,000 exabytes by 2020 1 Exabyte = 1018 bytes = 1,000,000 TB Source : IDC Study The Digital Universe in 2020 © KDnuggets 2013 28
  • 29. Where in the World is Big Data? Most of Big Data is outside the US © KDnuggets 2012 29
  • 30. Opportunities for Big Data • Social networks • Surveillance video • Embedded and medical devices – M2M data, logs • Entertainment and social media • Consumer images – image recognition, labeling, … © KDnuggets 2012 30
  • 31. Where did you apply Analytics/Data Mining? Avg. Number of Industries 2.6 Most Popular: - CRM/Consumer analytics - Health care/ HR, - Retail - Banking - Education Highest growth in: 1. Advertising, 89.0% 2. Search / Web content mining, 55.1% 3. Retail, 40.6% 4. Other, 36.9% 5. Manufacturing, 35.7% www.KDnuggets.com/polls/2012/where-applied-analytics-data-mining.html © KDnuggets 2013 31
  • 32. Untapped Big Data Gap Big limitation is lack of Analytic Talent © KDnuggets 2013 32
  • 33. JOBS AND SKILLS (c) KDnuggets 2011 33
  • 34. Shortage of Skills • McKinsey: shortage by 2018 in the US of – 140-190,000 people with deep analytical skills – 1.5 M managers/analysts with the know-how to use the analysis of big data to make effective decisions. Source: www.mckinsey.com/mgi/publications/big_data/ (c) KDnuggets 2012 34
  • 35. Indeed.com fastest growing jobs Top 10 skills: • HTML5 Hadoop • MongoDB • iOS • Android MongoDB • Mobile app • Puppet • Hadoop • jQuery • PaaS • Social Media © KDnuggets 2012 35
  • 36. “Big Data” grows faster than MongoDB Big Data Hadoop MongoDB © KDnuggets 2012 36
  • 37. Data Mining >> Hadoop (c) KDnuggets 2013 37
  • 38. Demand for Data Scientists surging “Data Scientist” Fastest growing term on www.kdnuggets.com/jobs 1% of jobs in 2010 4% of jobs in 2011 19% of jobs in 2012 Data Scientist – sexiest job of the 21st Century (???) say Thomas H. Davenport and D.J. Patil, (HBR, Oct 2012) © KDnuggets 2013 38
  • 39. What is a Data Scientist My definition: A combination of MBA, a Statistician, and a Hacker Drew Conway: http://www.drewconway.co m/zia/?p=2378 © KDnuggets 2013 39
  • 40. Rebranding from “Data Mining” to “Big Data” Data Mining Big Data Data Scientist “Data mining” jobs are much more common, but “Big Data” jobs are surging much faster than “Data Scientist” (c) KDnuggets 2011 40
  • 41. LinkedIn Analytics/Data Mining Skills “Ground” analytics skills most common “Cloud” analytics skills grow fastest Text Analytics skills less common Sentiment Analysis – fastest growing (c) KDnuggets 2012 41
  • 42. Analytics Education © KDnuggets 2013 42
  • 43. Analytics Education: USA/Canada US: Northeast | US: South | US: Midwest | US: West | Canada Northeast • Connecticut: Central Connecticut State University (CCSU), exploring cutting-edge data mining techniques and Applications. New Britain, CT. • U. Conn. MS in Business Analytics and Project Management, designed to meet the growing demand for professionals who can harness advanced business analytics and project management skills. Hartford, CT. • Maryland: U. of Maryland MS in Business for Marketing Analytics, will help you learn how to harness and process massive amounts of data to help design products, predict the effects of marketing campaigns, and better understand your customers. Fall 2013. College Park, MD. • Massachusetts: Bentley Master of Science in Marketing Analytics, teaches students how to become more engaged with consumers, how to design and deliver robust statistical analysis, and how to effectively communicate the resulting insights. Waltham, MA. • Harvard Masters of Science in Computational Science and Engineering, including a major focus on machine learning and analyzing and visualizing very large data sets. Cambridge, MA. • New Jersey: Rutgers Master of Business and Science (MBS) in Analytics, prepares students for data-driven decision making; brings together fields of data management, statistics, machine learning and computation. New Brunswick, NJ. • Stevens Institute of Technology Master of Science - Business Intelligence & Analytics, Hoboken, NJ. • New York: Columbia MS in Computer Science, concentration in Machine Learning, New York, NY. • NYU Master of Science in Business Analytics, starting May 2013, New York, NY. • NYU MBA with specialization in Business Analytics, New York, NY. • …. Full list at http://www.kdnuggets.com/education/usa-canada.html © KDnuggets 2013 43 www.kdnuggets.com/education/usa-canada.html
  • 44. Analytics Education: Online • Big Data University, offering online classes on Hadoop and DB2. • Caltech Learning from Data course, free, broadcast online Apr-May 2012. • CMU Open Learning Initiative, including on-line courses in statistics, math & logic. • Coursera, offering online classes from Stanford and other top universities. Check especially Statistics, Data Analysis, and Scientific Computing courses. • Data Mining Tools Tutorials, covering Data Mining, Probability, Weka, R, and numerous commercial data mining tools. • EMC Data Science and Big Data Analytics open course. • LearnAnalytics India, delivering SAS and Advanced Analytics trainings online and offline. • Northwestern University Online Master of Science in Predictive Analytics, skills for leadership in a growing Field. • Oxford Advanced Diploma in Data and Systems Analysis, a one-year online course. • Stanford Center for Professional Education, offers certificate programs for managers and professionals in Data Mining and Applications and many related areas. • Statistics.com, offering on-line short courses in statistics and data mining. • UCI: U. of California, Irvine Extension, Predictive Analytics Certificate Program, a comprehensive online program. • UC San Diego Data Mining Courses, part of Data Analysis study area. • Udacity, online university founded by David Evans and Sebastian Thrun. • Video lectures from conferences, workshops and the scientific lectures in the areas of machine learning, data and text mining, and semantic web. Full list at http://www.kdnuggets.com/education/online.html © KDnuggets 2013 44
  • 45. Analytics Certificates • Business Analytics Certificates from BeyeUniversity, sponsored by Sybase. • Business Analytics Certificate from Statistics.com, become your company's expert on forecasting, customer segmentation, consumer behavior, and risk analysis. • Central Michigan University Graduate certificate in Data Mining, with SAS. Mount Pleasant, MI, USA • Data Mining Certificate from Statistics.com, master the secrets of teasing powerful information from large data sets to predict customer behavior, identify likely high value customers, visualize high dimensional data, and convert text to minable data. • EMC Data Science and Big Data Analytics open curriculum-based Education and Certification. • Indiana U. Kelley School of Business Business Analytics Certificate Program, Bloomington, IN, USA. • INFORMS Analytics Certification, coming April 2013. • NJIT (New Jersey Institute of Technology) The Graduate Certificate in Data Mining, online and in class. • Nova Southeastern University Graduate Certificate In Business Intelligence / Analytics, Fort Lauderdale, FL. • Stanford Center for Professional Education, offers certificate programs for managers and professionals in Data Mining and Applications and Quantitative Methods in Finance and Risk Management. • Statistics.com Institute for Statistics Education, offers certificates in Data Analytics, Biostatistics, Social Science, and Using R. • UCI: U. of California, Irvine Extension, Predictive Analytics Certificate Program, a comprehensive online program. • UCSD Data Mining certificate program, San Diego, CA. • University of Delaware Certificate in Analytics: Optimizing Big Data, Newark, DE, USA. • U. of Washington Certificate in Data Science, Seattle, WA, USA. • SAS Certificate Programs Full list http://www.kdnuggets.com/education/analytics-data-mining-certificates.html © KDnuggets 2013 45
  • 46. Analytics Education by Doing • Competitions – learn by doing – Kaggle and more • Kaggle beginner competitions • Kaggle in class : free to Instructors from any course dealing with data analysis © KDnuggets 2013 46
  • 47. Online Education – Free Courses Data Science pseudo degree from • Lower-Division Courses – Data Science 101 – Statistics One – Data Science 102 – Computing for Data Analysis (R) – Data Science 103 – Data Analysis – Data Science 104 – Introduction to Data Science • Upper-Division Courses – Data Science 201 – Machine Learning I – Data Science 202 – Machine Learning II – Data Science 203 – Neural Networks for Machine Learning • Graduate Courses – Data Science 301 – Learning from Data (Caltech course CS101) – Data Science 302 – Machine Learning III (MIT course 6.867) © KDnuggets 2013 47
  • 48. Analytics Education: Practical Knowledge > Degree Prestige Paco Nathan - School prestige matters some to hiring managers. Several top schools are known to have excellent programs and track records: Stanford, CMU, U Washington, UC Berkeley, Harvard, Johns Hopkins, etc. However, keep in mind that that list is not entirely representative. For example, there are nearly a dozen relevant programs at Stanford, which has produced Google, Yahoo, etc., while the list only mentions one program. More to the point, about half of my peers in this field have backgrounds in Physics or physical science/physical engineering -- and I tend to hire from those programs more so than from CS programs, because the grad students tend to have both the math/stats depth plus practice with real-world frameworks like R, Matlab, etc. Having a really solid background in applying statistics at scale, some knack for data visualization, plus good programming chops -- those skills will trump a PhD in Machine Learning. http://www.linkedin.com/groups/Graduate-Programs-in-Big-Data- 2013423.S.203868026?view=&srchtype=discussedNews&gid=2013423&item=203 868026&type=member&trk=eml-anet_dig-b_pd-ttl-cn&ut=2Xc36Ur0k0zBA1 © KDnuggets 2013 48
  • 49. Analytics Education Boom 2012 was a peak year for starting Analytics Degree Programs in the US © KDnuggets 2013 49
  • 50. Questions? Analytics Education Overview www.kdnuggets.com/education/ Subscribe to KDnuggets News email at www.KDnuggets.com/subscribe.html • Follow @kdnuggets on Twitter • Email to editor1@kdnuggets.com © KDnuggets 2013 50

Notes de l'éditeur

  1. Boris Evelson, Forrester also adds 4th V – Variability (meaning not constant)
  2. Churn: bestalgorithms for predicting churn have lift of 5-7 – 5-7 times better than random. Behavioral advertising: 2-3% CTR – 10 times better than random