SlideShare a Scribd company logo
1 of 20
Download to read offline
Introduction to
Machine Learning &
Data Analytics
Shishir Choudhary
Why ML ?
What happened
with user at
meeting today ?
What user has
bought in past
What happened
at user house
today morning ?
What people
with similar
interest
bought
What has user
recently shown
interest in by
search
ML Prediction System
You want to take a good decision
• You can’t ask a human to take the decision due to time and dollar cost involved at
the scale of problem 

• You are presently anyway not taking good decisions because decision is presently
random , hard coded or complex rules based

• You want to reduce cost or verify your own decision
E.g. Taking good decision is neither easy, nor perfect, but ML makes it less error prone than
simpler old alternatives
Why Data Analytics ?
Understanding your business/industry/users better
You want to improve your business, industry or users, e.g. What
segments of users you have and how well each segment likes your
product. How well are different players in industry doing in different
segments of products they well ? Which products are growing fast ?
Helps to remove bias or opinions and gives more factual insights. Also
called data driven decision making. Typically involves visualisation,
clustering and at times regression(ML). 

Sometimes this involves analysing “big” amount of data typically from
world or “small” amount of data from your company itself
Why deep learning ?
• Better results with large amount of data

• Very high dimensions in data or time series / sequence
data - E.g. Image, Voice, Text, Time series 

• End to end Vs Part of workflow
What is ML = Optimisation Problem

Given Past Data
Parameter that
can’t be
obtained
What user has
bought in past
Parameter that
can’t be
obtained
What people
with similar
interest
bought
What has user
recently shown
interest in by
search
What product should be recommended to user ? Imagine 20 inputs
Prediction System
Weight for decision
= 0.2 ?
Weight for decision
= 0.3 ?
Weight for
decision
= 0.5 ?
ML = 

Optimize the Weights
Past Purchase Data
What is ML = 

Inductive Reasoning Problem
• Guess the function 

• f(1) = 1 

• f(2) = 4

• f(3) = 9 

• f(4) = 16 

• ?? what is f(x) ?
Involves leap of faith. Does training generalise well is the crucial question ?
Alternative = 

Deductive Reasoning 

= Rules Based System
Simple attempt at predicting gender by rules 

• Men => Short Hair 

• Women => Medium/Long Hair 

• Arun is a Man => Arun has short hair
Quite quickly, rules and their exceptions can become too complex to manage
Most Popular ML
Problem Statements
• Classification 

• Given lot of examples, learn how to predict 

function(Input) -> Output one of Fixed Set Of Categories 

• E.g. Spam or Not 

• Good vs bad investment

• Image classification / labelling 

• Regression 

• Given lot of examples, learn how to predict 

function(Input) -> Continuous infinite values function 

• House Price estimation

• Pricing a stock 

• Time for food to arrive for food delivery app
Classification or Regression
?
$100-120k $120-140k $140-160k $160-180k
100
120
140
160
80 sqm 100 sqm 120 sqm 140 sqm
House Pricing - Classification
Class 1 Class 2 Class 3 Class 4
House Pricing - Regression
Size ->
Size ->
Crucial challenges
• Bias Error & Variance Error & Irreducible Error
ML Models try to balance between bias and variance. Some models by default have
more bias and some have by default more variance. Some some parameters their
default leaning can be adjusted.
DL can reduce both bias and variance error with huge amount of data and high number
of features/parameters
Crucial Challenges
• Curse of dimensionality 

• Amount of training data needed increases exponentially with more features
used for prediction

• Hence dimensionality reduction techniques

• Bad and missing data 

• Lot of real world data often is bad quality, with either wrong or missing
values. Thus good data becomes quite useful asset today. 

• Feature Engineering 

• In ML you need to select right features to use for prediction. This is a tough
problem and partly solved by domain knowledge and partly by data analysis
ML vs DL
0
25
50
75
100
Untitled 1
ML DL Small DL Large
Larger Training Data
Accuracy
At lower quantum of training data, ML , Small DL models and Large DL models all perform
quite similarly. At larger quantum DL is better able to utilise the additional training data. Latest
Image Recognition DL Models perform better than human error rates.
Models !
Models are fancy words for 

mathematical functions which are used to generalise from training data
Prediction Optimisation
Most learning algorithm work in some sort of derivative of
following steps
1. Based on inputs and initial model parameters, make a
prediction
2. Check what was actual answer and calculate error
3. Try to minimise the error by adjusting the models’s
parameters in direction which reduces error
4. Repeat till convergence (error % stops reducing and is
low)
Regression
Takes simplifying assumption that prediction
is linear (or binomial or trinomial) relation to input(s)
E.g. Height of plant by age , 

time required to deliver food in food app
Decision Trees
Perceptron
Neural Network
Good resources
• Udacity (completely) Free Machine Learning Course 

• Josh Gordon’s Youtube Videos on Introduction to ML 

• Libraries 

• Scikit Learn in Python - Machine Learning 

• Tensorflow - Deep Learning 

• Gensim - Natural Language Processing

• http://learney.sg/#machine learning

• http://learney.sg/#data science

• http://learney.sg/#deep learning

• http://learney.sg/#natural language processing
Summary &
Q &A

More Related Content

What's hot

Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Marina Santini
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachinePulse
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRaveen Perera
 
Module 9: Natural Language Processing Part 2
Module 9:  Natural Language Processing Part 2Module 9:  Natural Language Processing Part 2
Module 9: Natural Language Processing Part 2Sara Hooker
 
Machine learning with Big Data power point presentation
Machine learning with Big Data power point presentationMachine learning with Big Data power point presentation
Machine learning with Big Data power point presentationDavid Raj Kanthi
 
End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning ProjectEng Teong Cheah
 
Barga Data Science lecture 9
Barga Data Science lecture 9Barga Data Science lecture 9
Barga Data Science lecture 9Roger Barga
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learningTamir Taha
 
Fairly Measuring Fairness In Machine Learning
Fairly Measuring Fairness In Machine LearningFairly Measuring Fairness In Machine Learning
Fairly Measuring Fairness In Machine LearningHJ van Veen
 
Machine Learning: Understanding the Invisible Force Changing Our World
Machine Learning: Understanding the Invisible Force Changing Our WorldMachine Learning: Understanding the Invisible Force Changing Our World
Machine Learning: Understanding the Invisible Force Changing Our WorldKen Tabor
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesRui Pedro Paiva
 
Machine learning_ Replicating Human Brain
Machine learning_ Replicating Human BrainMachine learning_ Replicating Human Brain
Machine learning_ Replicating Human BrainNishant Jain
 
Applications in Machine Learning
Applications in Machine LearningApplications in Machine Learning
Applications in Machine LearningJoel Graff
 
Module 4: Model Selection and Evaluation
Module 4: Model Selection and EvaluationModule 4: Model Selection and Evaluation
Module 4: Model Selection and EvaluationSara Hooker
 
Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1Roger Barga
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Simplilearn
 

What's hot (20)

Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)
 
What is Machine Learning
What is Machine LearningWhat is Machine Learning
What is Machine Learning
 
Machine learning
Machine learning Machine learning
Machine learning
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World Applications
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Module 9: Natural Language Processing Part 2
Module 9:  Natural Language Processing Part 2Module 9:  Natural Language Processing Part 2
Module 9: Natural Language Processing Part 2
 
Machine learning with Big Data power point presentation
Machine learning with Big Data power point presentationMachine learning with Big Data power point presentation
Machine learning with Big Data power point presentation
 
End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning Project
 
Barga Data Science lecture 9
Barga Data Science lecture 9Barga Data Science lecture 9
Barga Data Science lecture 9
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learning
 
Fairly Measuring Fairness In Machine Learning
Fairly Measuring Fairness In Machine LearningFairly Measuring Fairness In Machine Learning
Fairly Measuring Fairness In Machine Learning
 
Machine Learning: Understanding the Invisible Force Changing Our World
Machine Learning: Understanding the Invisible Force Changing Our WorldMachine Learning: Understanding the Invisible Force Changing Our World
Machine Learning: Understanding the Invisible Force Changing Our World
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and Techniques
 
Machine learning_ Replicating Human Brain
Machine learning_ Replicating Human BrainMachine learning_ Replicating Human Brain
Machine learning_ Replicating Human Brain
 
Applications in Machine Learning
Applications in Machine LearningApplications in Machine Learning
Applications in Machine Learning
 
Module 4: Model Selection and Evaluation
Module 4: Model Selection and EvaluationModule 4: Model Selection and Evaluation
Module 4: Model Selection and Evaluation
 
Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1
 
ML Basics
ML BasicsML Basics
ML Basics
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...
 

Similar to Introduction to machine learning and deep learning

Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxChitrachitrap
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2Roger Barga
 
How to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product ManagerHow to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product ManagerProduct School
 
The 8 Step Data Mining Process
The 8 Step Data Mining ProcessThe 8 Step Data Mining Process
The 8 Step Data Mining ProcessMarc Berman
 
Mini datathon - Bengaluru
Mini datathon - BengaluruMini datathon - Bengaluru
Mini datathon - BengaluruKunal Jain
 
Data mining - Machine Learning
Data mining - Machine LearningData mining - Machine Learning
Data mining - Machine LearningRupaDutta3
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)SwatiTripathi44
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10Roger Barga
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayPhillip Law
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayPhillip Law
 
H2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark LandryH2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark LandrySri Ambati
 
Operationalizing Machine Learning
Operationalizing Machine LearningOperationalizing Machine Learning
Operationalizing Machine LearningAgileThought
 
Machine Learning in the Financial Industry
Machine Learning in the Financial IndustryMachine Learning in the Financial Industry
Machine Learning in the Financial IndustrySubrat Panda, PhD
 
AWS re:Invent 2016: Getting to Ground Truth with Amazon Mechanical Turk (MAC201)
AWS re:Invent 2016: Getting to Ground Truth with Amazon Mechanical Turk (MAC201)AWS re:Invent 2016: Getting to Ground Truth with Amazon Mechanical Turk (MAC201)
AWS re:Invent 2016: Getting to Ground Truth with Amazon Mechanical Turk (MAC201)Amazon Web Services
 
Top 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsTop 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsSri Ambati
 
An introduction to machine learning and statistics
An introduction to machine learning and statisticsAn introduction to machine learning and statistics
An introduction to machine learning and statisticsSpotle.ai
 

Similar to Introduction to machine learning and deep learning (20)

Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptx
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
How to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product ManagerHow to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product Manager
 
The 8 Step Data Mining Process
The 8 Step Data Mining ProcessThe 8 Step Data Mining Process
The 8 Step Data Mining Process
 
Deep learning for NLP
Deep learning for NLPDeep learning for NLP
Deep learning for NLP
 
Mini datathon - Bengaluru
Mini datathon - BengaluruMini datathon - Bengaluru
Mini datathon - Bengaluru
 
Data mining - Machine Learning
Data mining - Machine LearningData mining - Machine Learning
Data mining - Machine Learning
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 Day
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 Day
 
H2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark LandryH2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark Landry
 
Operationalizing Machine Learning
Operationalizing Machine LearningOperationalizing Machine Learning
Operationalizing Machine Learning
 
Machine Learning in the Financial Industry
Machine Learning in the Financial IndustryMachine Learning in the Financial Industry
Machine Learning in the Financial Industry
 
PCA.pptx
PCA.pptxPCA.pptx
PCA.pptx
 
AWS re:Invent 2016: Getting to Ground Truth with Amazon Mechanical Turk (MAC201)
AWS re:Invent 2016: Getting to Ground Truth with Amazon Mechanical Turk (MAC201)AWS re:Invent 2016: Getting to Ground Truth with Amazon Mechanical Turk (MAC201)
AWS re:Invent 2016: Getting to Ground Truth with Amazon Mechanical Turk (MAC201)
 
Top 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsTop 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner Pitfalls
 
Data Analytics .pptx
Data Analytics .pptxData Analytics .pptx
Data Analytics .pptx
 
An introduction to machine learning and statistics
An introduction to machine learning and statisticsAn introduction to machine learning and statistics
An introduction to machine learning and statistics
 

More from Shishir Choudhary

More from Shishir Choudhary (8)

Learncool - Angel Round Pitch
Learncool - Angel Round Pitch Learncool - Angel Round Pitch
Learncool - Angel Round Pitch
 
Adopting innovation
Adopting innovationAdopting innovation
Adopting innovation
 
Adopting innovation
Adopting innovationAdopting innovation
Adopting innovation
 
Adopting innovation
Adopting innovationAdopting innovation
Adopting innovation
 
Building innovative products
Building innovative productsBuilding innovative products
Building innovative products
 
Building innovative products
Building innovative productsBuilding innovative products
Building innovative products
 
Building innovative products
Building innovative productsBuilding innovative products
Building innovative products
 
Building innovative products
Building innovative productsBuilding innovative products
Building innovative products
 

Recently uploaded

Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxVenkatasubramani13
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?sonikadigital1
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)Data & Analytics Magazin
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best PracticesDataArchiva
 
CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionajayrajaganeshkayala
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxDwiAyuSitiHartinah
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptaigil2
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerPavel Šabatka
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Vladislav Solodkiy
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityAggregage
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationGiorgio Carbone
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructuresonikadigital1
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024Becky Burwell
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Guido X Jansen
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.JasonViviers2
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introductionsanjaymuralee1
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...PrithaVashisht1
 

Recently uploaded (17)

Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptx
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices
 
CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual intervention
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .ppt
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayer
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - Presentation
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructure
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introduction
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...
 

Introduction to machine learning and deep learning

  • 1. Introduction to Machine Learning & Data Analytics Shishir Choudhary
  • 2. Why ML ? What happened with user at meeting today ? What user has bought in past What happened at user house today morning ? What people with similar interest bought What has user recently shown interest in by search ML Prediction System You want to take a good decision • You can’t ask a human to take the decision due to time and dollar cost involved at the scale of problem • You are presently anyway not taking good decisions because decision is presently random , hard coded or complex rules based • You want to reduce cost or verify your own decision E.g. Taking good decision is neither easy, nor perfect, but ML makes it less error prone than simpler old alternatives
  • 3. Why Data Analytics ? Understanding your business/industry/users better You want to improve your business, industry or users, e.g. What segments of users you have and how well each segment likes your product. How well are different players in industry doing in different segments of products they well ? Which products are growing fast ? Helps to remove bias or opinions and gives more factual insights. Also called data driven decision making. Typically involves visualisation, clustering and at times regression(ML). Sometimes this involves analysing “big” amount of data typically from world or “small” amount of data from your company itself
  • 4. Why deep learning ? • Better results with large amount of data • Very high dimensions in data or time series / sequence data - E.g. Image, Voice, Text, Time series • End to end Vs Part of workflow
  • 5. What is ML = Optimisation Problem
 Given Past Data Parameter that can’t be obtained What user has bought in past Parameter that can’t be obtained What people with similar interest bought What has user recently shown interest in by search What product should be recommended to user ? Imagine 20 inputs Prediction System Weight for decision = 0.2 ? Weight for decision = 0.3 ? Weight for decision = 0.5 ? ML = 
 Optimize the Weights Past Purchase Data
  • 6. What is ML = 
 Inductive Reasoning Problem • Guess the function • f(1) = 1 • f(2) = 4 • f(3) = 9 • f(4) = 16 • ?? what is f(x) ? Involves leap of faith. Does training generalise well is the crucial question ?
  • 7. Alternative = 
 Deductive Reasoning 
 = Rules Based System Simple attempt at predicting gender by rules • Men => Short Hair • Women => Medium/Long Hair • Arun is a Man => Arun has short hair Quite quickly, rules and their exceptions can become too complex to manage
  • 8. Most Popular ML Problem Statements • Classification • Given lot of examples, learn how to predict 
 function(Input) -> Output one of Fixed Set Of Categories • E.g. Spam or Not • Good vs bad investment • Image classification / labelling • Regression • Given lot of examples, learn how to predict 
 function(Input) -> Continuous infinite values function • House Price estimation • Pricing a stock • Time for food to arrive for food delivery app
  • 9. Classification or Regression ? $100-120k $120-140k $140-160k $160-180k 100 120 140 160 80 sqm 100 sqm 120 sqm 140 sqm House Pricing - Classification Class 1 Class 2 Class 3 Class 4 House Pricing - Regression Size -> Size ->
  • 10. Crucial challenges • Bias Error & Variance Error & Irreducible Error ML Models try to balance between bias and variance. Some models by default have more bias and some have by default more variance. Some some parameters their default leaning can be adjusted. DL can reduce both bias and variance error with huge amount of data and high number of features/parameters
  • 11. Crucial Challenges • Curse of dimensionality • Amount of training data needed increases exponentially with more features used for prediction • Hence dimensionality reduction techniques • Bad and missing data • Lot of real world data often is bad quality, with either wrong or missing values. Thus good data becomes quite useful asset today. • Feature Engineering • In ML you need to select right features to use for prediction. This is a tough problem and partly solved by domain knowledge and partly by data analysis
  • 12. ML vs DL 0 25 50 75 100 Untitled 1 ML DL Small DL Large Larger Training Data Accuracy At lower quantum of training data, ML , Small DL models and Large DL models all perform quite similarly. At larger quantum DL is better able to utilise the additional training data. Latest Image Recognition DL Models perform better than human error rates.
  • 13. Models ! Models are fancy words for 
 mathematical functions which are used to generalise from training data
  • 14. Prediction Optimisation Most learning algorithm work in some sort of derivative of following steps 1. Based on inputs and initial model parameters, make a prediction 2. Check what was actual answer and calculate error 3. Try to minimise the error by adjusting the models’s parameters in direction which reduces error 4. Repeat till convergence (error % stops reducing and is low)
  • 15. Regression Takes simplifying assumption that prediction is linear (or binomial or trinomial) relation to input(s) E.g. Height of plant by age , 
 time required to deliver food in food app
  • 19. Good resources • Udacity (completely) Free Machine Learning Course • Josh Gordon’s Youtube Videos on Introduction to ML • Libraries • Scikit Learn in Python - Machine Learning • Tensorflow - Deep Learning • Gensim - Natural Language Processing • http://learney.sg/#machine learning • http://learney.sg/#data science • http://learney.sg/#deep learning • http://learney.sg/#natural language processing