SlideShare une entreprise Scribd logo
1  sur  33
Discover the Potential of
Your Data with Machine
Learning
Housekeeping
• Webinar recordings and slides will be shared
with all attendees
• Type in your questions and comments using
the question pane on the right hand side
© Harbinger Systems | www.harbinger-systems.com
Presenters
© Harbinger Systems | www.harbinger-systems.com
Lalit Kumar
Business Analyst
Harbinger Systems
Gautam Mainkar
Data Analyst
Harbinger Systems
Agenda
• A Practical definition
• Why its important
• Using machine learning on enterprise data
– Types of business problems machine learning can solve
– How to categorize a problem- Regression, Clustering and
Classification
• Overview of key algorithms, tools and technologies
• Walk-through of real-world use cases
© Harbinger Systems | www.harbinger-systems.com
Machine Learning (ML) – A Practical Definition
A type of artificial intelligence
that provides computers with the ability to learn
without being explicitly programmed.
• Computer can infer rules inherent in data
• Computer adapts when exposed to new data
© Harbinger Systems | www.harbinger-systems.com
Why we Need it?
© Harbinger Systems | www.harbinger-systems.com
Comic by XKCD
Enterprise Data Hides Information
“There are things we know we know,
There are things we know we don't know.
But there are also things we don't know we don't
know”
- Donald Rumsfeld
© Harbinger Systems | www.harbinger-systems.com
What Constitutes a Machine Learning Problem?
© Harbinger Systems | www.harbinger-systems.com
Emphasis of machine learning is on
automatic methods
Devise learning algorithms that do the
learning automatically without human
intervention
Program by example: we don't care what
the machine does, as long as it does it
right
Result-oriented rather than process-
oriented
How can Machine Learning Add Value?
© Harbinger Systems | www.harbinger-systems.com
ML is a data driven approach
• Business knowledge isn’t necessary
ML is domain independent
• Same algorithms can be used across domains and in different use cases
ML creates flexible decision systems
• Creates robust systems that can adjust for changing systems without
human intervention
ML and Big Data
ML thrives with big data!
– Accuracy of algorithms increases with size of data
– Statistical approaches can treat big datasets much better than
traditional paradigms
– Decision making using ML can adapt to transactional data much better
© Harbinger Systems | www.harbinger-systems.com
Machine Learning Big Data
Fraud Detection: Did the user really do this login/make this purchase?
Product Recommendation: Will the user like this product?
Stock Trading: Will the stock go up or down?
Medical Diagnosis: Given some symptoms, what is the patient
suffering from?
© Harbinger Systems | www.harbinger-systems.com
Machine Learning Applications- Some Examples
© Harbinger Systems | www.harbinger-systems.com
How to Categorize the Problem?
Generally, machine learning problems looks to:
Identify a Value
Assign data points to a category
Discover similarities between two data points
© Harbinger Systems | www.harbinger-systems.com
Flowchart
Start
Sufficient
Data?
Sort into
category?
Predict a
value?
Refine Problem!
Labeled
Data
Clustering
Classification
Get more!
Regression
© Harbinger Systems | www.harbinger-systems.com
What to look for in algorithms:
Flexible across many use cases
Able to handle several input types
Accurate
Resistant to over-fitting/noise/error
Machine Learning Algorithms
© Harbinger Systems | www.harbinger-systems.com
Random Forest
Used for classification and regression
Works on small subsets of data and combines the result into the best estimate
XGBoost
Works on classification and regression
Starts off with a weak learner that improves over successive iterations
K-Means
Works on classification and clustering
Tries to find boundaries between data points for each individual variable
Machine Learning Algorithms
© Harbinger Systems | www.harbinger-systems.com
Tools and Technologies
Emphasis on tools which:
Can integrate with existing data architecture
Have a smooth learning curve
Simplify the process of analysis and prediction
Have an active community
© Harbinger Systems | www.harbinger-systems.com
Popular Machine Learning Tools
Python
Free, open-source, widely popular
Consolidates many important libraries in python, C
Has an active community
Disclaimer: Brand names, logos and trademarks used herein remain the property of their respective owners.
© Harbinger Systems | www.harbinger-systems.com
Popular Machine Learning Tools
R
Statistical computing language that simplifies complex
statistical operations
Large number of libraries available for extending
functionality (DB connectors, algorithm, visualization)
Disclaimer: Brand names, logos and trademarks used herein remain the property of their respective owners.
Scenario
Industrial MNC buys part assemblies from various suppliers
Supplier selection workflow is cumbersome and inadaptable
Create a system to predict supplier price quotes and simplify selection process
© Harbinger Systems | www.harbinger-systems.com
Price Prediction: Regression Problem
Data Available
Technical specifications and pricing parameters of past supplier quotes
© Harbinger Systems | www.harbinger-systems.com
Problem Type
Predict a numerical value (price quoted by supplier)
Numerical data (specs, prices, etc.)
Categorical data (part composition, payment options, etc.)
Price Prediction: Regression Problem
Algorithm Chosen: Random Forest
We are working with a mix of numerical and categorical variables
Large number of records but with relatively low dimensionality of features
(Overfitting is not a big risk)
We expect a complex relationship between features
© Harbinger Systems | www.harbinger-systems.com
Price Prediction: Regression Problem
Result
Predicted the quote price given by the supplier with a relatively low error rate
Simplified supplier selection workflow and opened avenues for complete
automation in future
© Harbinger Systems | www.harbinger-systems.com
Price Prediction: Regression Problem
Scenario
eLearning product is sold to universities, corporate and institution across
the globe
There is a need to improve conversion rate by targeted marketing
Create a system to sort prospects into a specific segment
© Harbinger Systems | www.harbinger-systems.com
Targeted Marketing: Classification Problem
Data Available
Historical data of purchases and customer data from CRM
© Harbinger Systems | www.harbinger-systems.com
Problem Type: Predict a Category (Customer Segment) Based on
Numerical data (payment records)
Categorical data (customer profile data, product purchased by data)
Targeted Marketing: Classification Problem
Algorithm Chosen: Gradient Boosting Machine (XGBoost)
A mix of numerical and categorical values
Extremely high dimensionality and size of data
Parallel processing capacities could be useful
Overfitting could be a problem
© Harbinger Systems | www.harbinger-systems.com
Targeted Marketing: Classification Problem
Result
Created customer segments; new prospects entering CRM are sorted into a
segment and marketing campaigns are targeted to a particular segment
Sales people are better equipped with insights
© Harbinger Systems | www.harbinger-systems.com
Targeted Marketing: Classification Problem
Scenario
News feed engine publishes varied news content for users
Some level of categorization is done by humans
There is a need to personalize and recommend articles
Create a system to discover similar articles based on content
© Harbinger Systems | www.harbinger-systems.com
Personalized News Feed: Clustering Problem
Data Available
Text content of the news articles
User’s reading history
© Harbinger Systems | www.harbinger-systems.com
Personalized News Feed: Clustering Problem
Algorithm Chosen: K-Means Clustering
We are interested in sorting data points in an arbitrary series of clusters
No intrinsic metric for verifying the 'correctness' of a cluster, must be checked by
human oversight
We expect sorting to be accurate with more data
© Harbinger Systems | www.harbinger-systems.com
Personalized News Feed: Clustering Problem
Result
Sorted articles into different clusters which are nominally identified by a label
© Harbinger Systems | www.harbinger-systems.com
Personalized News Feed: Clustering Problem
Conclusion
• Amount of data available to enterprises is exploding
• In order to remain competitive, enterprises will have
to have mastery over their data
• Machine learning provides a powerful framework for
extracting meaning and actions from data
© Harbinger Systems | www.harbinger-systems.com
Q&A
© Harbinger Systems | www.harbinger-systems.com
© Harbinger Systems | www.harbinger-systems.com
Thank You!
Visit us at: www.harbinger-systems.com
Write to us at: hsinfo@harbingergroup.com
Blog: blog.harbinger-systems.com
Twitter: twitter.com/HarbingerSys (@HarbingerSys)
Slideshare: slideshare.net/hsplmkting
Facebook: facebook.com/harbingersys
LinkedIn: linkedin.com/company/382306
Instagram: https://www.instagram.com/harbingersystems

Contenu connexe

Tendances

SoftWatch Overview_short (1)
SoftWatch Overview_short (1)SoftWatch Overview_short (1)
SoftWatch Overview_short (1)Moshe Kozlovski
 
AMB120: How Mature Are You? ITAM Attainment Model
AMB120: How Mature Are You? ITAM Attainment ModelAMB120: How Mature Are You? ITAM Attainment Model
AMB120: How Mature Are You? ITAM Attainment ModelIvanti
 
WSO2Con US 2013 - Creating the API Centric Enterprise Towards a Connected Bus...
WSO2Con US 2013 - Creating the API Centric Enterprise Towards a Connected Bus...WSO2Con US 2013 - Creating the API Centric Enterprise Towards a Connected Bus...
WSO2Con US 2013 - Creating the API Centric Enterprise Towards a Connected Bus...WSO2
 
Wso2 con building the api centric enterprise - towards a connected business
Wso2 con   building the api centric enterprise - towards a connected businessWso2 con   building the api centric enterprise - towards a connected business
Wso2 con building the api centric enterprise - towards a connected businessJohn Mathon
 
AMB410: ITxM: The ITAM, ITSM, and Security Crossroads
AMB410: ITxM: The ITAM, ITSM, and Security CrossroadsAMB410: ITxM: The ITAM, ITSM, and Security Crossroads
AMB410: ITxM: The ITAM, ITSM, and Security CrossroadsIvanti
 
AMB300: Lessons Learned from ITAM Customers
AMB300: Lessons Learned from ITAM CustomersAMB300: Lessons Learned from ITAM Customers
AMB300: Lessons Learned from ITAM CustomersIvanti
 
Partner Transformation for Hybrid Cloud Management
Partner Transformation for Hybrid Cloud ManagementPartner Transformation for Hybrid Cloud Management
Partner Transformation for Hybrid Cloud ManagementVistara
 
Measuring the Success of Cloud-Based Services
Measuring the Success of Cloud-Based ServicesMeasuring the Success of Cloud-Based Services
Measuring the Success of Cloud-Based ServicesVistara
 
Overcoming Barriers to the Cloud
Overcoming Barriers to the Cloud Overcoming Barriers to the Cloud
Overcoming Barriers to the Cloud Andy Milsark
 
AMB110: IT Asset Management – How to Start When You Don’t Know Where to Start
AMB110: IT Asset Management – How to Start When You Don’t Know Where to StartAMB110: IT Asset Management – How to Start When You Don’t Know Where to Start
AMB110: IT Asset Management – How to Start When You Don’t Know Where to StartIvanti
 
IAM and cybersecurity - June 15
IAM and cybersecurity - June 15IAM and cybersecurity - June 15
IAM and cybersecurity - June 15Capgemini
 
AMB420: Data Center Licensing with License Optimizer
AMB420: Data Center Licensing with License OptimizerAMB420: Data Center Licensing with License Optimizer
AMB420: Data Center Licensing with License OptimizerIvanti
 
Data Analytics in Digital Transformation
Data Analytics in Digital TransformationData Analytics in Digital Transformation
Data Analytics in Digital TransformationMukund Babbar
 
WSO2Con USA 2017: Building Platforms for Rapid Application Development
WSO2Con USA 2017: Building Platforms for Rapid Application DevelopmentWSO2Con USA 2017: Building Platforms for Rapid Application Development
WSO2Con USA 2017: Building Platforms for Rapid Application DevelopmentWSO2
 
McKesson - Business Process Redesign
McKesson - Business Process RedesignMcKesson - Business Process Redesign
McKesson - Business Process RedesignBhavik Doshi
 
CrossView Managed Services
CrossView Managed ServicesCrossView Managed Services
CrossView Managed ServicesCrossView
 
Application Performance Monitoring
Application Performance MonitoringApplication Performance Monitoring
Application Performance MonitoringOlivier Gérardin
 
Introducing Express Software Manager
Introducing Express Software ManagerIntroducing Express Software Manager
Introducing Express Software ManagerCherwell Software
 
Launch Managed Services for IT Datasheet
Launch Managed Services for IT DatasheetLaunch Managed Services for IT Datasheet
Launch Managed Services for IT DatasheetKevin Raines
 

Tendances (20)

SoftWatch Overview_short (1)
SoftWatch Overview_short (1)SoftWatch Overview_short (1)
SoftWatch Overview_short (1)
 
AMB120: How Mature Are You? ITAM Attainment Model
AMB120: How Mature Are You? ITAM Attainment ModelAMB120: How Mature Are You? ITAM Attainment Model
AMB120: How Mature Are You? ITAM Attainment Model
 
WSO2Con US 2013 - Creating the API Centric Enterprise Towards a Connected Bus...
WSO2Con US 2013 - Creating the API Centric Enterprise Towards a Connected Bus...WSO2Con US 2013 - Creating the API Centric Enterprise Towards a Connected Bus...
WSO2Con US 2013 - Creating the API Centric Enterprise Towards a Connected Bus...
 
Wso2 con building the api centric enterprise - towards a connected business
Wso2 con   building the api centric enterprise - towards a connected businessWso2 con   building the api centric enterprise - towards a connected business
Wso2 con building the api centric enterprise - towards a connected business
 
AMB410: ITxM: The ITAM, ITSM, and Security Crossroads
AMB410: ITxM: The ITAM, ITSM, and Security CrossroadsAMB410: ITxM: The ITAM, ITSM, and Security Crossroads
AMB410: ITxM: The ITAM, ITSM, and Security Crossroads
 
AMB300: Lessons Learned from ITAM Customers
AMB300: Lessons Learned from ITAM CustomersAMB300: Lessons Learned from ITAM Customers
AMB300: Lessons Learned from ITAM Customers
 
Partner Transformation for Hybrid Cloud Management
Partner Transformation for Hybrid Cloud ManagementPartner Transformation for Hybrid Cloud Management
Partner Transformation for Hybrid Cloud Management
 
Measuring the Success of Cloud-Based Services
Measuring the Success of Cloud-Based ServicesMeasuring the Success of Cloud-Based Services
Measuring the Success of Cloud-Based Services
 
Overcoming Barriers to the Cloud
Overcoming Barriers to the Cloud Overcoming Barriers to the Cloud
Overcoming Barriers to the Cloud
 
AMB110: IT Asset Management – How to Start When You Don’t Know Where to Start
AMB110: IT Asset Management – How to Start When You Don’t Know Where to StartAMB110: IT Asset Management – How to Start When You Don’t Know Where to Start
AMB110: IT Asset Management – How to Start When You Don’t Know Where to Start
 
IAM and cybersecurity - June 15
IAM and cybersecurity - June 15IAM and cybersecurity - June 15
IAM and cybersecurity - June 15
 
AMB420: Data Center Licensing with License Optimizer
AMB420: Data Center Licensing with License OptimizerAMB420: Data Center Licensing with License Optimizer
AMB420: Data Center Licensing with License Optimizer
 
Data Analytics in Digital Transformation
Data Analytics in Digital TransformationData Analytics in Digital Transformation
Data Analytics in Digital Transformation
 
WSO2Con USA 2017: Building Platforms for Rapid Application Development
WSO2Con USA 2017: Building Platforms for Rapid Application DevelopmentWSO2Con USA 2017: Building Platforms for Rapid Application Development
WSO2Con USA 2017: Building Platforms for Rapid Application Development
 
McKesson - Business Process Redesign
McKesson - Business Process RedesignMcKesson - Business Process Redesign
McKesson - Business Process Redesign
 
CrossView Managed Services
CrossView Managed ServicesCrossView Managed Services
CrossView Managed Services
 
Application Performance Monitoring
Application Performance MonitoringApplication Performance Monitoring
Application Performance Monitoring
 
Introducing Express Software Manager
Introducing Express Software ManagerIntroducing Express Software Manager
Introducing Express Software Manager
 
eDocument Sciences SaaS 101
eDocument Sciences SaaS 101eDocument Sciences SaaS 101
eDocument Sciences SaaS 101
 
Launch Managed Services for IT Datasheet
Launch Managed Services for IT DatasheetLaunch Managed Services for IT Datasheet
Launch Managed Services for IT Datasheet
 

En vedette

En vedette (15)

Webinar presentation-startups and mobility
Webinar presentation-startups and mobilityWebinar presentation-startups and mobility
Webinar presentation-startups and mobility
 
Building next gen hr solutions with people analytics-final
Building next gen hr solutions with people analytics-finalBuilding next gen hr solutions with people analytics-final
Building next gen hr solutions with people analytics-final
 
Building real-time-collaborative-web-applications
Building real-time-collaborative-web-applicationsBuilding real-time-collaborative-web-applications
Building real-time-collaborative-web-applications
 
CLOUDIFICATION FOR INTERNET OF THINGS - THE ROAD AHEAD
CLOUDIFICATION FOR INTERNET OF THINGS - THE ROAD AHEADCLOUDIFICATION FOR INTERNET OF THINGS - THE ROAD AHEAD
CLOUDIFICATION FOR INTERNET OF THINGS - THE ROAD AHEAD
 
Webinar: Digital Health - The New Rx for USA Healthcare Ecosystem
Webinar: Digital Health - The New Rx for USA Healthcare EcosystemWebinar: Digital Health - The New Rx for USA Healthcare Ecosystem
Webinar: Digital Health - The New Rx for USA Healthcare Ecosystem
 
Webinar: UI/UX best practices in cms based web design
Webinar: UI/UX best practices in cms based web designWebinar: UI/UX best practices in cms based web design
Webinar: UI/UX best practices in cms based web design
 
Enhancing Unified Communication Experience through Microsoft Lync SDK and UCMA
Enhancing Unified Communication Experience through Microsoft Lync SDK and UCMAEnhancing Unified Communication Experience through Microsoft Lync SDK and UCMA
Enhancing Unified Communication Experience through Microsoft Lync SDK and UCMA
 
Webinar: How to choose your outsourcing partner for building mobile apps?
Webinar: How to choose your outsourcing partner for building mobile apps?Webinar: How to choose your outsourcing partner for building mobile apps?
Webinar: How to choose your outsourcing partner for building mobile apps?
 
iOS 8 HealthKit: Driving Smart Health Solutions
iOS 8 HealthKit: Driving Smart Health SolutionsiOS 8 HealthKit: Driving Smart Health Solutions
iOS 8 HealthKit: Driving Smart Health Solutions
 
Webinar: Structured attestation to meaningful use stage 2
Webinar: Structured attestation to meaningful use stage 2Webinar: Structured attestation to meaningful use stage 2
Webinar: Structured attestation to meaningful use stage 2
 
Webinar: Building amazing web apps rapidly with emerging tech
Webinar: Building amazing web apps rapidly with emerging techWebinar: Building amazing web apps rapidly with emerging tech
Webinar: Building amazing web apps rapidly with emerging tech
 
Webinar: Mobile UX: Doing It The Right Way
Webinar: Mobile UX: Doing It The Right WayWebinar: Mobile UX: Doing It The Right Way
Webinar: Mobile UX: Doing It The Right Way
 
Webinar: Automation of Test Automation
Webinar: Automation of Test AutomationWebinar: Automation of Test Automation
Webinar: Automation of Test Automation
 
Open Technology Solutions For Healthcare Startups
Open Technology Solutions For Healthcare StartupsOpen Technology Solutions For Healthcare Startups
Open Technology Solutions For Healthcare Startups
 
JavaScript MVC Frameworks: Backbone, Ember and Angular JS
JavaScript MVC Frameworks: Backbone, Ember and Angular JSJavaScript MVC Frameworks: Backbone, Ember and Angular JS
JavaScript MVC Frameworks: Backbone, Ember and Angular JS
 

Similaire à Discover the Potential of your Data with Machine Learning

How To Pick The Best Analytics Tool.pdf
How To Pick The Best Analytics Tool.pdfHow To Pick The Best Analytics Tool.pdf
How To Pick The Best Analytics Tool.pdfSatawaretechnologies1
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise deteo
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksAmazon Web Services
 
How to Wrangle Data for Machine Learning on AWS
 How to Wrangle Data for Machine Learning on AWS How to Wrangle Data for Machine Learning on AWS
How to Wrangle Data for Machine Learning on AWSAmazon Web Services
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data productsVikas Sardana
 
System Analysis And Design_FinalPPT_NirmishaK
System Analysis And Design_FinalPPT_NirmishaKSystem Analysis And Design_FinalPPT_NirmishaK
System Analysis And Design_FinalPPT_NirmishaKShehla Ghori
 
Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon Web Services
 
Demand forecasting case study
Demand forecasting case studyDemand forecasting case study
Demand forecasting case studyRupam Devnath
 
What's New in Predictive Analytics IBM SPSS
What's New in Predictive Analytics IBM SPSSWhat's New in Predictive Analytics IBM SPSS
What's New in Predictive Analytics IBM SPSSVirginia Fernandez
 
What's New in Predictive Analytics IBM SPSS - Apr 2016
What's New in Predictive Analytics IBM SPSS - Apr 2016What's New in Predictive Analytics IBM SPSS - Apr 2016
What's New in Predictive Analytics IBM SPSS - Apr 2016Edgar Alejandro Villegas
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer AnalyticsCourse5i
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software EngineeringMiroslaw Staron
 
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...BigML, Inc
 
8 Tools For Digital Transformation For Every Leader.pdf
8 Tools For Digital Transformation For Every Leader.pdf8 Tools For Digital Transformation For Every Leader.pdf
8 Tools For Digital Transformation For Every Leader.pdflearntransformation0
 
Fuel for the cognitive age: What's new in IBM predictive analytics
Fuel for the cognitive age: What's new in IBM predictive analytics Fuel for the cognitive age: What's new in IBM predictive analytics
Fuel for the cognitive age: What's new in IBM predictive analytics IBM SPSS Software
 

Similaire à Discover the Potential of your Data with Machine Learning (20)

How To Pick The Best Analytics Tool.pdf
How To Pick The Best Analytics Tool.pdfHow To Pick The Best Analytics Tool.pdf
How To Pick The Best Analytics Tool.pdf
 
How to Power Your HR Apps With AI And Make It Explainable
How to Power Your HR Apps With AI And Make It ExplainableHow to Power Your HR Apps With AI And Make It Explainable
How to Power Your HR Apps With AI And Make It Explainable
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech Talks
 
How to Wrangle Data for Machine Learning on AWS
 How to Wrangle Data for Machine Learning on AWS How to Wrangle Data for Machine Learning on AWS
How to Wrangle Data for Machine Learning on AWS
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data products
 
System Analysis And Design_FinalPPT_NirmishaK
System Analysis And Design_FinalPPT_NirmishaKSystem Analysis And Design_FinalPPT_NirmishaK
System Analysis And Design_FinalPPT_NirmishaK
 
Introduction to Sagemaker
Introduction to SagemakerIntroduction to Sagemaker
Introduction to Sagemaker
 
Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)
 
Demand forecasting case study
Demand forecasting case studyDemand forecasting case study
Demand forecasting case study
 
RowanDay3.pptx
RowanDay3.pptxRowanDay3.pptx
RowanDay3.pptx
 
What's New in Predictive Analytics IBM SPSS
What's New in Predictive Analytics IBM SPSSWhat's New in Predictive Analytics IBM SPSS
What's New in Predictive Analytics IBM SPSS
 
What's New in Predictive Analytics IBM SPSS - Apr 2016
What's New in Predictive Analytics IBM SPSS - Apr 2016What's New in Predictive Analytics IBM SPSS - Apr 2016
What's New in Predictive Analytics IBM SPSS - Apr 2016
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software Engineering
 
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
 
8 Tools For Digital Transformation For Every Leader.pdf
8 Tools For Digital Transformation For Every Leader.pdf8 Tools For Digital Transformation For Every Leader.pdf
8 Tools For Digital Transformation For Every Leader.pdf
 
Fuel for the cognitive age: What's new in IBM predictive analytics
Fuel for the cognitive age: What's new in IBM predictive analytics Fuel for the cognitive age: What's new in IBM predictive analytics
Fuel for the cognitive age: What's new in IBM predictive analytics
 
Crm1
Crm1Crm1
Crm1
 

Plus de Harbinger Systems - HRTech Builder of Choice

Plus de Harbinger Systems - HRTech Builder of Choice (20)

Using People Analytics for a Sustainable Remote Workforce
Using People Analytics for a Sustainable Remote WorkforceUsing People Analytics for a Sustainable Remote Workforce
Using People Analytics for a Sustainable Remote Workforce
 
5 Trends That Will Drive the Transformation of EdTech in 2021
5 Trends That Will Drive the Transformation of EdTech in 20215 Trends That Will Drive the Transformation of EdTech in 2021
5 Trends That Will Drive the Transformation of EdTech in 2021
 
Rapidly Transforming Organizational Content into Learning Experiences
Rapidly Transforming Organizational Content into Learning ExperiencesRapidly Transforming Organizational Content into Learning Experiences
Rapidly Transforming Organizational Content into Learning Experiences
 
Scalable HR Integrations for Better Data Analytics: Challenges & Solutions
Scalable HR Integrations for Better Data Analytics: Challenges & SolutionsScalable HR Integrations for Better Data Analytics: Challenges & Solutions
Scalable HR Integrations for Better Data Analytics: Challenges & Solutions
 
5 Key Items HR Should Consider Before Buying HR Technologies
5 Key Items HR Should Consider Before Buying HR Technologies5 Key Items HR Should Consider Before Buying HR Technologies
5 Key Items HR Should Consider Before Buying HR Technologies
 
Best Practices to Build Marketplace-Ready Integrations
Best Practices to Build Marketplace-Ready IntegrationsBest Practices to Build Marketplace-Ready Integrations
Best Practices to Build Marketplace-Ready Integrations
 
HRTech Integration Masterclass Session 4 How to Expand Your Recruitment Datab...
HRTech Integration Masterclass Session 4 How to Expand Your Recruitment Datab...HRTech Integration Masterclass Session 4 How to Expand Your Recruitment Datab...
HRTech Integration Masterclass Session 4 How to Expand Your Recruitment Datab...
 
Recalibrating Product Strategy - Addressing Demand Shifts in Existing Markets
Recalibrating Product Strategy - Addressing Demand Shifts in Existing MarketsRecalibrating Product Strategy - Addressing Demand Shifts in Existing Markets
Recalibrating Product Strategy - Addressing Demand Shifts in Existing Markets
 
How to Gain Key Insights from Data Distributed Across Multiple HR Systems
How to Gain Key Insights from Data Distributed Across Multiple HR SystemsHow to Gain Key Insights from Data Distributed Across Multiple HR Systems
How to Gain Key Insights from Data Distributed Across Multiple HR Systems
 
HRTech Integration Master Class Session 1 -Delivering Seamless Learning Exper...
HRTech Integration Master Class Session 1 -Delivering Seamless Learning Exper...HRTech Integration Master Class Session 1 -Delivering Seamless Learning Exper...
HRTech Integration Master Class Session 1 -Delivering Seamless Learning Exper...
 
Recalibrating Product Strategy - Addressing Demand Shifts in Existing Markets
Recalibrating Product Strategy - Addressing Demand Shifts in Existing MarketsRecalibrating Product Strategy - Addressing Demand Shifts in Existing Markets
Recalibrating Product Strategy - Addressing Demand Shifts in Existing Markets
 
Integrating System of Records and Collaboration Tools
Integrating System of Records and Collaboration ToolsIntegrating System of Records and Collaboration Tools
Integrating System of Records and Collaboration Tools
 
Chatbot for Continuous Performance Management
Chatbot for Continuous Performance Management Chatbot for Continuous Performance Management
Chatbot for Continuous Performance Management
 
Leveraging mobile capabilities in your HR application
Leveraging mobile capabilities in your HR applicationLeveraging mobile capabilities in your HR application
Leveraging mobile capabilities in your HR application
 
Automate HR applications using AI and ML
Automate HR applications using AI and MLAutomate HR applications using AI and ML
Automate HR applications using AI and ML
 
A Cloud-based Collaborative Learning and Coaching Platform
A Cloud-based Collaborative Learning and Coaching PlatformA Cloud-based Collaborative Learning and Coaching Platform
A Cloud-based Collaborative Learning and Coaching Platform
 
Extending LRSs and the xAPI for Event-driven Blended and Adaptive Learning
Extending LRSs and the xAPI for Event-driven Blended and Adaptive LearningExtending LRSs and the xAPI for Event-driven Blended and Adaptive Learning
Extending LRSs and the xAPI for Event-driven Blended and Adaptive Learning
 
A medical prescription reminder app for i phone
A medical prescription reminder app for i phoneA medical prescription reminder app for i phone
A medical prescription reminder app for i phone
 
Webinar IoT Cloud Platforms and Middleware for Rapid Application Development
Webinar IoT Cloud Platforms and Middleware for Rapid Application DevelopmentWebinar IoT Cloud Platforms and Middleware for Rapid Application Development
Webinar IoT Cloud Platforms and Middleware for Rapid Application Development
 
Harbinger Demo Session in cloud Expo 2015- IoT for You- Smart Watch, Smart Ho...
Harbinger Demo Session in cloud Expo 2015- IoT for You- Smart Watch, Smart Ho...Harbinger Demo Session in cloud Expo 2015- IoT for You- Smart Watch, Smart Ho...
Harbinger Demo Session in cloud Expo 2015- IoT for You- Smart Watch, Smart Ho...
 

Dernier

So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 

Dernier (20)

So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 

Discover the Potential of your Data with Machine Learning

  • 1. Discover the Potential of Your Data with Machine Learning
  • 2. Housekeeping • Webinar recordings and slides will be shared with all attendees • Type in your questions and comments using the question pane on the right hand side © Harbinger Systems | www.harbinger-systems.com
  • 3. Presenters © Harbinger Systems | www.harbinger-systems.com Lalit Kumar Business Analyst Harbinger Systems Gautam Mainkar Data Analyst Harbinger Systems
  • 4. Agenda • A Practical definition • Why its important • Using machine learning on enterprise data – Types of business problems machine learning can solve – How to categorize a problem- Regression, Clustering and Classification • Overview of key algorithms, tools and technologies • Walk-through of real-world use cases © Harbinger Systems | www.harbinger-systems.com
  • 5. Machine Learning (ML) – A Practical Definition A type of artificial intelligence that provides computers with the ability to learn without being explicitly programmed. • Computer can infer rules inherent in data • Computer adapts when exposed to new data © Harbinger Systems | www.harbinger-systems.com
  • 6. Why we Need it? © Harbinger Systems | www.harbinger-systems.com Comic by XKCD
  • 7. Enterprise Data Hides Information “There are things we know we know, There are things we know we don't know. But there are also things we don't know we don't know” - Donald Rumsfeld © Harbinger Systems | www.harbinger-systems.com
  • 8. What Constitutes a Machine Learning Problem? © Harbinger Systems | www.harbinger-systems.com Emphasis of machine learning is on automatic methods Devise learning algorithms that do the learning automatically without human intervention Program by example: we don't care what the machine does, as long as it does it right Result-oriented rather than process- oriented
  • 9. How can Machine Learning Add Value? © Harbinger Systems | www.harbinger-systems.com ML is a data driven approach • Business knowledge isn’t necessary ML is domain independent • Same algorithms can be used across domains and in different use cases ML creates flexible decision systems • Creates robust systems that can adjust for changing systems without human intervention
  • 10. ML and Big Data ML thrives with big data! – Accuracy of algorithms increases with size of data – Statistical approaches can treat big datasets much better than traditional paradigms – Decision making using ML can adapt to transactional data much better © Harbinger Systems | www.harbinger-systems.com Machine Learning Big Data
  • 11. Fraud Detection: Did the user really do this login/make this purchase? Product Recommendation: Will the user like this product? Stock Trading: Will the stock go up or down? Medical Diagnosis: Given some symptoms, what is the patient suffering from? © Harbinger Systems | www.harbinger-systems.com Machine Learning Applications- Some Examples
  • 12. © Harbinger Systems | www.harbinger-systems.com How to Categorize the Problem? Generally, machine learning problems looks to: Identify a Value Assign data points to a category Discover similarities between two data points
  • 13. © Harbinger Systems | www.harbinger-systems.com Flowchart Start Sufficient Data? Sort into category? Predict a value? Refine Problem! Labeled Data Clustering Classification Get more! Regression
  • 14. © Harbinger Systems | www.harbinger-systems.com What to look for in algorithms: Flexible across many use cases Able to handle several input types Accurate Resistant to over-fitting/noise/error Machine Learning Algorithms
  • 15. © Harbinger Systems | www.harbinger-systems.com Random Forest Used for classification and regression Works on small subsets of data and combines the result into the best estimate XGBoost Works on classification and regression Starts off with a weak learner that improves over successive iterations K-Means Works on classification and clustering Tries to find boundaries between data points for each individual variable Machine Learning Algorithms
  • 16. © Harbinger Systems | www.harbinger-systems.com Tools and Technologies Emphasis on tools which: Can integrate with existing data architecture Have a smooth learning curve Simplify the process of analysis and prediction Have an active community
  • 17. © Harbinger Systems | www.harbinger-systems.com Popular Machine Learning Tools Python Free, open-source, widely popular Consolidates many important libraries in python, C Has an active community Disclaimer: Brand names, logos and trademarks used herein remain the property of their respective owners.
  • 18. © Harbinger Systems | www.harbinger-systems.com Popular Machine Learning Tools R Statistical computing language that simplifies complex statistical operations Large number of libraries available for extending functionality (DB connectors, algorithm, visualization) Disclaimer: Brand names, logos and trademarks used herein remain the property of their respective owners.
  • 19. Scenario Industrial MNC buys part assemblies from various suppliers Supplier selection workflow is cumbersome and inadaptable Create a system to predict supplier price quotes and simplify selection process © Harbinger Systems | www.harbinger-systems.com Price Prediction: Regression Problem
  • 20. Data Available Technical specifications and pricing parameters of past supplier quotes © Harbinger Systems | www.harbinger-systems.com Problem Type Predict a numerical value (price quoted by supplier) Numerical data (specs, prices, etc.) Categorical data (part composition, payment options, etc.) Price Prediction: Regression Problem
  • 21. Algorithm Chosen: Random Forest We are working with a mix of numerical and categorical variables Large number of records but with relatively low dimensionality of features (Overfitting is not a big risk) We expect a complex relationship between features © Harbinger Systems | www.harbinger-systems.com Price Prediction: Regression Problem
  • 22. Result Predicted the quote price given by the supplier with a relatively low error rate Simplified supplier selection workflow and opened avenues for complete automation in future © Harbinger Systems | www.harbinger-systems.com Price Prediction: Regression Problem
  • 23. Scenario eLearning product is sold to universities, corporate and institution across the globe There is a need to improve conversion rate by targeted marketing Create a system to sort prospects into a specific segment © Harbinger Systems | www.harbinger-systems.com Targeted Marketing: Classification Problem
  • 24. Data Available Historical data of purchases and customer data from CRM © Harbinger Systems | www.harbinger-systems.com Problem Type: Predict a Category (Customer Segment) Based on Numerical data (payment records) Categorical data (customer profile data, product purchased by data) Targeted Marketing: Classification Problem
  • 25. Algorithm Chosen: Gradient Boosting Machine (XGBoost) A mix of numerical and categorical values Extremely high dimensionality and size of data Parallel processing capacities could be useful Overfitting could be a problem © Harbinger Systems | www.harbinger-systems.com Targeted Marketing: Classification Problem
  • 26. Result Created customer segments; new prospects entering CRM are sorted into a segment and marketing campaigns are targeted to a particular segment Sales people are better equipped with insights © Harbinger Systems | www.harbinger-systems.com Targeted Marketing: Classification Problem
  • 27. Scenario News feed engine publishes varied news content for users Some level of categorization is done by humans There is a need to personalize and recommend articles Create a system to discover similar articles based on content © Harbinger Systems | www.harbinger-systems.com Personalized News Feed: Clustering Problem
  • 28. Data Available Text content of the news articles User’s reading history © Harbinger Systems | www.harbinger-systems.com Personalized News Feed: Clustering Problem
  • 29. Algorithm Chosen: K-Means Clustering We are interested in sorting data points in an arbitrary series of clusters No intrinsic metric for verifying the 'correctness' of a cluster, must be checked by human oversight We expect sorting to be accurate with more data © Harbinger Systems | www.harbinger-systems.com Personalized News Feed: Clustering Problem
  • 30. Result Sorted articles into different clusters which are nominally identified by a label © Harbinger Systems | www.harbinger-systems.com Personalized News Feed: Clustering Problem
  • 31. Conclusion • Amount of data available to enterprises is exploding • In order to remain competitive, enterprises will have to have mastery over their data • Machine learning provides a powerful framework for extracting meaning and actions from data © Harbinger Systems | www.harbinger-systems.com
  • 32. Q&A © Harbinger Systems | www.harbinger-systems.com
  • 33. © Harbinger Systems | www.harbinger-systems.com Thank You! Visit us at: www.harbinger-systems.com Write to us at: hsinfo@harbingergroup.com Blog: blog.harbinger-systems.com Twitter: twitter.com/HarbingerSys (@HarbingerSys) Slideshare: slideshare.net/hsplmkting Facebook: facebook.com/harbingersys LinkedIn: linkedin.com/company/382306 Instagram: https://www.instagram.com/harbingersystems