SlideShare une entreprise Scribd logo
1  sur  28
Decision Forest
After Twenty Years
Lior Rokach
Dept. of Information Systems Engineering
Do we need hundreds of classifiers to solve real
world classification problems?
(Fernández-Delgado et al., 2014)
Empirically comparing
179 classification algorithms
over 121 datasets
“The classifier most likely to be the best is random forest
(achieves 94.1% of the maximum accuracy
overcoming 90% in the 84.3% of the data sets)”
Accumulated votes: 2154321
Classification by majority voting
New Instance: x
1
t 
1 1 2 1 2 1
T=7 classifiers
0 0 Final class: 1
t2t1
3
Obtained from Alberto Suárez, 2012
The Condorcet’s Jury Theorem
(Marquis of Condorcet,1784)
• The most basic jury theorem in social choice
• N = the number of jurors
• p = the probability of an individual juror being right
• µ= the probability that a jury gives the correct answer
• p > 0.5 implies µ > p.
• and µ  1 when N∞.
p = 0.6
µ
The Wisdom of Crowds
• Francis Galton promoted statistics and
invented the concept of correlation.
• In 1906 Galton visited a livestock fair and
stumbled upon an intriguing contest.
• An ox was on display, and the villagers were
invited to guess the animal's weight.
• Nearly 800 gave it a go and, not surprisingly,
not one hit the exact mark: 1,198 pounds.
• Astonishingly, however, the average of those
800 guesses came close - very close indeed. It
was 1,197 pounds.
Key Criteria for Crowd to be Wise
• Diversity of opinion
– Each person should have private information even if it's just an
eccentric interpretation of the known facts.
• Independence
– People's opinions aren't determined by the opinions of those
around them.
• Decentralization
– People are able to specialize and draw on local knowledge.
• Aggregation
– Some mechanism exists for turning private judgments into a
collective decision.
The Diversity Tradeoff
of individual trees
There’s no Real Tradeoff…
• Ideally, all trees would be right about
everything!
• If not, they should be wrong about different
cases.
Top Down Induction of Decision Trees
New Recipients
EmailLength
Email Len
<1.8 ≥1.8
HamSpam
1 Error 8 Errors
Top Down Induction of Decision Trees
New Recipients
EmailLength
Email Len
<1.8 ≥1.8
Spam
1 Error
Email Len
<4 ≥4
Spam
1 Error
Ham
3 Errors
Top Down Induction of Decision Trees
New Recipients
EmailLength
Email Len
<1.8 ≥1.8
Spam
1 Error
Email Len
<4 ≥4
Spam
1 Error
New Recip
<1 ≥1
Ham
1 Error
Spam
0 Errors
Which One?Top Down Induction of Decision Trees
Why Does Decision Forest Work?
• Local minima
• Lack of sufficient data
• Limited Representation
Bias
– The tendency to consistently learn the same wrong thing because
the hypothesis space considered by the learning algorithm does not
include sufficient hypotheses
Variance
– The tendency to learn random things irrespective of the real signal
due to the particular training set used
Bias and Variance Decomposition
Tree Size
It all started about two years ago …
Iterative Methods
• Reduce both Bias and Variance errors
• Hard to parallelize
• AdaBoost (Freund & Schapire, 1996)
• Gradient Boosted Trees (Friedman, 1999)
• Feature-based Partitioned Trees (Rokach,
2008)
• Stochastic gradient boosted distributed
decision trees (Ye et al., 2009)
• Parallel Boosted Regression Trees (Tyree
et al., 2011)
Non-Iterative Methods
• Mainly reduce variance error
• Embarrassingly parallel
• Random decision forests (Ho, 1995)
• Bagging (Bootstrap aggregating) (Breiman,
1996)
• Random Subspace Decision Forest (Ho,
1998)
• Randomized Tree (Dietterich, 2000)
• Random Forest (Breiman, 2001)
• Switching Classes (Martínez-Muñoz and
Suárez, 2005)
• Rotation Forest (Rodríguez et al., 2006)
• Extremely Randomized Trees (Geurts et al.,
2006)
• Randomly Projected Trees (Schclar and
Rokach, 2009)
Random decision forests [74]-1995-
-1996-
-1997-
-1998-
-1999-
-2000-
-2001-
-2002-
-2003-
-2004-
-2005-
-2006-
AdaBoost [33]
Bagging [72]
Random Subspace [99]
Random Forest [73]
Extremely Randomized Trees [2]
Rotation Forest [99]
Gradient Boosted Trees [84]
Iterative Methods Non-Iterative Methods
Random Forests
(Breiman, 2001)
1. A bootstrap random sample of size n sampled from
training set with replacement
2. Evaluate a node split on a random subset of variables
3. No pruning.
Limited Representation
19
Rotation Forest
(Rodríguez et al., 2006)
AdaBoost
(Freund & Schapire, 1996)
training cases correctly
classified
training case
has large weight
in this round
this DT has
a strong vote.
boosting rounds
“Best off-the-shelf classifier in the world” – Breiman (1996)
Training Errors vs Test Errors
Performance on ‘letter’ dataset
(Schapire et al. 1997)
Training
error
Test
error
Training error drops to 0 on round 5
Test error continues to drop after round 5
(from 8.4% to 3.1%)
Decision Forest Thinning:
Making the Forest Smaller
• Too thick decision forest results in:
– Large storage requirements
– Reduced compressibility
– Prolonged prediction time
– Reduced predictive performance
Forest thinning
• A post-processing step that aims to identify a
subset of decision trees that performs at least
as good as the original forest and discard any
other trees as redundant members.
• Collective-agreement-based Thinning
(Rokach, 2009):
Using best first search strategy and the above merit measure improves the accuracy of
the original forest by 2% on average while using only circa 3% of its trees (results based
on 30 different datasets)
Accumulated votes: 2154321
Instance-based (dynamical) Forest thinning
(Rokach, 2013)
New Instance: x
1
t 
1 1 2 1 2 1
T=7 classifiers
0 0 Final class: 1
Do we really need to query all classifiers in the ensemble?
NO
t2t1
Back To a Single Tree
Genuine
Training
Set
Artificially expanded
Training Set
The problem: The resulted forest
is far from being compact.
Decision Forest for Mitigating
Learning Challenges
• Class imbalance
• Concept Drift
• Curse of dimensionality
• Multi-label classification
Beyond Classification Tasks
• Regression tree (Breiman et al., 1984)
• Survival tree (Bou-Hamad et al., 2011)
• Clustering tree (Blockeel et al., 1998)
• Recommendation tree (Gershman et al., 2010):
• Markov model tree (Antwarg et al., 2012)
• ….
Summary
• “Two heads are better than none. One
hundred heads are so much better than
one”
– Dearg Doom, The Tain, Horslips, 1973
• “Great minds think alike, clever minds
think together” Lior Zoref, 2011.
• But they must be different, specialized
• And it might be an idea to select only the
best of them for the problem at hand

Contenu connexe

En vedette

Pythonで機械学習を自動化 auto sklearn
Pythonで機械学習を自動化 auto sklearnPythonで機械学習を自動化 auto sklearn
Pythonで機械学習を自動化 auto sklearnYukino Ikegami
 
Building a Large Scale SEO/SEM Application with Apache Solr
Building a Large Scale SEO/SEM Application with Apache SolrBuilding a Large Scale SEO/SEM Application with Apache Solr
Building a Large Scale SEO/SEM Application with Apache SolrRahul Jain
 
Case study of Rujhaan.com (A social news app )
Case study of Rujhaan.com (A social news app )Case study of Rujhaan.com (A social news app )
Case study of Rujhaan.com (A social news app )Rahul Jain
 
Augmented reality in future cars
Augmented reality in future carsAugmented reality in future cars
Augmented reality in future carsPrathamesh Barah
 
Machine Learning and Data Mining: 16 Classifiers Ensembles
Machine Learning and Data Mining: 16 Classifiers EnsemblesMachine Learning and Data Mining: 16 Classifiers Ensembles
Machine Learning and Data Mining: 16 Classifiers EnsemblesPier Luca Lanzi
 
Lecture 6: Ensemble Methods
Lecture 6: Ensemble Methods Lecture 6: Ensemble Methods
Lecture 6: Ensemble Methods Marina Santini
 
Decision Tree Ensembles - Bagging, Random Forest & Gradient Boosting Machines
Decision Tree Ensembles - Bagging, Random Forest & Gradient Boosting MachinesDecision Tree Ensembles - Bagging, Random Forest & Gradient Boosting Machines
Decision Tree Ensembles - Bagging, Random Forest & Gradient Boosting MachinesDeepak George
 
Automatic Machine Learning, AutoML
Automatic Machine Learning, AutoMLAutomatic Machine Learning, AutoML
Automatic Machine Learning, AutoMLHimadri Mishra
 
アンサンブル学習
アンサンブル学習アンサンブル学習
アンサンブル学習Hidekazu Tanaka
 
Emerging technologies /frameworks in Big Data
Emerging technologies /frameworks in Big DataEmerging technologies /frameworks in Big Data
Emerging technologies /frameworks in Big DataRahul Jain
 
Heads Up Display : A smart navigation system
Heads Up Display : A smart navigation systemHeads Up Display : A smart navigation system
Heads Up Display : A smart navigation systemMphasis
 
Introduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and UsecasesIntroduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and UsecasesRahul Jain
 
Introduction to Elasticsearch with basics of Lucene
Introduction to Elasticsearch with basics of LuceneIntroduction to Elasticsearch with basics of Lucene
Introduction to Elasticsearch with basics of LuceneRahul Jain
 
Understanding Random Forests: From Theory to Practice
Understanding Random Forests: From Theory to PracticeUnderstanding Random Forests: From Theory to Practice
Understanding Random Forests: From Theory to PracticeGilles Louppe
 
Introduction to Scala
Introduction to ScalaIntroduction to Scala
Introduction to ScalaRahul Jain
 

En vedette (18)

HUD 232 Lean Financing: A Primer
HUD 232 Lean Financing: A PrimerHUD 232 Lean Financing: A Primer
HUD 232 Lean Financing: A Primer
 
Pythonで機械学習を自動化 auto sklearn
Pythonで機械学習を自動化 auto sklearnPythonで機械学習を自動化 auto sklearn
Pythonで機械学習を自動化 auto sklearn
 
Windshields of the future
Windshields of the futureWindshields of the future
Windshields of the future
 
Building a Large Scale SEO/SEM Application with Apache Solr
Building a Large Scale SEO/SEM Application with Apache SolrBuilding a Large Scale SEO/SEM Application with Apache Solr
Building a Large Scale SEO/SEM Application with Apache Solr
 
Case study of Rujhaan.com (A social news app )
Case study of Rujhaan.com (A social news app )Case study of Rujhaan.com (A social news app )
Case study of Rujhaan.com (A social news app )
 
Augmented reality in future cars
Augmented reality in future carsAugmented reality in future cars
Augmented reality in future cars
 
Machine Learning and Data Mining: 16 Classifiers Ensembles
Machine Learning and Data Mining: 16 Classifiers EnsemblesMachine Learning and Data Mining: 16 Classifiers Ensembles
Machine Learning and Data Mining: 16 Classifiers Ensembles
 
Lecture 6: Ensemble Methods
Lecture 6: Ensemble Methods Lecture 6: Ensemble Methods
Lecture 6: Ensemble Methods
 
Decision Tree Ensembles - Bagging, Random Forest & Gradient Boosting Machines
Decision Tree Ensembles - Bagging, Random Forest & Gradient Boosting MachinesDecision Tree Ensembles - Bagging, Random Forest & Gradient Boosting Machines
Decision Tree Ensembles - Bagging, Random Forest & Gradient Boosting Machines
 
Automatic Machine Learning, AutoML
Automatic Machine Learning, AutoMLAutomatic Machine Learning, AutoML
Automatic Machine Learning, AutoML
 
アンサンブル学習
アンサンブル学習アンサンブル学習
アンサンブル学習
 
Emerging technologies /frameworks in Big Data
Emerging technologies /frameworks in Big DataEmerging technologies /frameworks in Big Data
Emerging technologies /frameworks in Big Data
 
Heads Up Display : A smart navigation system
Heads Up Display : A smart navigation systemHeads Up Display : A smart navigation system
Heads Up Display : A smart navigation system
 
Introduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and UsecasesIntroduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and Usecases
 
Introduction to Elasticsearch with basics of Lucene
Introduction to Elasticsearch with basics of LuceneIntroduction to Elasticsearch with basics of Lucene
Introduction to Elasticsearch with basics of Lucene
 
Understanding Random Forests: From Theory to Practice
Understanding Random Forests: From Theory to PracticeUnderstanding Random Forests: From Theory to Practice
Understanding Random Forests: From Theory to Practice
 
Introduction to Scala
Introduction to ScalaIntroduction to Scala
Introduction to Scala
 
Apache kafka
Apache kafkaApache kafka
Apache kafka
 

Similaire à Decision Forest: Twenty Years of Research

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningAI Summary
 
Causal Random Forest
Causal Random ForestCausal Random Forest
Causal Random ForestBong-Ho Lee
 
Franhouder july2013
Franhouder july2013Franhouder july2013
Franhouder july2013CS, NcState
 
Engineering design of an environmental management system: A trans-disciplinar...
Engineering design of an environmental management system: A trans-disciplinar...Engineering design of an environmental management system: A trans-disciplinar...
Engineering design of an environmental management system: A trans-disciplinar...Henk (Jan) Roodt
 
Comparison Study of Decision Tree Ensembles for Regression
Comparison Study of Decision Tree Ensembles for RegressionComparison Study of Decision Tree Ensembles for Regression
Comparison Study of Decision Tree Ensembles for RegressionSeonho Park
 
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...Maninda Edirisooriya
 
Genetic algorithm raktim
Genetic algorithm raktimGenetic algorithm raktim
Genetic algorithm raktimRaktim Halder
 
Genetic algorithm_raktim_IITKGP
Genetic algorithm_raktim_IITKGP Genetic algorithm_raktim_IITKGP
Genetic algorithm_raktim_IITKGP Raktim Halder
 
math bio for 1st year math students
math bio for 1st year math studentsmath bio for 1st year math students
math bio for 1st year math studentsBen Bolker
 
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...Simplilearn
 
From decision trees to random forests
From decision trees to random forestsFrom decision trees to random forests
From decision trees to random forestsViet-Trung TRAN
 
Decision Tree.pptx
Decision Tree.pptxDecision Tree.pptx
Decision Tree.pptxkibriaswe
 

Similaire à Decision Forest: Twenty Years of Research (20)

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Causal Random Forest
Causal Random ForestCausal Random Forest
Causal Random Forest
 
Decision tree
Decision treeDecision tree
Decision tree
 
DT.pptx
DT.pptxDT.pptx
DT.pptx
 
Franhouder july2013
Franhouder july2013Franhouder july2013
Franhouder july2013
 
Engineering design of an environmental management system: A trans-disciplinar...
Engineering design of an environmental management system: A trans-disciplinar...Engineering design of an environmental management system: A trans-disciplinar...
Engineering design of an environmental management system: A trans-disciplinar...
 
Comparison Study of Decision Tree Ensembles for Regression
Comparison Study of Decision Tree Ensembles for RegressionComparison Study of Decision Tree Ensembles for Regression
Comparison Study of Decision Tree Ensembles for Regression
 
Judgment and decision making
Judgment and decision makingJudgment and decision making
Judgment and decision making
 
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
 
Genetic algorithm raktim
Genetic algorithm raktimGenetic algorithm raktim
Genetic algorithm raktim
 
Genetic algorithm_raktim_IITKGP
Genetic algorithm_raktim_IITKGP Genetic algorithm_raktim_IITKGP
Genetic algorithm_raktim_IITKGP
 
math bio for 1st year math students
math bio for 1st year math studentsmath bio for 1st year math students
math bio for 1st year math students
 
L4. Ensembles of Decision Trees
L4. Ensembles of Decision TreesL4. Ensembles of Decision Trees
L4. Ensembles of Decision Trees
 
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
 
From decision trees to random forests
From decision trees to random forestsFrom decision trees to random forests
From decision trees to random forests
 
CI_L02_Optimization_ag2_eng.pdf
CI_L02_Optimization_ag2_eng.pdfCI_L02_Optimization_ag2_eng.pdf
CI_L02_Optimization_ag2_eng.pdf
 
Gigerenzer
GigerenzerGigerenzer
Gigerenzer
 
Random Forests Lightning Talk
Random Forests Lightning TalkRandom Forests Lightning Talk
Random Forests Lightning Talk
 
Decision Tree.pptx
Decision Tree.pptxDecision Tree.pptx
Decision Tree.pptx
 
Genetic algorithm
Genetic algorithmGenetic algorithm
Genetic algorithm
 

Dernier

Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptJoemSTuliba
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxmaryFF1
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 

Dernier (20)

Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.ppt
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 

Decision Forest: Twenty Years of Research

  • 1. Decision Forest After Twenty Years Lior Rokach Dept. of Information Systems Engineering
  • 2. Do we need hundreds of classifiers to solve real world classification problems? (Fernández-Delgado et al., 2014) Empirically comparing 179 classification algorithms over 121 datasets “The classifier most likely to be the best is random forest (achieves 94.1% of the maximum accuracy overcoming 90% in the 84.3% of the data sets)”
  • 3. Accumulated votes: 2154321 Classification by majority voting New Instance: x 1 t  1 1 2 1 2 1 T=7 classifiers 0 0 Final class: 1 t2t1 3 Obtained from Alberto Suárez, 2012
  • 4. The Condorcet’s Jury Theorem (Marquis of Condorcet,1784) • The most basic jury theorem in social choice • N = the number of jurors • p = the probability of an individual juror being right • µ= the probability that a jury gives the correct answer • p > 0.5 implies µ > p. • and µ  1 when N∞. p = 0.6 µ
  • 5. The Wisdom of Crowds • Francis Galton promoted statistics and invented the concept of correlation. • In 1906 Galton visited a livestock fair and stumbled upon an intriguing contest. • An ox was on display, and the villagers were invited to guess the animal's weight. • Nearly 800 gave it a go and, not surprisingly, not one hit the exact mark: 1,198 pounds. • Astonishingly, however, the average of those 800 guesses came close - very close indeed. It was 1,197 pounds.
  • 6. Key Criteria for Crowd to be Wise • Diversity of opinion – Each person should have private information even if it's just an eccentric interpretation of the known facts. • Independence – People's opinions aren't determined by the opinions of those around them. • Decentralization – People are able to specialize and draw on local knowledge. • Aggregation – Some mechanism exists for turning private judgments into a collective decision.
  • 7. The Diversity Tradeoff of individual trees
  • 8. There’s no Real Tradeoff… • Ideally, all trees would be right about everything! • If not, they should be wrong about different cases.
  • 9. Top Down Induction of Decision Trees New Recipients EmailLength Email Len <1.8 ≥1.8 HamSpam 1 Error 8 Errors
  • 10. Top Down Induction of Decision Trees New Recipients EmailLength Email Len <1.8 ≥1.8 Spam 1 Error Email Len <4 ≥4 Spam 1 Error Ham 3 Errors
  • 11. Top Down Induction of Decision Trees New Recipients EmailLength Email Len <1.8 ≥1.8 Spam 1 Error Email Len <4 ≥4 Spam 1 Error New Recip <1 ≥1 Ham 1 Error Spam 0 Errors
  • 12. Which One?Top Down Induction of Decision Trees
  • 13. Why Does Decision Forest Work? • Local minima • Lack of sufficient data • Limited Representation
  • 14. Bias – The tendency to consistently learn the same wrong thing because the hypothesis space considered by the learning algorithm does not include sufficient hypotheses Variance – The tendency to learn random things irrespective of the real signal due to the particular training set used Bias and Variance Decomposition Tree Size
  • 15. It all started about two years ago … Iterative Methods • Reduce both Bias and Variance errors • Hard to parallelize • AdaBoost (Freund & Schapire, 1996) • Gradient Boosted Trees (Friedman, 1999) • Feature-based Partitioned Trees (Rokach, 2008) • Stochastic gradient boosted distributed decision trees (Ye et al., 2009) • Parallel Boosted Regression Trees (Tyree et al., 2011) Non-Iterative Methods • Mainly reduce variance error • Embarrassingly parallel • Random decision forests (Ho, 1995) • Bagging (Bootstrap aggregating) (Breiman, 1996) • Random Subspace Decision Forest (Ho, 1998) • Randomized Tree (Dietterich, 2000) • Random Forest (Breiman, 2001) • Switching Classes (Martínez-Muñoz and Suárez, 2005) • Rotation Forest (Rodríguez et al., 2006) • Extremely Randomized Trees (Geurts et al., 2006) • Randomly Projected Trees (Schclar and Rokach, 2009)
  • 16. Random decision forests [74]-1995- -1996- -1997- -1998- -1999- -2000- -2001- -2002- -2003- -2004- -2005- -2006- AdaBoost [33] Bagging [72] Random Subspace [99] Random Forest [73] Extremely Randomized Trees [2] Rotation Forest [99] Gradient Boosted Trees [84] Iterative Methods Non-Iterative Methods
  • 17. Random Forests (Breiman, 2001) 1. A bootstrap random sample of size n sampled from training set with replacement 2. Evaluate a node split on a random subset of variables 3. No pruning.
  • 20. AdaBoost (Freund & Schapire, 1996) training cases correctly classified training case has large weight in this round this DT has a strong vote. boosting rounds “Best off-the-shelf classifier in the world” – Breiman (1996)
  • 21. Training Errors vs Test Errors Performance on ‘letter’ dataset (Schapire et al. 1997) Training error Test error Training error drops to 0 on round 5 Test error continues to drop after round 5 (from 8.4% to 3.1%)
  • 22. Decision Forest Thinning: Making the Forest Smaller • Too thick decision forest results in: – Large storage requirements – Reduced compressibility – Prolonged prediction time – Reduced predictive performance
  • 23. Forest thinning • A post-processing step that aims to identify a subset of decision trees that performs at least as good as the original forest and discard any other trees as redundant members. • Collective-agreement-based Thinning (Rokach, 2009): Using best first search strategy and the above merit measure improves the accuracy of the original forest by 2% on average while using only circa 3% of its trees (results based on 30 different datasets)
  • 24. Accumulated votes: 2154321 Instance-based (dynamical) Forest thinning (Rokach, 2013) New Instance: x 1 t  1 1 2 1 2 1 T=7 classifiers 0 0 Final class: 1 Do we really need to query all classifiers in the ensemble? NO t2t1
  • 25. Back To a Single Tree Genuine Training Set Artificially expanded Training Set The problem: The resulted forest is far from being compact.
  • 26. Decision Forest for Mitigating Learning Challenges • Class imbalance • Concept Drift • Curse of dimensionality • Multi-label classification
  • 27. Beyond Classification Tasks • Regression tree (Breiman et al., 1984) • Survival tree (Bou-Hamad et al., 2011) • Clustering tree (Blockeel et al., 1998) • Recommendation tree (Gershman et al., 2010): • Markov model tree (Antwarg et al., 2012) • ….
  • 28. Summary • “Two heads are better than none. One hundred heads are so much better than one” – Dearg Doom, The Tain, Horslips, 1973 • “Great minds think alike, clever minds think together” Lior Zoref, 2011. • But they must be different, specialized • And it might be an idea to select only the best of them for the problem at hand