SlideShare une entreprise Scribd logo
1  sur  108
Télécharger pour lire hors ligne
Gene tree-species tree methods in 
RevBayes 
Bastien Boussau 
LBBE, CNRS, Université de Lyon
What I spend my time doing 
• Developing methods for: 
• molecular evolution 
• phylogenetic reconstruction 
• genomics, genome scale phylogenetic 
reconstruction 
• Using these methods to investigate: 
• Genome evolution (gene family evolution, drift) 
• Evolution of life on Earth (species trees, ancestral 
trait reconstruction)
Why the tautological title? 
“Phylogenomic” has been used to describe attempts at 
reconstructing species trees based on 10-100 genes 
We are interested in the species tree, but also in 
genome evolution 
A genome can contains >20,000 genes 
Genome-scale: thousands of genes, dozens of species
Genomes, Genes, gene families, 
gene trees and species trees 
Genome
Genomes, Genes, gene families, 
gene trees and species trees 
Genome 
Gene Homo sapiens; GeneA: ACTGGTGATGACATGAC…
Genomes, Genes, gene families, 
gene trees and species trees 
Genome 
Gene Homo sapiens; GeneA: ACTGGTGATGACATGAC… 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus; 
Gene family 
alignment
Genomes, Genes, gene families, 
gene trees and species trees 
Genome 
Gene Homo sapiens; GeneA: ACTGGTGATGACATGAC… 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus; 
Gene family 
alignment 
Gene tree
Genomes, Genes, gene families, 
gene trees and species trees 
Genome 
Gene Homo sapiens; GeneA: ACTGGTGATGACATGAC… 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus; 
Gene family 
alignment 
Gene tree 
Species trees
A graphical model 
for phylogenomics 
Species tree 
Gene tree 
Gene family alignment 
Gene 
Genome sequences 
Raw sequences
A graphical model 
for phylogenomics 
Species tree 
Species tree construction 
Gene tree 
Gene tree construction 
Gene family alignment 
Homology prediction 
Alignment 
Gene 
Gene prediction 
Genome sequences 
Error correction, base 
calling, assembly 
Raw sequences
A graphical model 
for phylogenomics 
Species tree 
Species tree construction 
Gene tree 
Gene tree construction 
Gene family alignment 
Homology prediction 
Alignment 
Gene 
Gene prediction 
Genome sequences 
Error correction, base 
calling, assembly 
Raw sequences
The usual approach to 
reconstructing phylogenetic trees 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus;
The usual approach to 
reconstructing phylogenetic trees 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus; 
• Parsimony 
• Model-based approaches: e.g. Felsenstein pruning 
algorithm (1981) to compute P(alignment|Gene tree)
The usual approach to 
reconstructing phylogenetic trees 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus; 
• Parsimony 
• Model-based approaches: e.g. Felsenstein pruning 
algorithm (1981) to compute P(alignment|Gene tree) 
No species tree object in the usual 
approach
The gene tree graphical model 
Gene tree 
Gene alignment 
PhyloCTMC 
Felsenstein pruning algorithm (1981)
Gene trees inferred from 
sequences alone 
Homolens database
Gene trees inferred from 
sequences alone 
Homolens database 
Complex, messy, not to be trusted
The species tree-gene tree approach 
to reconstructing phylogenetic trees 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus;
The species tree-gene tree approach 
to reconstructing phylogenetic trees 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus; 
P(alignment|Gene tree)
The species tree-gene tree approach 
to reconstructing phylogenetic trees 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus; 
P(alignment|Gene tree) P(Gene tree|Species tree)
The species tree-gene tree approach 
to reconstructing phylogenetic trees 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus; 
P(alignment|Gene tree) P(Gene tree|Species tree) 
P(alignment,Gene tree|Species tree)=P(alignment|Gene tree)xP(Gene tree|Species tree) 
!
The species tree-gene tree approach 
to reconstructing phylogenetic trees 
GeneA: ACTGGTGATGACATAAC…! 
GeneB: ACTGTTGATGACATGAC…! 
GeneC: ACTGATGATGACAAGAC…! 
GeneD: ACTGGTGA--CCATGAC…! 
GeneE: ACTGGTGATGACACGAC…! 
GeneF: ACT--TCATGAAACGAC…! 
Homo sapiens; 
Homo sapiens; 
Mus musculus; 
Mus musculus; 
Bison bison; 
Canis lupus; 
P(alignment|Gene tree) P(Gene tree|Species tree) 
Felsenstein pruning algorithm (1981) Many models and algorithms 
P(alignment,Gene tree|Species tree)=P(alignment|Gene tree)xP(Gene tree|Species tree) 
!
The species tree-gene tree 
graphical model 
Species tree 
Gene tree 
Gene alignment 
Many models and algorithms 
Felsenstein pruning algorithm (1981)
11 
The species tree-gene tree 
graphical model 
Hoehna et al., Syst Biol 2014
point mutation 
AATAC⬅ATTAC 
Substituion 
point mutation 
AATAG⬅AATAC 
12 
A hierarchy of processes 
population of 
individuals 
species diversification gene birth and death 
a) b) c) 
enlarged below enlarged below 
gene 
duplication gene 
gene 
transfer 
loss 
extinct 
linage 
species 
extinctions 
speciations 
X 
X 
X 
X 
X 
present extant species 
sampled genomes 
sequence substitution 
X ATTAC 
X 
X 
X 
X 
X 
speciation 
AATAG AATAC 
ATTAC 
speciation 
(species tree) (locus tree) (gene tree) 
time 
population of 
species 
X 
X 
X 
X 
X 
speciation 
speciation 
genes in genomes nucleotides in genes 
Szöllősi et al., Syst Biol 2014
Gene tree and species tree are linked 
Boussau and Daubin, Tree 2010
Gene trees in species trees 
Maddison, Syst. Biol. 1997
Gene trees in species trees 
Maddison, Syst. Biol. 1997 
Challenges 
Given gene trees: reconstruct species tree 
Given species tree and alignment: reconstruct gene 
tree 
Given alignments: reconstruct genes and species trees
The landscape of gene tree-species tree methods 
POMO 
(De Maio et al., 2013) 
Szöllősi et al., Syst Biol 2014
The landscape of gene tree-species tree methods 
POMO 
(De Maio et al., 2013) 
Szöllősi et al., Syst Biol 2014 
RevBayes
D
D DL
D DL LGT
D DL LGT ILS
PHYLDOG 
D DL LGT ILS
PHYLDOG 
D DL LGT ILS 
LGT: 
exODT
PHYLDOG 
D DL LGT ILS 
LGT: 
exODT 
ILS: 
MSC
Plan 
1. Modeling the relationship between species tree 
and gene tree 
– coalescent models 
– models of gene duplication and loss 
– models of gene transfer 
– models that combine the above
D DL LGT ILS
Incomplete lineage sorting 
Degnan and Rosenberg, Syst. Biol. 2006 
Maddison, Syst. Biol. 1997
Reconstructing species trees given ILS 
•Parsimony method: minimizing deep coalescences 
(Maddison 1997) 
•Distance method: NJst (Liu and Yu 2011) 
•Summary statistic methods: STAR (Liu et al, 2009), 
GLASS (Mossel and Roch 2010), iGLASS (Jewett and 
Rosenberg 2012), ASTRAL (Mirarab et al., 2014) 
•Maximum Likelihood methods: STEM (Kubatko et al. 
2009), MP-EST (Liu et al., 2010) 
•Bayesian methods: BeST (2007), *BEAST (Heled and 
Drummond, 2009) 
•Bypassing the gene trees with math tricks: SNAPP 
(Bryant et al., 2012), POMO (DeMaio et al., 2013), 
SVDQuartets (Chifman and Kubatko, 2014)
Likelihood of a gene tree given a species tree 
under the multi-species coalescent 
Rannala and Yang, Genetics 2003
Likelihood of a gene tree given a species tree 
under the multi-species coalescent 
Rannala and Yang, Genetics 2003
The species tree-gene tree 
graphical model with ILS 
Species tree topology 
Gene tree 
Gene alignment 
Ne 
T r
The POMO 
model 
De Maio et al., MBE 2013
The POMO 
model 
De Maio et al., MBE 2013
The POMO model 
De Maio et al., MBE 2013
Pros and cons of the Pomo model 
1. Pros: 
1. bypasses gene tree estimation (easy to combine 
with DL or DTL models…) 
2. does not assume linkage between sites 
2. Cons: 
1. You don’t get gene trees 
2. It’s not sure what branch lengths mean anymore 
3. matrix can become big 
4. Only bimorphic sites are considered: 3- or 4- 
morphic sites need to be handled differently 
5. Not sure how it behaves in practice
Plan 
1. Modeling the relationship between species tree 
and gene tree 
– coalescent models 
– models of gene duplication and loss 
– models of gene transfer 
– models that combine the above
D DL LGT ILS
DL models 
•Reconciliation between a gene tree and a species tree: 
•Parsimony method: minimizing numbers of duplications 
(Goodman et al. 1979; Page 1994; Zmasek and Eddy 2001) 
•Model-based method: Arvestad et al. 2003, 2004, 2009; Görecki 
et al. 2011 
•Reconstruction of a gene tree given a species tree and 
alignment: 
•Parsimony method: Chen et al. 2000; Durand et al. 2006 
•Model-based method: Rasmussen and Kellis 2007, 2011; 
Akerborg et al. 2009; Sjöstrand et al. 2012 
•Reconstruction of a species tree given gene trees: 
•Parsimony methods: Page and Charleston 1997, Bansal et al. 
2007; Wehe et al. 2008; Bansal et al. 2010; Chang et al. 2011 
•Joint reconstruction of gene and species trees given alignments: 
•Model-based method: Boussau et al., 2013
Many possible reconciliations 
Sennblad and Lagergren, Syst. Biol. 2009
Many possible reconciliations 
Need to integrate over all 
possible reconciliations 
Sennblad and Lagergren, Syst. Biol. 2009
Considering all mappings: 
The species tree discretization approximation 
Tofigh, PhD thesis
Considering all mappings: 
The species tree discretization approximation 
Tofigh, PhD thesis 
Mapping between a gene tree 
and species tree 
⬄ 
Mapping between nodes of 
the gene tree and nodes of 
the augmented species tree
Considering all mappings: 
The species tree discretization approximation 
Tofigh, PhD thesis 
Mapping between a gene tree 
and species tree 
⬄ 
Mapping between nodes of 
the gene tree and nodes of 
the augmented species tree 
Integrating over all mappings: 
done through dynamic 
programming from the leaves 
of the gene tree to its root
Birth-death models for DL 
M o d e l - b a s e d 
approaches: 
often use a birth-death 
process (Arvestad et al., 
2003, 2004, 2009; Akerborg et 
al., 2009; Sjöstrand et al., 2012; 
Boussau et al., 2013) 
Birth=Duplication: λ 
Death=Loss: μ 
Akerborg et al., PNAS 2009
The Akerborg et al. 2009 model 
• Input: Species tree with time, rooted gene tree, λ 
and μ assumed to be known 
• Output: reconciliation between species tree and 
gene tree 
• Relaxed-clock model 
• Gene trees generated according to a birth-death 
model
The Akerborg et al. 2009 model 
• Input: Species tree with time, rooted gene tree, λ 
and μ assumed to be known 
• Output: reconciliation between species tree and 
gene tree 
• Relaxed-clock model 
• Gene trees generated according to a birth-death 
model 
Felsenstein 1981 Gamma prior Dynamic 
programming 
to integrate 
over all 
mappings with 
BD model
The species tree-gene tree graphical model with DL 
(Akerborg 2009) 
Sp tree 
Topology 
T r 
Gene tree 
Gene alignment 
μ 
λ
The Akerborg et al. 2009 model 
vs PHYLDOG 
• Akerborg et al. 2009: 
• Generative model 
• Integrates over a large space of mappings 
• Integrates over all possible scenarios of 
duplications and losses 
PHYLDOG 
• 
use unrooted, non-clock gene trees 
• Use the most parsimonious mapping 
• Use a double-recursive tree traversal 
• Integrates over a large subset of possible scenarios of 
duplications and losses (Boussau et al., Genome 
Research 2013)
Plan 
1. Modeling the relationship between species tree 
and gene tree 
– coalescent models 
– models of gene duplication and loss 
– models of gene transfer 
– models that combine the above
D DL LGT ILS
Pure transfer models 
•Identification of transfers (not probabilistic): 
•Nakleh et al., 2005; Beiko and Hamilton 2006; Abby et 
al., 2010; Hill et al. 2010 
•Joint reconstruction of gene and species trees given 
alignments: 
•Model-based method: Suchard, Genetics 2005
Suchard “random walk through tree space” model 
•Input: gene alignments 
•Output: species tree, gene trees 
•Each gene tree is a Poisson number of transfers away 
from the species tree 
•Random walk through gene trees 
•C classes of genes, with different propensities to be 
transferred 
•Has been used with at most 6 species, 144 orthologous 
gene alignments, but could be used with 8 species or even 
more Suchard, Genetics 2005
The species tree-gene tree graphical 
model with transfers (Suchard 2005) 
Species tree 
Topology 
Gene tree 
Gene alignment 
Gene category 
bl 
Expected number of 
transfers
Plan 
1. Modeling the relationship between species tree 
and gene tree 
– coalescent models 
– models of gene duplication and loss 
– models of gene transfer 
– models that combine the above
Model that combines DL and ILS 
•Reconciliation between gene tree and species tree: 
•Model-based method: Rasmussen and Kellis, Genome 
Research 2012 
! 
•Input: dated and rooted species tree, DL rates, Ne 
•Output: Locus tree and DL+ILS scenario
D DL LGT ILS
The Rasmussen and Kellis model (2012) 
Species tree 
Gene tree 
Gene alignment
The Rasmussen and Kellis model (2012) 
Species tree 
Gene tree 
Gene alignment 
Species tree 
Locus tree 
Gene tree 
Gene alignment
The Rasmussen and Kellis model (2012) 
Species tree 
Gene tree 
Gene alignment 
Species tree 
DL model 
Locus tree 
Coalescent model 
Gene tree 
Gene alignment
The Rasmussen and Kellis model (2012) 
Species tree 
DL model 
Locus tree 
Coalescent model 
Gene tree 
Gene alignment
The Rasmussen and Kellis model (2012) 
Species tree 
DL model 
Locus tree 
Coalescent model 
Gene tree 
Gene alignment 
In practice: 
Species tree 
DL model 
Locus tree 
Coalescent model 
Gene tree
The Rasmussen and Kellis model (2012)
The Rasmussen and Kellis model (2012)
The species tree-gene tree graphical model 
with DL + ILS (Rasmussen and Kellis 2012) 
Sp tree 
Topology 
Locus tree 
Gene tree 
μ 
r 
T 
λ 
Ne
D DL LGT ILS
Model that combines DL and T 
•Reconciliation between a species tree and a gene tree: 
Parsimony setting: Gorecki, RECOMB 2004; Gorbunov 
and Lyubetski Mol. Biol. (Mosk), 2009; Libeskind-Hadas 
and Charleston, JCB 2009; Tofigh et al., IEEE ACM 2011; 
Doyon et al., Comparative Genomics 2011 
! 
•Reconstruction of a species tree given gene trees: 
•Model-based method: Szöllősi et al., PNAS 2012; 
Szöllősi et al., Systematic Biology 2013a, 2013b; 
Sjöstrand et al., Syst Biol 2014
Dating the tree of life is difficult
Dating the tree of life is difficult 
100 
100 100
Dating the tree of life is difficult 
100 
10 mya 
100 100
Dating the tree of life is difficult 
100 
10 mya 
100 100 
20 mya
Dating the tree of life is difficult 
100 
10 mya 
100 100 
20 mya 
Very few fossils for the first 3 billion years of 
evolution on Earth 
Other constraints are needed to date the tree of 
life
Using transfers to date clades
Using transfers to date clades
Using transfers to date clades
Using transfers to date clades
Gene transfers provide means to root 
Szöllősi et al., PNAS 2012 
and date species trees
Gene transfers provide means to root 
Szöllősi et al., PNAS 2012 
and date species trees
Avoiding back to the future scenarios 
Doyon et al., Comparative Genomics 2011
Avoiding back to the future scenarios 
➔ Algorithm in O (#NodesInS’ . #NodesInG ) 
Doyon et al., Comparative Genomics 2011
6 possible events are enough 
Doyon et al., Comparative Genomics 2011
Dynamic programming in the Doyon et 
al. 2011 approach 
•Input: rooted augmented species tree S’, rooted gene 
tree G 
•Output: reconciliation scenario between S’ and G 
•Finding scenarios can be done in polynomial time using 
DP: 
•Fill the cost matrix c (#NodesInG, #NodesInS’) 
•Post-order traversal of the gene tree 
•For each node i of G: 
•Map it to each node j of S’ 
•Consider all of 6 events, and set c(i, 
j)=min(cost(possible events)) 
•At the root, return the minimum cost found
From parsimony to model-based 
•Input: gene trees 
•Output: reconciled rooted gene trees, rooted species tree 
with ordered nodes 
•Birth-death process to model gene evolution: 
•Birth parameters: 
•D 
•T: rate of receiving a gene through transfer 
•Death parameter: 
•L 
•One or more sets of D, T, L parameters 
•Double recursive DP for computing P(gene tree|species 
tree) 
•Integrating over all possible scenarios 
•Sliced species tree 
Szöllősi et al., PNAS 2012
Use of the 6 
events from 
Doyon et al. 
2011 
Szöllősi et al., PNAS 2012
Within-slice discretization for solving 
ordinary differential equations 
Szöllősi et al., PNAS 2012
point mutation 
AATAC⬅ATTAC 
Substituion 
point mutation 
AATAG⬅AATAC 
59 
A hierarchy of processes 
population of 
individuals 
species diversification gene birth and death 
a) b) c) 
enlarged below enlarged below 
gene 
duplication gene 
gene 
transfer 
loss 
extinct 
linage 
species 
extinctions 
speciations 
X 
X 
X 
X 
X 
present extant species 
sampled genomes 
sequence substitution 
X ATTAC 
X 
X 
X 
X 
X 
speciation 
AATAG AATAC 
ATTAC 
speciation 
(species tree) (locus tree) (gene tree) 
time 
population of 
species 
X 
X 
X 
X 
X 
speciation 
speciation 
genes in genomes nucleotides in genes 
Szöllősi et al., Syst Biol 2014
Further refinement: 
seeing dead lineages 
Szöllősi et al., Syst. Biol 2013
Further refinement: 
seeing dead lineages 
Szöllősi et al., Syst. Biol 2013
The exODT graphical model 
N σ 
Species tree 
Topology 
Gene tree 
D 
Population of 
species 
T L
Plan 
Modeling the relationship between species tree and gene tree 
– coalescent models 
– models of gene duplication and loss 
– models of gene transfer 
– models that combine the above 
Gene tree species tree models in RevBayes, and tutorial
Gene tree species tree models in 
RevBayes 
•Multispecies coalescent 
•Pomo model 
•In progress: 
•exODT (DTL model) 
•Soon: 
•Akerborg et al. 2009 (DL model) 
•MP-EST
The tutorial 
•A walk around the multispecies coalescent: 
1. Simulation with the multispecies coalescent (play 
with the parameters!) 
2. Then: 
• Inference with the multispecies coalescent from 
gene trees 
• Inference with the multispecies coalescent from 
gene alignments 
• Inference with the POMO model from gene 
alignments 
• Inference by concatenating the gene alignments
The tutorial 
Files are in: 
tutorials/RB_GeneTreeSpeciesTree_Tutorial/RB_tutorial_files/RevBayes_scripts/ 
in five folders. It is highly recommended to start RevBayes from each 
of these folders when doing the exercises to avoid path problems. 
There is one simulation-only folder, used for better understanding the 
multispecies coalescent: 
UnderstandingMultiSpeciesCoalescent 
Then there are 4 inference folders. 
In each inference folder, there are 3 scripts. The file XXXMCMC.Rev is the 
one that can launch the entire analysis, which consists of three parts: 
Simulating the data 
Constructing the model for inference 
Preparing and running the MCMC

Contenu connexe

Tendances

Artificial chromosomes
Artificial chromosomesArtificial chromosomes
Artificial chromosomesDarshana Ajith
 
Machine Learning in Bioinformatics
Machine Learning in BioinformaticsMachine Learning in Bioinformatics
Machine Learning in BioinformaticsDmytro Fishman
 
Notes genetic drift
Notes genetic driftNotes genetic drift
Notes genetic driftstewart_j
 
ویرایش ژنوم Genome editing tools
ویرایش ژنوم Genome editing toolsویرایش ژنوم Genome editing tools
ویرایش ژنوم Genome editing toolsshahnam azizi
 
Linkage analysis
Linkage analysisLinkage analysis
Linkage analysisUshaYadav24
 
Protein structure
Protein structureProtein structure
Protein structurePooja Pawar
 
Molecular markr sscp
Molecular markr sscpMolecular markr sscp
Molecular markr sscpKAUSHAL SAHU
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data miningSangeeta Das
 
Gene mapping / Genetic map vs Physical Map | determination of map distance a...
Gene mapping / Genetic map vs Physical Map |  determination of map distance a...Gene mapping / Genetic map vs Physical Map |  determination of map distance a...
Gene mapping / Genetic map vs Physical Map | determination of map distance a...NARC, Islamabad
 

Tendances (20)

Synthetic Nanoscale Elements for Delivery of Material into Viable Cells
Synthetic Nanoscale Elements for Delivery of Material into Viable CellsSynthetic Nanoscale Elements for Delivery of Material into Viable Cells
Synthetic Nanoscale Elements for Delivery of Material into Viable Cells
 
Computational biology
Computational biologyComputational biology
Computational biology
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Artificial chromosomes
Artificial chromosomesArtificial chromosomes
Artificial chromosomes
 
Synthetic Genome
Synthetic Genome Synthetic Genome
Synthetic Genome
 
Machine Learning in Bioinformatics
Machine Learning in BioinformaticsMachine Learning in Bioinformatics
Machine Learning in Bioinformatics
 
Genome assembly
Genome assemblyGenome assembly
Genome assembly
 
Notes genetic drift
Notes genetic driftNotes genetic drift
Notes genetic drift
 
ویرایش ژنوم Genome editing tools
ویرایش ژنوم Genome editing toolsویرایش ژنوم Genome editing tools
ویرایش ژنوم Genome editing tools
 
Linkage analysis
Linkage analysisLinkage analysis
Linkage analysis
 
Designer babies
Designer babiesDesigner babies
Designer babies
 
Introduction to Biological databases
Introduction to Biological databasesIntroduction to Biological databases
Introduction to Biological databases
 
Tools in phylogeny
Tools in phylogeny Tools in phylogeny
Tools in phylogeny
 
Gene Silencing
Gene SilencingGene Silencing
Gene Silencing
 
Protein structure
Protein structureProtein structure
Protein structure
 
Molecular markr sscp
Molecular markr sscpMolecular markr sscp
Molecular markr sscp
 
Phylogenetic analysis
Phylogenetic analysisPhylogenetic analysis
Phylogenetic analysis
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
Gene mapping / Genetic map vs Physical Map | determination of map distance a...
Gene mapping / Genetic map vs Physical Map |  determination of map distance a...Gene mapping / Genetic map vs Physical Map |  determination of map distance a...
Gene mapping / Genetic map vs Physical Map | determination of map distance a...
 

Similaire à Gene tree-species tree methods in RevBayes

Genomic inference of the evolution of life history traits
Genomic inference of the evolution of life history traitsGenomic inference of the evolution of life history traits
Genomic inference of the evolution of life history traitsboussau
 
Genome to pangenome : A doorway into crops genome exploration
Genome to pangenome : A doorway into crops genome explorationGenome to pangenome : A doorway into crops genome exploration
Genome to pangenome : A doorway into crops genome explorationKiranKm11
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENTMariya Raju
 
Comparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatComparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatsidjena70
 
Comparitive genome mapping and model systems
Comparitive genome mapping and model systemsComparitive genome mapping and model systems
Comparitive genome mapping and model systemsHimanshi Chauhan
 
Iowa State Bioinformatics BCB Symposium 2018 - There and Back Again
Iowa State Bioinformatics BCB Symposium 2018 - There and Back AgainIowa State Bioinformatics BCB Symposium 2018 - There and Back Again
Iowa State Bioinformatics BCB Symposium 2018 - There and Back AgainAdina Chuang Howe
 
Genome assembly: then and now — v1.1
Genome assembly: then and now — v1.1Genome assembly: then and now — v1.1
Genome assembly: then and now — v1.1Keith Bradnam
 
Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Leighton Pritchard
 
2014 whitney-research
2014 whitney-research2014 whitney-research
2014 whitney-researchc.titus.brown
 
2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorialc.titus.brown
 
Detection of genomic homology in eukaryotic genomes
Detection of genomic homology in eukaryotic genomesDetection of genomic homology in eukaryotic genomes
Detection of genomic homology in eukaryotic genomesKlaas Vandepoele
 
http://lectures.gersteinlab.org/ppt/Gencode-winter08-20090121-pseudogenes/Gen...
http://lectures.gersteinlab.org/ppt/Gencode-winter08-20090121-pseudogenes/Gen...http://lectures.gersteinlab.org/ppt/Gencode-winter08-20090121-pseudogenes/Gen...
http://lectures.gersteinlab.org/ppt/Gencode-winter08-20090121-pseudogenes/Gen...Mark Gerstein
 
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...Torsten Seemann
 
Carleton Biology talk : March 2014
Carleton Biology talk : March 2014Carleton Biology talk : March 2014
Carleton Biology talk : March 2014Karen Cranston
 

Similaire à Gene tree-species tree methods in RevBayes (20)

Genomic inference of the evolution of life history traits
Genomic inference of the evolution of life history traitsGenomic inference of the evolution of life history traits
Genomic inference of the evolution of life history traits
 
Genome to pangenome : A doorway into crops genome exploration
Genome to pangenome : A doorway into crops genome explorationGenome to pangenome : A doorway into crops genome exploration
Genome to pangenome : A doorway into crops genome exploration
 
Molecular phylogenetics
Molecular phylogeneticsMolecular phylogenetics
Molecular phylogenetics
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Comparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatComparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 format
 
Hertweck uva2012
Hertweck uva2012Hertweck uva2012
Hertweck uva2012
 
Comparitive genome mapping and model systems
Comparitive genome mapping and model systemsComparitive genome mapping and model systems
Comparitive genome mapping and model systems
 
Iowa State Bioinformatics BCB Symposium 2018 - There and Back Again
Iowa State Bioinformatics BCB Symposium 2018 - There and Back AgainIowa State Bioinformatics BCB Symposium 2018 - There and Back Again
Iowa State Bioinformatics BCB Symposium 2018 - There and Back Again
 
Genome assembly: then and now — v1.1
Genome assembly: then and now — v1.1Genome assembly: then and now — v1.1
Genome assembly: then and now — v1.1
 
Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1
 
2014 whitney-research
2014 whitney-research2014 whitney-research
2014 whitney-research
 
FROM THE CLASSROOM TO AN OPINION NOTE: COMPLEMENTARY ANALYSIS OF THE GENETIC ...
FROM THE CLASSROOM TO AN OPINION NOTE: COMPLEMENTARY ANALYSIS OF THE GENETIC ...FROM THE CLASSROOM TO AN OPINION NOTE: COMPLEMENTARY ANALYSIS OF THE GENETIC ...
FROM THE CLASSROOM TO AN OPINION NOTE: COMPLEMENTARY ANALYSIS OF THE GENETIC ...
 
2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial
 
Hertweck bbl2012
Hertweck bbl2012Hertweck bbl2012
Hertweck bbl2012
 
Detection of genomic homology in eukaryotic genomes
Detection of genomic homology in eukaryotic genomesDetection of genomic homology in eukaryotic genomes
Detection of genomic homology in eukaryotic genomes
 
2013 duke-talk
2013 duke-talk2013 duke-talk
2013 duke-talk
 
http://lectures.gersteinlab.org/ppt/Gencode-winter08-20090121-pseudogenes/Gen...
http://lectures.gersteinlab.org/ppt/Gencode-winter08-20090121-pseudogenes/Gen...http://lectures.gersteinlab.org/ppt/Gencode-winter08-20090121-pseudogenes/Gen...
http://lectures.gersteinlab.org/ppt/Gencode-winter08-20090121-pseudogenes/Gen...
 
2013 alumni-webinar
2013 alumni-webinar2013 alumni-webinar
2013 alumni-webinar
 
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
 
Carleton Biology talk : March 2014
Carleton Biology talk : March 2014Carleton Biology talk : March 2014
Carleton Biology talk : March 2014
 

Plus de boussau

Combining relaxed clocks with lateral gene transfers to date species trees
Combining relaxed clocks with lateral gene transfers to date species treesCombining relaxed clocks with lateral gene transfers to date species trees
Combining relaxed clocks with lateral gene transfers to date species treesboussau
 
Models of gene duplication, transfer and loss to study genome evolution
Models of gene duplication, transfer and loss to study genome evolutionModels of gene duplication, transfer and loss to study genome evolution
Models of gene duplication, transfer and loss to study genome evolutionboussau
 
Genome-scale phylogenomics
Genome-scale phylogenomicsGenome-scale phylogenomics
Genome-scale phylogenomicsboussau
 
Evolutionary genomics
Evolutionary genomicsEvolutionary genomics
Evolutionary genomicsboussau
 
Presentation describing ALE, a suite of programs for gene tree reconstruction...
Presentation describing ALE, a suite of programs for gene tree reconstruction...Presentation describing ALE, a suite of programs for gene tree reconstruction...
Presentation describing ALE, a suite of programs for gene tree reconstruction...boussau
 
Towards inferring the history of life in the presence of lateral gene transfe...
Towards inferring the history of life in the presence of lateral gene transfe...Towards inferring the history of life in the presence of lateral gene transfe...
Towards inferring the history of life in the presence of lateral gene transfe...boussau
 
How to use PHYLDOG: a tutorial
How to use PHYLDOG: a tutorialHow to use PHYLDOG: a tutorial
How to use PHYLDOG: a tutorialboussau
 

Plus de boussau (7)

Combining relaxed clocks with lateral gene transfers to date species trees
Combining relaxed clocks with lateral gene transfers to date species treesCombining relaxed clocks with lateral gene transfers to date species trees
Combining relaxed clocks with lateral gene transfers to date species trees
 
Models of gene duplication, transfer and loss to study genome evolution
Models of gene duplication, transfer and loss to study genome evolutionModels of gene duplication, transfer and loss to study genome evolution
Models of gene duplication, transfer and loss to study genome evolution
 
Genome-scale phylogenomics
Genome-scale phylogenomicsGenome-scale phylogenomics
Genome-scale phylogenomics
 
Evolutionary genomics
Evolutionary genomicsEvolutionary genomics
Evolutionary genomics
 
Presentation describing ALE, a suite of programs for gene tree reconstruction...
Presentation describing ALE, a suite of programs for gene tree reconstruction...Presentation describing ALE, a suite of programs for gene tree reconstruction...
Presentation describing ALE, a suite of programs for gene tree reconstruction...
 
Towards inferring the history of life in the presence of lateral gene transfe...
Towards inferring the history of life in the presence of lateral gene transfe...Towards inferring the history of life in the presence of lateral gene transfe...
Towards inferring the history of life in the presence of lateral gene transfe...
 
How to use PHYLDOG: a tutorial
How to use PHYLDOG: a tutorialHow to use PHYLDOG: a tutorial
How to use PHYLDOG: a tutorial
 

Dernier

6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPRPirithiRaju
 
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...Sérgio Sacani
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxkumarsanjai28051
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxtuking87
 
complex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfcomplex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfSubhamKumar3239
 
LESSON PLAN IN SCIENCE GRADE 4 WEEK 1 DAY 2
LESSON PLAN IN SCIENCE GRADE 4 WEEK 1 DAY 2LESSON PLAN IN SCIENCE GRADE 4 WEEK 1 DAY 2
LESSON PLAN IN SCIENCE GRADE 4 WEEK 1 DAY 2AuEnriquezLontok
 
Gas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGiovaniTrinidad
 
final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterHanHyoKim
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxGiDMOh
 
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11GelineAvendao
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxPayal Shrivastava
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxfarhanvvdk
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlshansessene
 
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDivyaK787011
 
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPRPirithiRaju
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionJadeNovelo1
 
How we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptxHow we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptxJosielynTars
 

Dernier (20)

6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
 
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptx
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 
complex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfcomplex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdf
 
PLASMODIUM. PPTX
PLASMODIUM. PPTXPLASMODIUM. PPTX
PLASMODIUM. PPTX
 
LESSON PLAN IN SCIENCE GRADE 4 WEEK 1 DAY 2
LESSON PLAN IN SCIENCE GRADE 4 WEEK 1 DAY 2LESSON PLAN IN SCIENCE GRADE 4 WEEK 1 DAY 2
LESSON PLAN IN SCIENCE GRADE 4 WEEK 1 DAY 2
 
Gas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptx
 
final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarter
 
AZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTXAZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTX
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptx
 
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptx
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptx
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girls
 
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
 
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and Function
 
How we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptxHow we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptx
 

Gene tree-species tree methods in RevBayes

  • 1. Gene tree-species tree methods in RevBayes Bastien Boussau LBBE, CNRS, Université de Lyon
  • 2. What I spend my time doing • Developing methods for: • molecular evolution • phylogenetic reconstruction • genomics, genome scale phylogenetic reconstruction • Using these methods to investigate: • Genome evolution (gene family evolution, drift) • Evolution of life on Earth (species trees, ancestral trait reconstruction)
  • 3. Why the tautological title? “Phylogenomic” has been used to describe attempts at reconstructing species trees based on 10-100 genes We are interested in the species tree, but also in genome evolution A genome can contains >20,000 genes Genome-scale: thousands of genes, dozens of species
  • 4. Genomes, Genes, gene families, gene trees and species trees Genome
  • 5. Genomes, Genes, gene families, gene trees and species trees Genome Gene Homo sapiens; GeneA: ACTGGTGATGACATGAC…
  • 6. Genomes, Genes, gene families, gene trees and species trees Genome Gene Homo sapiens; GeneA: ACTGGTGATGACATGAC… GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus; Gene family alignment
  • 7. Genomes, Genes, gene families, gene trees and species trees Genome Gene Homo sapiens; GeneA: ACTGGTGATGACATGAC… GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus; Gene family alignment Gene tree
  • 8. Genomes, Genes, gene families, gene trees and species trees Genome Gene Homo sapiens; GeneA: ACTGGTGATGACATGAC… GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus; Gene family alignment Gene tree Species trees
  • 9. A graphical model for phylogenomics Species tree Gene tree Gene family alignment Gene Genome sequences Raw sequences
  • 10. A graphical model for phylogenomics Species tree Species tree construction Gene tree Gene tree construction Gene family alignment Homology prediction Alignment Gene Gene prediction Genome sequences Error correction, base calling, assembly Raw sequences
  • 11. A graphical model for phylogenomics Species tree Species tree construction Gene tree Gene tree construction Gene family alignment Homology prediction Alignment Gene Gene prediction Genome sequences Error correction, base calling, assembly Raw sequences
  • 12. The usual approach to reconstructing phylogenetic trees GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus;
  • 13. The usual approach to reconstructing phylogenetic trees GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus; • Parsimony • Model-based approaches: e.g. Felsenstein pruning algorithm (1981) to compute P(alignment|Gene tree)
  • 14. The usual approach to reconstructing phylogenetic trees GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus; • Parsimony • Model-based approaches: e.g. Felsenstein pruning algorithm (1981) to compute P(alignment|Gene tree) No species tree object in the usual approach
  • 15. The gene tree graphical model Gene tree Gene alignment PhyloCTMC Felsenstein pruning algorithm (1981)
  • 16. Gene trees inferred from sequences alone Homolens database
  • 17. Gene trees inferred from sequences alone Homolens database Complex, messy, not to be trusted
  • 18. The species tree-gene tree approach to reconstructing phylogenetic trees GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus;
  • 19. The species tree-gene tree approach to reconstructing phylogenetic trees GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus; P(alignment|Gene tree)
  • 20. The species tree-gene tree approach to reconstructing phylogenetic trees GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus; P(alignment|Gene tree) P(Gene tree|Species tree)
  • 21. The species tree-gene tree approach to reconstructing phylogenetic trees GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus; P(alignment|Gene tree) P(Gene tree|Species tree) P(alignment,Gene tree|Species tree)=P(alignment|Gene tree)xP(Gene tree|Species tree) !
  • 22. The species tree-gene tree approach to reconstructing phylogenetic trees GeneA: ACTGGTGATGACATAAC…! GeneB: ACTGTTGATGACATGAC…! GeneC: ACTGATGATGACAAGAC…! GeneD: ACTGGTGA--CCATGAC…! GeneE: ACTGGTGATGACACGAC…! GeneF: ACT--TCATGAAACGAC…! Homo sapiens; Homo sapiens; Mus musculus; Mus musculus; Bison bison; Canis lupus; P(alignment|Gene tree) P(Gene tree|Species tree) Felsenstein pruning algorithm (1981) Many models and algorithms P(alignment,Gene tree|Species tree)=P(alignment|Gene tree)xP(Gene tree|Species tree) !
  • 23. The species tree-gene tree graphical model Species tree Gene tree Gene alignment Many models and algorithms Felsenstein pruning algorithm (1981)
  • 24. 11 The species tree-gene tree graphical model Hoehna et al., Syst Biol 2014
  • 25. point mutation AATAC⬅ATTAC Substituion point mutation AATAG⬅AATAC 12 A hierarchy of processes population of individuals species diversification gene birth and death a) b) c) enlarged below enlarged below gene duplication gene gene transfer loss extinct linage species extinctions speciations X X X X X present extant species sampled genomes sequence substitution X ATTAC X X X X X speciation AATAG AATAC ATTAC speciation (species tree) (locus tree) (gene tree) time population of species X X X X X speciation speciation genes in genomes nucleotides in genes Szöllősi et al., Syst Biol 2014
  • 26. Gene tree and species tree are linked Boussau and Daubin, Tree 2010
  • 27. Gene trees in species trees Maddison, Syst. Biol. 1997
  • 28. Gene trees in species trees Maddison, Syst. Biol. 1997 Challenges Given gene trees: reconstruct species tree Given species tree and alignment: reconstruct gene tree Given alignments: reconstruct genes and species trees
  • 29. The landscape of gene tree-species tree methods POMO (De Maio et al., 2013) Szöllősi et al., Syst Biol 2014
  • 30. The landscape of gene tree-species tree methods POMO (De Maio et al., 2013) Szöllősi et al., Syst Biol 2014 RevBayes
  • 31.
  • 32.
  • 33.
  • 34. D
  • 35. D DL
  • 37. D DL LGT ILS
  • 38. PHYLDOG D DL LGT ILS
  • 39. PHYLDOG D DL LGT ILS LGT: exODT
  • 40. PHYLDOG D DL LGT ILS LGT: exODT ILS: MSC
  • 41. Plan 1. Modeling the relationship between species tree and gene tree – coalescent models – models of gene duplication and loss – models of gene transfer – models that combine the above
  • 42. D DL LGT ILS
  • 43. Incomplete lineage sorting Degnan and Rosenberg, Syst. Biol. 2006 Maddison, Syst. Biol. 1997
  • 44. Reconstructing species trees given ILS •Parsimony method: minimizing deep coalescences (Maddison 1997) •Distance method: NJst (Liu and Yu 2011) •Summary statistic methods: STAR (Liu et al, 2009), GLASS (Mossel and Roch 2010), iGLASS (Jewett and Rosenberg 2012), ASTRAL (Mirarab et al., 2014) •Maximum Likelihood methods: STEM (Kubatko et al. 2009), MP-EST (Liu et al., 2010) •Bayesian methods: BeST (2007), *BEAST (Heled and Drummond, 2009) •Bypassing the gene trees with math tricks: SNAPP (Bryant et al., 2012), POMO (DeMaio et al., 2013), SVDQuartets (Chifman and Kubatko, 2014)
  • 45. Likelihood of a gene tree given a species tree under the multi-species coalescent Rannala and Yang, Genetics 2003
  • 46. Likelihood of a gene tree given a species tree under the multi-species coalescent Rannala and Yang, Genetics 2003
  • 47. The species tree-gene tree graphical model with ILS Species tree topology Gene tree Gene alignment Ne T r
  • 48. The POMO model De Maio et al., MBE 2013
  • 49. The POMO model De Maio et al., MBE 2013
  • 50. The POMO model De Maio et al., MBE 2013
  • 51. Pros and cons of the Pomo model 1. Pros: 1. bypasses gene tree estimation (easy to combine with DL or DTL models…) 2. does not assume linkage between sites 2. Cons: 1. You don’t get gene trees 2. It’s not sure what branch lengths mean anymore 3. matrix can become big 4. Only bimorphic sites are considered: 3- or 4- morphic sites need to be handled differently 5. Not sure how it behaves in practice
  • 52. Plan 1. Modeling the relationship between species tree and gene tree – coalescent models – models of gene duplication and loss – models of gene transfer – models that combine the above
  • 53. D DL LGT ILS
  • 54. DL models •Reconciliation between a gene tree and a species tree: •Parsimony method: minimizing numbers of duplications (Goodman et al. 1979; Page 1994; Zmasek and Eddy 2001) •Model-based method: Arvestad et al. 2003, 2004, 2009; Görecki et al. 2011 •Reconstruction of a gene tree given a species tree and alignment: •Parsimony method: Chen et al. 2000; Durand et al. 2006 •Model-based method: Rasmussen and Kellis 2007, 2011; Akerborg et al. 2009; Sjöstrand et al. 2012 •Reconstruction of a species tree given gene trees: •Parsimony methods: Page and Charleston 1997, Bansal et al. 2007; Wehe et al. 2008; Bansal et al. 2010; Chang et al. 2011 •Joint reconstruction of gene and species trees given alignments: •Model-based method: Boussau et al., 2013
  • 55. Many possible reconciliations Sennblad and Lagergren, Syst. Biol. 2009
  • 56. Many possible reconciliations Need to integrate over all possible reconciliations Sennblad and Lagergren, Syst. Biol. 2009
  • 57. Considering all mappings: The species tree discretization approximation Tofigh, PhD thesis
  • 58. Considering all mappings: The species tree discretization approximation Tofigh, PhD thesis Mapping between a gene tree and species tree ⬄ Mapping between nodes of the gene tree and nodes of the augmented species tree
  • 59. Considering all mappings: The species tree discretization approximation Tofigh, PhD thesis Mapping between a gene tree and species tree ⬄ Mapping between nodes of the gene tree and nodes of the augmented species tree Integrating over all mappings: done through dynamic programming from the leaves of the gene tree to its root
  • 60. Birth-death models for DL M o d e l - b a s e d approaches: often use a birth-death process (Arvestad et al., 2003, 2004, 2009; Akerborg et al., 2009; Sjöstrand et al., 2012; Boussau et al., 2013) Birth=Duplication: λ Death=Loss: μ Akerborg et al., PNAS 2009
  • 61. The Akerborg et al. 2009 model • Input: Species tree with time, rooted gene tree, λ and μ assumed to be known • Output: reconciliation between species tree and gene tree • Relaxed-clock model • Gene trees generated according to a birth-death model
  • 62. The Akerborg et al. 2009 model • Input: Species tree with time, rooted gene tree, λ and μ assumed to be known • Output: reconciliation between species tree and gene tree • Relaxed-clock model • Gene trees generated according to a birth-death model Felsenstein 1981 Gamma prior Dynamic programming to integrate over all mappings with BD model
  • 63. The species tree-gene tree graphical model with DL (Akerborg 2009) Sp tree Topology T r Gene tree Gene alignment μ λ
  • 64. The Akerborg et al. 2009 model vs PHYLDOG • Akerborg et al. 2009: • Generative model • Integrates over a large space of mappings • Integrates over all possible scenarios of duplications and losses PHYLDOG • use unrooted, non-clock gene trees • Use the most parsimonious mapping • Use a double-recursive tree traversal • Integrates over a large subset of possible scenarios of duplications and losses (Boussau et al., Genome Research 2013)
  • 65. Plan 1. Modeling the relationship between species tree and gene tree – coalescent models – models of gene duplication and loss – models of gene transfer – models that combine the above
  • 66. D DL LGT ILS
  • 67. Pure transfer models •Identification of transfers (not probabilistic): •Nakleh et al., 2005; Beiko and Hamilton 2006; Abby et al., 2010; Hill et al. 2010 •Joint reconstruction of gene and species trees given alignments: •Model-based method: Suchard, Genetics 2005
  • 68. Suchard “random walk through tree space” model •Input: gene alignments •Output: species tree, gene trees •Each gene tree is a Poisson number of transfers away from the species tree •Random walk through gene trees •C classes of genes, with different propensities to be transferred •Has been used with at most 6 species, 144 orthologous gene alignments, but could be used with 8 species or even more Suchard, Genetics 2005
  • 69. The species tree-gene tree graphical model with transfers (Suchard 2005) Species tree Topology Gene tree Gene alignment Gene category bl Expected number of transfers
  • 70. Plan 1. Modeling the relationship between species tree and gene tree – coalescent models – models of gene duplication and loss – models of gene transfer – models that combine the above
  • 71. Model that combines DL and ILS •Reconciliation between gene tree and species tree: •Model-based method: Rasmussen and Kellis, Genome Research 2012 ! •Input: dated and rooted species tree, DL rates, Ne •Output: Locus tree and DL+ILS scenario
  • 72. D DL LGT ILS
  • 73. The Rasmussen and Kellis model (2012) Species tree Gene tree Gene alignment
  • 74. The Rasmussen and Kellis model (2012) Species tree Gene tree Gene alignment Species tree Locus tree Gene tree Gene alignment
  • 75. The Rasmussen and Kellis model (2012) Species tree Gene tree Gene alignment Species tree DL model Locus tree Coalescent model Gene tree Gene alignment
  • 76. The Rasmussen and Kellis model (2012) Species tree DL model Locus tree Coalescent model Gene tree Gene alignment
  • 77. The Rasmussen and Kellis model (2012) Species tree DL model Locus tree Coalescent model Gene tree Gene alignment In practice: Species tree DL model Locus tree Coalescent model Gene tree
  • 78. The Rasmussen and Kellis model (2012)
  • 79. The Rasmussen and Kellis model (2012)
  • 80. The species tree-gene tree graphical model with DL + ILS (Rasmussen and Kellis 2012) Sp tree Topology Locus tree Gene tree μ r T λ Ne
  • 81. D DL LGT ILS
  • 82. Model that combines DL and T •Reconciliation between a species tree and a gene tree: Parsimony setting: Gorecki, RECOMB 2004; Gorbunov and Lyubetski Mol. Biol. (Mosk), 2009; Libeskind-Hadas and Charleston, JCB 2009; Tofigh et al., IEEE ACM 2011; Doyon et al., Comparative Genomics 2011 ! •Reconstruction of a species tree given gene trees: •Model-based method: Szöllősi et al., PNAS 2012; Szöllősi et al., Systematic Biology 2013a, 2013b; Sjöstrand et al., Syst Biol 2014
  • 83. Dating the tree of life is difficult
  • 84. Dating the tree of life is difficult 100 100 100
  • 85. Dating the tree of life is difficult 100 10 mya 100 100
  • 86. Dating the tree of life is difficult 100 10 mya 100 100 20 mya
  • 87. Dating the tree of life is difficult 100 10 mya 100 100 20 mya Very few fossils for the first 3 billion years of evolution on Earth Other constraints are needed to date the tree of life
  • 88. Using transfers to date clades
  • 89. Using transfers to date clades
  • 90. Using transfers to date clades
  • 91. Using transfers to date clades
  • 92. Gene transfers provide means to root Szöllősi et al., PNAS 2012 and date species trees
  • 93. Gene transfers provide means to root Szöllősi et al., PNAS 2012 and date species trees
  • 94. Avoiding back to the future scenarios Doyon et al., Comparative Genomics 2011
  • 95. Avoiding back to the future scenarios ➔ Algorithm in O (#NodesInS’ . #NodesInG ) Doyon et al., Comparative Genomics 2011
  • 96. 6 possible events are enough Doyon et al., Comparative Genomics 2011
  • 97. Dynamic programming in the Doyon et al. 2011 approach •Input: rooted augmented species tree S’, rooted gene tree G •Output: reconciliation scenario between S’ and G •Finding scenarios can be done in polynomial time using DP: •Fill the cost matrix c (#NodesInG, #NodesInS’) •Post-order traversal of the gene tree •For each node i of G: •Map it to each node j of S’ •Consider all of 6 events, and set c(i, j)=min(cost(possible events)) •At the root, return the minimum cost found
  • 98. From parsimony to model-based •Input: gene trees •Output: reconciled rooted gene trees, rooted species tree with ordered nodes •Birth-death process to model gene evolution: •Birth parameters: •D •T: rate of receiving a gene through transfer •Death parameter: •L •One or more sets of D, T, L parameters •Double recursive DP for computing P(gene tree|species tree) •Integrating over all possible scenarios •Sliced species tree Szöllősi et al., PNAS 2012
  • 99. Use of the 6 events from Doyon et al. 2011 Szöllősi et al., PNAS 2012
  • 100. Within-slice discretization for solving ordinary differential equations Szöllősi et al., PNAS 2012
  • 101. point mutation AATAC⬅ATTAC Substituion point mutation AATAG⬅AATAC 59 A hierarchy of processes population of individuals species diversification gene birth and death a) b) c) enlarged below enlarged below gene duplication gene gene transfer loss extinct linage species extinctions speciations X X X X X present extant species sampled genomes sequence substitution X ATTAC X X X X X speciation AATAG AATAC ATTAC speciation (species tree) (locus tree) (gene tree) time population of species X X X X X speciation speciation genes in genomes nucleotides in genes Szöllősi et al., Syst Biol 2014
  • 102. Further refinement: seeing dead lineages Szöllősi et al., Syst. Biol 2013
  • 103. Further refinement: seeing dead lineages Szöllősi et al., Syst. Biol 2013
  • 104. The exODT graphical model N σ Species tree Topology Gene tree D Population of species T L
  • 105. Plan Modeling the relationship between species tree and gene tree – coalescent models – models of gene duplication and loss – models of gene transfer – models that combine the above Gene tree species tree models in RevBayes, and tutorial
  • 106. Gene tree species tree models in RevBayes •Multispecies coalescent •Pomo model •In progress: •exODT (DTL model) •Soon: •Akerborg et al. 2009 (DL model) •MP-EST
  • 107. The tutorial •A walk around the multispecies coalescent: 1. Simulation with the multispecies coalescent (play with the parameters!) 2. Then: • Inference with the multispecies coalescent from gene trees • Inference with the multispecies coalescent from gene alignments • Inference with the POMO model from gene alignments • Inference by concatenating the gene alignments
  • 108. The tutorial Files are in: tutorials/RB_GeneTreeSpeciesTree_Tutorial/RB_tutorial_files/RevBayes_scripts/ in five folders. It is highly recommended to start RevBayes from each of these folders when doing the exercises to avoid path problems. There is one simulation-only folder, used for better understanding the multispecies coalescent: UnderstandingMultiSpeciesCoalescent Then there are 4 inference folders. In each inference folder, there are 3 scripts. The file XXXMCMC.Rev is the one that can launch the entire analysis, which consists of three parts: Simulating the data Constructing the model for inference Preparing and running the MCMC