# Csuros2010

CREATED: 201001261558 Title: Counting along phylogenies: ancestral reconstruction of numerical characters

Steiner tree labeling

• label inner nodes to minimize sum over all edges penalty of changing from c[u] to c[v]
• asymmetric Wagner penalty, computable in $O(nh)$
• standard solution by DP, not obvious how to extend to infinite values

Probabilistic model

• stochastic models for numerical characters
• Markov assumption

Evolution of gene structure

• some eukaryotes have few introns (yeast), some have many (human)
• human genome is “primitive” in term of introns. Most intros are present in last common ancestor of eukaryotes

Evolution of gene counts

• a birth death model, continuous time Markov process (gain rate, loss rate, dup rate)
• distributions of inparalogs and xenologs are well characterized
• how to compute likelihood of a profile?
• Felsenstein’s algorithm doesn’t work. Condition on the number of genes that have surviving modern offspring gives a DP solution, need correction for missing families

Example: gene content evolution in Archaea, COGs for gene families