Phylogenetic Tree Reconstruction (Inference) Problem
Given:
- n species
- m characters
- for each species, values for all characters
Want: fully labelled phylogenetic tree that 'best' explains the given data
(i.e. a tree that maximizes a target function (score))
Assumptions:
- characters are mutually independent
- after two species diverged, their further evolution is independent of each other
Simple Solution: enumerate all possible trees, score each one, and pick the best
- problem: too many possibilities to check
- n species -> (2n-3)!! = 1 * 3 * 5 * ... * (2n-3) different rooted binary trees
- n = 20 -> about 8.2 x 10^21 trees
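
The growth of the tree count can be checked directly. The following is a minimal sketch, assuming that the trees being counted are rooted binary trees with n labelled leaves; the function name num_rooted_trees is hypothetical and chosen only for illustration.

```python
def num_rooted_trees(n: int) -> int:
    """Count rooted binary trees with n labelled leaves, i.e. (2n-3)!!.

    Assumption: trees are binary and only the leaves carry species labels.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    count = 1
    # (2n-3)!! = 1 * 3 * 5 * ... * (2n-3); the product is empty for n <= 2
    for k in range(3, 2 * n - 2, 2):
        count *= k
    return count


if __name__ == "__main__":
    for n in (3, 4, 5, 10, 20):
        print(f"n = {n:2d} -> {num_rooted_trees(n):.3e} trees")
```

Even at n = 10 there are already more than 34 million trees, so exhaustive enumeration is only practical for very small inputs.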