Genetics. Published Articles Ahead of Print: March 21, 2005, Copyright © 2005
doi:10.1534/genetics.103.025692


A more recent version of this article appeared on May 1, 2005.


REGULAR RESEARCH PAPERS

Stochastic models for horizontal gene transfer: taking a random walk through tree space

1 David Geffen School of Medicine at UCLA

* To whom correspondence should be addressed. E-mail: msuchard{at}ucla.edu.

Submitted on December 11, 2003
Revised on October 20, 2004
Accepted on 31 January 2005


Abstract

Horizontal gene transfer (HGT) plays a critical role in evolution across all domains of life with important biological and medical implications. I propose a simple class of stochastic models to examine HGT using multiple orthologous gene alignments. The models function in a hierarchical phylogenetic framework. The top level of the hierarchy is based on a random walk process in "tree space" that allows for the development of a joint probabilistic distribution over multiple gene-trees and an unknown, but estimable species-tree. I consider two general forms of random walks. The first form is derived from the subtree prune and regraft (SPR) operator that mirrors the observed effects that HGT has on inferred trees. The second form is based on walks over complete graphs and offers numerically tractable solutions for increasing number of taxa. The lower level of the hierarchy utilizes standard phylogenetic models to reconstruct gene-trees given multiple gene alignments conditional on the random walk process. I develop a well mixing Markov chain Monte Carlo algorithm to fit the models in a Bayesian framework. I demonstrate the flexibility of these stochastic models to test competing ideas about HGT by examining the Complexity Hypothesis. Using 144 orthologous gene alignments from six prokaryotes previously collected and analyzed, Bayesian model selection finds support for (1) the SPR model over the alternative form, (2) the 16S rRNA reconstruction as the most likely species-tree and (3) increased HGT of operational genes compared to informational genes.

Key Words: Bayes factor, Complexity Hypothesis, MCMC, horizontal gene transfer, phylogeny




This article has been cited by other articles:


Home page
Mol Biol EvolHome page
S. Linz, A. Radtke, and A. von Haeseler
A Likelihood Framework to Measure Horizontal Gene Transfer
Mol. Biol. Evol., June 1, 2007; 24(6): 1312 - 1319.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
C. Ane, B. Larget, D. A. Baum, S. D. Smith, and A. Rokas
Bayesian Estimation of Concordance among Gene Trees
Mol. Biol. Evol., February 1, 2007; 24(2): 412 - 426.
[Abstract] [Full Text] [PDF]