Originally published as Genetics Published Articles Ahead of Print on December 28, 2006.
Genetics, Vol. 175, 1773-1785, April 2007, Copyright © 2007
doi:10.1534/genetics.106.066258
Phylogenetic Mapping of Recombination Hotspots in Human Immunodeficiency Virus via Spatially Smoothed Change-Point Processes
Vladimir N. Minin*,
Karin S. Dorman
,
,
,
Fang Fang
and
Marc A. Suchard*,**,
,1
* Department of Biomathematics and ** Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California 90095, 
Department of Biostatistics, UCLA School of Public Health, Los Angeles, California 90095 and
Bioinformatics and Computational Biology Program,
Department of Statistics and
Department of Genetics, Cell and Development Biology, Iowa State University, Ames, Iowa 50011
1 Corresponding author: Departments of Biomathematics and Human Genetics, David Geffen School of Medicine, University of California, 695 Charles E. Young Dr., Box 951766, S. Los Angeles, CA 90095-1766.
E-mail: msuchard{at}ucla.edu
We present a Bayesian framework for inferring spatial preferences of recombination from multiple putative recombinant nucleotide sequences. Phylogenetic recombination detection has been an active area of research for the last 15 years. However, only recently attempts to summarize information from several instances of recombination have been made. We propose a hierarchical model that allows for simultaneous inference of recombination breakpoint locations and spatial variation in recombination frequency. The dual multiple change-point model for phylogenetic recombination detection resides at the lowest level of our hierarchy under the umbrella of a common prior on breakpoint locations. The hierarchical prior allows for information about spatial preferences of recombination to be shared among individual data sets. To overcome the sparseness of breakpoint data, dictated by the modest number of available recombinant sequences, we a priori impose a biologically relevant correlation structure on recombination location log odds via a Gaussian Markov random field hyperprior. To examine the capabilities of our model to recover spatial variation in recombination frequency, we simulate recombination from a predefined distribution of breakpoint locations. We then proceed with the analysis of 42 human immunodeficiency virus (HIV) intersubtype gag recombinants and identify a putative recombination hotspot.
Related articles in Genetics:
- ISSUE HIGHLIGHTS
Genetics 2007 175: NP.
[Full Text]
Copyright © 2007 by the Genetics Society of America.