IDT. Quality oligos. Every time.

Originally published as Genetics Published Articles Ahead of Print on February 16, 2005.

Genetics, Vol. 170, 291-304, May 2005, Copyright © 2005
doi:10.1534/genetics.104.038232

Linkage Disequilibrium Grouping of Single Nucleotide Polymorphisms (SNPs) Reflecting Haplotype Phylogeny for Efficient Selection of Tag SNPs

* Department of Medical Ecology and Informatics, Research Institute, International Medical Center of Japan, Shinjuku, Tokyo, 162-8655, Japan
{ddagger} Department of Gene Diagnostics and Therapeutics, Research Institute, International Medical Center of Japan, Shinjuku, Tokyo, 162-8655, Japan
{dagger} Department of Infection Control Science, Juntendo University, Bunkyo, Tokyo, 113-8421, Japan

2 Corresponding author: Department of Infection Control Science, Juntendo University, 2-1-1 Hongo, Bunkyo, Tokyo, 113-8421, Japan.
E-mail: fumihiko@takeuchi.name

Single nucleotide polymorphisms (SNPs) have been proposed to be grouped into haplotype blocks harboring a limited number of haplotypes. Within each block, the portion of haplotypes is expected to be tagged by a selected subset of SNPs; however, none of the proposed selection algorithms have been definitive. To address this issue, we developed a tag SNP selection algorithm based on grouping of SNPs by the linkage disequilibrium (LD) coefficient r2 and examined five genes in three ethnic populations—the Japanese, African Americans, and Caucasians. Additionally, we investigated ethnic diversity by characterizing 979 SNPs distributed throughout the genome. Our algorithm could spare 60% of SNPs required for genotyping and limit the imprecision in allele-frequency estimation of nontag SNPs to 2% on average. We discovered the presence of a mosaic pattern of LD plots within a conventionally inferred haplotype block. This emerged because multiple groups of SNPs with strong intragroup LD were mingled in their physical positions. The pattern of LD plots showed some similarity, but the details of tag SNPs were not entirely concordant among three populations. Consequently, our algorithm utilizing LD grouping allows selection of a more faithful set of tag SNPs than do previous algorithms utilizing haplotype blocks.




This article has been cited by other articles:


Home page
Am J EpidemiolHome page
T. J. Jorgensen, I. Ruczinski, B. Kessing, M. W. Smith, Y. Y. Shugart, and A. J. Alberg
Hypothesis-Driven Candidate Gene Association Studies: Practical Design and Analytical Considerations
Am. J. Epidemiol., October 15, 2009; 170(8): 986 - 993.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
N. Kato, T. Miyata, Y. Tabara, T. Katsuya, K. Yanai, H. Hanada, K. Kamide, J. Nakura, K. Kohara, F. Takeuchi, et al.
High-density association study and nomination of susceptibility genes for hypertension in the Japanese National Project
Hum. Mol. Genet., February 14, 2008; 17(4): 617 - 627.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Xu, W. Huang, H. Wang, Y. He, Y. Wang, Y. Wang, J. Qian, M. Xiong, and L. Jin
Dissecting Linkage Disequilibrium in African-American Genomes: Roles of Markers and Individuals
Mol. Biol. Evol., September 1, 2007; 24(9): 2049 - 2058.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. C. Gonzalez-Martinez, E. Ersoz, G. R. Brown, N. C. Wheeler, and D. B. Neale
DNA Sequence Variation and Selection of Tag Single-Nucleotide Polymorphisms at Candidate Genes for Drought-Stress Response in Pinus taeda L.
Genetics, March 1, 2006; 172(3): 1915 - 1926.
[Abstract] [Full Text] [PDF]