- THIS ARTICLE
-
Abstract
- Full Text (PDF)
-
All Versions of this Article:
genetics.106.068890v1
176/3/1811 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Li, Y.
- Articles by Wu, R.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Li, Y.
- Articles by Wu, R.
Originally published as Genetics Published Articles Ahead of Print on June 11, 2007.
Genetics, Vol. 176, 1811-1821, July 2007, Copyright © 2007
doi:10.1534/genetics.106.068890
Estimation of Multilocus Linkage Disequilibria in Diploid Populations With Dominant Markers
Yanchun Li*,
Yang Li
,
Song Wu
,
Kun Han*,
Zhengjia Wang*,
Wei Hou
,
Yanru Zeng* and
Rongling Wu*,
,1
* School of Forestry and Biotechnology, Zhejiang Forestry University, Lin'an, Zhejiang 311300, People's Republic of China and
Department of Computer and Information Science and Engineering,
Department of Epidemiology and Health Policy Research and
Department of Statistics, University of Florida, Gainesville, Florida 32611
1 Corresponding author: Department of Statistics, University of Florida, Gainesville, FL 32611.
E-mail: rwu{at}stat.ufl.edu
>ABSTRACT
TWO-LOCUS LINKAGE DISEQUILIBRIUM...
THREE-LOCUS LINKAGE...
MONTE CARLO SIMULATION
WORKED EXAMPLE
DISCUSSION
ACKNOWLEDGEMENTS
LITERATURE CITED
Analysis of population structure and organization with DNA-based markers can provide important information regarding the history and evolution of a species. Linkage disequilibrium (LD) analysis based on allelic associations between different loci is emerging as a viable tool to unravel the genetic basis of population differentiation. In this article, we derive the EM algorithm to obtain the maximum-likelihood estimates of the linkage disequilibria between dominant markers, to study the patterns of genetic diversity for a diploid species. The algorithm was expanded to estimate and test linkage disequilibria of different orders among three dominant markers and can be technically extended to manipulate an arbitrary number of dominant markers. The feasibility of the proposed algorithm is validated by an example of population genetic studies of hickory trees, native to southeastern China, using dominant random amplified polymorphic DNA markers. Extensive simulation studies were performed to investigate the statistical properties of this algorithm. The precision of the estimates of linkage disequilibrium between dominant markers was compared with that between codominant markers. Results from simulation studies suggest that three-locus LD analysis displays increased power of LD detection relative to two-locus LD analysis. This algorithm is useful for studying the pattern and amount of genetic variation within and among populations.
THE pattern and extent of nonrandom associations among polymorphic markers distributed over the genome are related to the evolutionary rate of population structure for a species (TISHKOFF et al. 1996, 2001; STEPHENS et al. 2001; ARDLIE et al. 2002; WEISS and CLARK 2002). One measure of such nonrandom associations, linkage disequilibrium (LD), is often affected by various evolutionary forces, such as selection, genetic drift, mutation, admixture, population structure, etc., which have operated in the population. For this reason, by estimating and testing for the extent and distribution of LD throughout the genome, the evolution of population structure can be inferred (TISHKOFF and WILLIAMS 2002). There is a wealth of literature on the application of the LD analysis to understand the population evolution of humans (REICH et al. 2001; ARDLIE et al. 2002; DAWSON et al. 2002; GABRIEL et al. 2002) as well as a variety of plants and animals (REMINGTON et al. 2001; HANSSON et al. 2004; LIU et al. 2006).
Unlike humans and several model systems, such as mouse and Arabidopsis, in which high-resolution LD maps have been constructed with codominant markers, such as single-nucleotide polymorphisms (SNPs) and microsatellites, many underrepresented species, like forest trees, still heavily rely upon simple and cheap dominant marker techniques. These markers including random amplified polymorphic DNA (RAPD) (WILLIAMS et al. 1990) and amplified fragment length polymorphism (AFLP) (VOS et al. 1995) can be genotyped arbitrarily from the genome with no need of prior knowledge about the structure and sequence of the genome. There have been extensive publications on the use of dominant markers to explore the amount, structure, and distribution of genetic variation in a population (YAN et al. 1999; ZHIVOTOVSKY 1999; HOLSINGER et al. 2002; MILLER and SCHAAL 2006) and manage biological resources and diversity in agriculture and forestry (KUANG et al. 1998; SILBIGER et al. 1998; KREMER et al. 2005). In the postgenomic era, proteomic techniques have become increasingly available to produce dominant markers such as presence/absence sport (PAS) and protein quantitative locus (PQL) (THIELLEMENT et al. 1999; ZIVY and DE VIENNE 2000; CONSOLI et al. 2002). Proteomic markers are often related to biological functions and, therefore, will play an important role in the genetic study of variation in a natural or an experimental population.
For a dominant marker, the plus allele, shown as the presence of a band on the gel, dominates over the null allele, shown as the absence of a band (unamplifiable by PCR). Therefore, heterozygous and homozygous genotypes both with the band cannot be directly distinguished from each other. Statistical methods for estimating linkage disequilibria with codominant markers, for which two homozygotes and the heterozygote are distinguishable, have been well developed (PRITCHARD and PRZEWORSKI 2001; WANG and WU 2004). However, the estimates of LD among dominant markers have been poorly explored, although they have attracted the interest of geneticists for a long time because of their potential importance in the understanding of population structure (HILL 1974). As compared to codominant markers, dominant markers are less informative and, therefore, there may be some loss of information for their practical use. In this article, we derive a statistical method for estimating and testing the LD between dominant markers within the maximum-likelihood context. This method is implemented with the EM algorithm, allowing for a number of hypotheses regarding the structure and organization of a population. The statistical properties of the method for analyzing dominant markers are investigated and compared with those for codominant markers through simulation studies.
ABSTRACT
>TWO-LOCUS LINKAGE DISEQUILIBRIUM...
THREE-LOCUS LINKAGE...
MONTE CARLO SIMULATION
WORKED EXAMPLE
DISCUSSION
ACKNOWLEDGEMENTS
LITERATURE CITED
The codominant model:
Suppose n diploid individuals are randomly drawn from a large population at Hardy–Weinberg equilibrium. Let us consider two diallelic markers A (with alleles A and a) and B (with two alleles B and b). These two markers are genetically associated with the linkage disequilibrium D in a Hardy–Weinberg equilibrium (HWE) population. Allele frequencies are defined as pA and pa (pA + pa = 1) for A and pB and pb (pB + pb = 1) for B, respectively. The frequencies of four haplotypes, AB, Ab, aB, and ab, formed by the two markers, are denoted, respectively, as
![]() | (1) |
be the observed number of observations of a two-marker genotype lAlB (lA = AA, Aa, aa; lB = BB, Bb, bb). For a practical data set, the observations of these genotypes and their expected frequencies can be tabulated in a format like Table 1. Under the HWE, the diplotype frequencies are the products of the corresponding haplotype frequencies. The resulting genotype frequencies for each cell are then expressed in terms of haplotype frequencies.
|
The likelihood of unknown haplotype frequencies,
= (pAB, pAb, paB, pab), given observed genotypes (M) can be written as a multinomial form; i.e.,
![]() | (2) |
The maximum-likelihood estimates (MLEs) of haplotype frequencies are then derived as
![]() | (3) |
![]() | (4) |
Equations 3 and 4 compose a loop for the EM algorithm. Initial values for the haplotype frequencies are provided to calculate the proportion
in the E step (Equation 4). The calculated
-value is then used to estimate haplotype frequencies in the M step (Equation 3). Both the E and the M steps are repeated until the estimates of haplotype frequencies converge. The MLEs of allele frequencies at the two markers and their linkage disequilibrium can be obtained by solving a group of equations (Equation 1); i.e.,
![]() | (5) |
The degree of linkage disequilibrium between two markers can be tested by formulating two hypotheses expressed as
![]() | (6) |
and
, are calculated, respectively, where the tilde corresponds to the MLEs for the null hypothesis and the circumflex corresponds to the MLEs for the alternative hypothesis. Under the null hypothesis above, the allele frequencies of codominant markers can be directly estimated from the marker data without the use of the EM algorithm. The log-likelihood-ratio test statistic is calculated by
![]() | (7) |
2-distributed with 1 d.f.
The dominant model:
For a dominant marker (say A), the homozygote (AA) for the dominant allele cannot be distinguished from the heterozygote (Ao). Thus, these two genotypes are observed as a single "phenotype" (A_). Because of this, some cells for the observations and expected genotype frequencies in Table 1 are collapsed in a way as shown in Table 2. We use
to denote the observed number of observations of a two-dominant-marker genotype kA/kB
. Using the same idea as that conceived for the codominant markers, we formulate the EM algorithm to estimate haplotype frequencies
![]() | (8) |
![]() | (9) |
|
It should be pointed out that, unlike the codominance case, the estimation of allele frequencies of two dominant markers under the null hypothesis of no linkage disequilibrium should be based on the EM algorithm. The EM algorithm is described as follows:
In the E step, calculate
![]() |
![]() |
ABSTRACT
TWO-LOCUS LINKAGE DISEQUILIBRIUM...
>THREE-LOCUS LINKAGE...
MONTE CARLO SIMULATION
WORKED EXAMPLE
DISCUSSION
ACKNOWLEDGEMENTS
LITERATURE CITED
The codominant model:
Consider three segregating markers, A with alleles A and a, B with alleles B and b, and C with alleles C and c. The notation for marker alleles and their frequencies is given similarly for two markers in the preceding section. Linkage disequilibrium analysis among three markers is characterized by allele frequencies at each marker and linkage disequilibria between any pair of markers and among all the three markers are denoted as DAB, DAC, DBC, and DABC, respectively (BENNETT 1954; NIELSEN et al. 2004). Three markers generate eight different haplotypes, expressed as ABC, ABc, AbC, Abc, aBC, aBc, abC, and abc, whose frequencies are expressed, respectively, as
![]() | (10) |
These haplotype frequencies are used to describe the diplotype (and genotype) frequencies at the three markers given in Table 3. Let
be the observed number of observations of a three-marker genotype lAlBlC (lA = AA, Aa, aa; lB = BB, Bb, bb; lC = CC, Cc, cc). On the basis of the expected frequencies, generally expressed as
, as given in Table 3, we construct the multinomial log-likelihood by
![]() | (11) |
![]() | (12) |
, where
![]() | (13) |
|
Equations 13 and 12 construct a loop for iterations in the EM algorithm. Started with initial values, iterations are undertaken until converged estimates are obtained. The allele frequencies and linkage disequilibrium of different orders can be estimated from the MLEs of haplotype frequencies by
![]() | (14) |
The sampling variances for the MLEs of haplotype frequencies (and therefore allele frequencies and linkage disequilibria of different orders) in a three-locus analysis can be calculated from the Fisher information matrix.
The linkage disequilibria of different orders can be tested in general or individually. The existence of linkage disequilibria is tested using the following hypotheses:
![]() | (15) |
2-distribution with 4 d.f. The MLEs of allelic frequencies under H0 can be estimated using the EM algorithm described above, but with the constraints
![]() | (16) |
The dominant model:
Like two-locus linkage disequilibrium analysis, we derive the EM algorithm to estimate the haplotype frequencies for three-locus linkage disequilibrium analysis of dominant markers. When all three markers are dominant, expected genotype frequencies for codominant markers listed in Table 3 are collapsed to form the simpler Table 4. We use
to denote the observed number of observations of a three-dominant-marker genotype kA/kB/kC
. In this case, haplotype frequencies are estimated by
![]() | (17) |
![]() | (18) |
|
ABSTRACT
TWO-LOCUS LINKAGE DISEQUILIBRIUM...
THREE-LOCUS LINKAGE...
>MONTE CARLO SIMULATION
WORKED EXAMPLE
DISCUSSION
ACKNOWLEDGEMENTS
LITERATURE CITED
Two-locus LD analysis:
For cases of both high and low heterozygosity, LD is assumed to be 0.015. This assumed LD value corresponds to a larger normalized value (LEWONTIN 1964) for the two markers of lower heterozygosity than of higher heterozygosity. Table 5 gives the MLEs of the allele frequencies and linkage disequilibrium and the square roots of their mean square errors for two dominant markers. The model generally provides reasonable estimates of allele frequencies even when the sample size used is as low as 30. The accuracy and precision of the parameter estimation increase dramatically when the sample size increases to 100 for the markers of high heterozygosity, but to 200 for the markers of low heterozygosity. Regardless of the degree of heterozygosity, a small sample size cannot provide an unbiased estimate of LD. For the markers of high heterozygosity, a reasonable estimate of LD requires at least 200 individuals, whereas for the markers of low heterozygosity a doubled sample size is still not sufficient (Table 5). The simulation studies allow for the estimation of empirical power for the detection of significant LD. In general, the power is quite low for the assumed LD value, which, as expected, increases with the increase of sample size.
|
The same set of parameters was used to simulate codominant markers with the estimation results also tabulated in Table 5. The estimation accuracy and precision of all the parameters, particularly LD, can be better estimated for codominant than for dominant markers. A small sample size of 30 would be adequate for the reasonable estimate of LD for codominant markers regardless of their heterozygosity. Compared to dominant markers, codominant markers have higher power for the detection of LD, especially when the sample size is large. Different from dominant markers, allele frequencies and LD for codominant markers can be better estimated when the markers have lower heterozygosity (Table 5).
Three-locus LD analysis:
Similar simulation schemes are used for three-locus linkage disequilibrium analysis. More heterozygous markers are simulated by assuming allele frequencies pA = 0.5 and pB = pC = 0.6, whereas less heterozygous markers are simulated by assuming allele frequencies pA = 0.9 and pB = pC = 0.8. In both cases, linkage disequilibria are assumed to be DAB = DAC = DBC = 0.015 and DABC = 0.010. The results from the three-locus analysis are basically consistent with those from the two-locus analysis: For dominant markers, higher heterozygosity favors the estimation of the parameters in terms of estimation precision and power, whereas for codominant markers, lower heterozygosity favors the estimation (Table 6). Beyond two-locus analysis, three-locus analysis provides the estimation of more parameters related to linkage disequilibria and, thus, can be more informative for the understanding of population structure and organization.
|
Results from Tables 5 and 6 allow for the comparisons between two- and three-locus linkage disequilibrium analysis. Under the same condition, the precision of parameter estimation is similar between these two analysis approaches. But three-locus analysis displays much greater power for LD detection than two-locus analysis for codominant markers with a sample size of
100. For dominant markers, the power for LD detection can be increased only for the markers of high heterozygosity. Three-locus analysis does not improve the power for the dominant markers of low heterozygosity. An additional simulation study was conducted to estimate type I error rates of linkage disequilibrium analyses. Marker data were repeatedly (1000 times) simulated according to the scenarios as designed above for different sample sizes, marker heterozygosities, and marker types by assuming that there is no linkage disequilibrium between the markers. It appears that sample sizes do not largely affect type I error rates for linkage disequilibrium detection. Two- and three-locus analyses may incorrectly find significant linkage disequilibria at a similar rate (Table 7). There is more chance to obtain false-positive results for markers with a high heterozygosity (7–16%) than for those with a low heterozygosity (1–8%). Type I errors are slightly different between dominant and codominant markers, although the pattern of differences depends on the level of marker heterozygosity.
|
ABSTRACT
TWO-LOCUS LINKAGE DISEQUILIBRIUM...
THREE-LOCUS LINKAGE...
MONTE CARLO SIMULATION
>WORKED EXAMPLE
DISCUSSION
ACKNOWLEDGEMENTS
LITERATURE CITED
|
ABSTRACT
TWO-LOCUS LINKAGE DISEQUILIBRIUM...
THREE-LOCUS LINKAGE...
MONTE CARLO SIMULATION
WORKED EXAMPLE
>DISCUSSION
ACKNOWLEDGEMENTS
LITERATURE CITED
While high-throughput molecular marker techniques, such as SNPs, have been widely used to study the population structure and evolution of humans (STEPHENS et al. 2001; DAWSON et al. 2002), cheap dominant markers still serve as an important tool for genetic research in underrepresented species (KUANG et al. 1998). More importantly, the role of dominant markers in the characterization of biochemical functions has become more pronounced in the postgenomic era with the advent of proteomic techniques (THIELLEMENT et al. 1999; ZIVY and DE VIENNE 2000; CONSOLI et al. 2002). Statistical approaches have been derived to study the structure and organization of a population with individual dominant markers (LYNCH and MILLIGAN 1994; ZHIVOTOSKY 1999; HOLSINGER et al. 2002). However, the estimation of LD with multilocus dominant markers has not been well explored, although the importance of multilocus analysis in population genetic studies has been increasingly recognized (KREMER et al. 2005).
In this article, we present a statistical method for estimating and testing the LD between dominant markers by implementing the EM algorithm. With this algorithm, the frequencies of haplotypes constructed by a series of nonalleles at different markers can be precisely estimated and the estimated haplotype frequencies are further used to estimate allele frequencies and linkage disequilibria. Our presentation of the method was first based on informative codominant markers, for which the estimation of haplotype frequencies can be readily formulated, and then expanded to take into account dominant markers. Although our method was presented for two- and three-locus LD analysis, it is common to estimate linkage disequilibria among four or more markers. Perhaps, a general model should be derived that can estimate and test the LDs for an arbitrary number of codominant and dominant markers. Such a multilocus LD analysis by exploiting more information at one time has advantages for increasing the precision of parameter estimation and the power to detect significant LD. Also, more information, e.g., trigenic or higher-order LD, can be detected.
Extensive simulation studies have been performed to examine the robustness and properties of LD analysis with dominant markers under different degrees of heterozygosity and different sample sizes. The method proposed in this study has proved its power to estimate and test for the LD between dominant markers, but because of their inherent adequacy of information, the use of dominant markers to study the LD is limited under some circumstances, such as a small sample size and low heterozygosities of markers. According to the results from simulation studies, the following guidelines are recommended for dominant markers to be more effectively used in practice:
- For dominant markers of high heterozygosity, the sample size used for a two-locus LD analysis should be
200, whereas for dominant markers of low heterozygosity, the sample size should be
400.
- Because multilocus analysis can improve the test power, it is always recommended to perform the LD analysis with three or more dominant markers. Furthermore, multilocus analysis provide additional information about a web of nonrandom associations among multiple markers.
- Codominant markers are more powerful for the estimation and test of LD than dominant markers. It is recommended to generate a mix of dominant and codominant markers for a better characterization of the genetic structure of a population (e.g., see YAN et al. 1999).
ZHIVOTOVSKY (1999) and HOLSINGER et al. (2002) presented an approach for estimating population structure in diploids with individual dominant DNA markers. Our method has extended the use of dominant markers to characterize linkage disequilibrium among multiple loci. Although the focus in this study is on dominant markers, the methodological principle of the model proposed can be extended to consider a complex web of polygenic LD of different orders among multiallelic markers. For outcrossing populations, multiallelic markers, such as microsatellites, are crucial for the precise characterization of population structure and diversity. Our approach for LD analysis was derived within the maximum-likelihood context. Bayesian approaches that have several advantages over maximum likelihood (HOLSINGER et al. 2002; FU et al. 2005) may offer greater power to understand population structure, especially when a number of markers with multiple alleles are analyzed simultaneously. The computer code to perform linkage disequilibrium analyses can be requested from the corresponding author.
ABSTRACT
TWO-LOCUS LINKAGE DISEQUILIBRIUM...
THREE-LOCUS LINKAGE...
MONTE CARLO SIMULATION
WORKED EXAMPLE
DISCUSSION
>ACKNOWLEDGEMENTS
LITERATURE CITED
ABSTRACT
TWO-LOCUS LINKAGE DISEQUILIBRIUM...
THREE-LOCUS LINKAGE...
MONTE CARLO SIMULATION
WORKED EXAMPLE
DISCUSSION
ACKNOWLEDGEMENTS
>LITERATURE CITED
ARDLIE, K. G., L. KRUGLYAK and M. SEIELSTAD, 2002 Patterns of linkage disequilibrium in the human genome. Nat. Rev. Genet. 3: 299–309.[CrossRef][Medline]
BENNETT, J. H., 1954 On the theory of random mating. Ann. Eugen. 18: 311–317.
CONSOLI, L., A. LEFEVRE, M. ZIVY, D. DE VIENNE and C. DAMERVAL, 2002 QTL analysis of proteome and transcriptome variations for dissecting the genetic architecture of complex traits in maize. Plant Mol. Biol. 48: 575–581.[CrossRef][Medline]
DAWSON, E., G. R. ABECASIS, S. BUMPSTEAD, Y. CHEN, S. HUNT et al., 2002 A first-generation linkage disequilibrium map of human chromosome 22. Nature 418: 544–548.[CrossRef][Medline]
FU, R. W., D. K. DEY and K. E. HOLSINGER, 2005 Bayesian models for the analysis of genetic structure when populations are correlated. Bioinformatics 21: 1516–1529.
GABRIEL, S. B., S. F. SCHAFFNER, H. NGUYEN, J. M. MOORE, J. ROY et al., 2002 The structure of haplotype blocks in the human genome. Science 296: 2225–2229.
HANSSON, B., H. WESTERDAHL, D. HASSELQUIST, M. AKESSON and S. BENSCH, 2004 Does linkage disequilibrium generate heterozygosity-fitness correlations in great reed warblers? Evolution 58: 870–879.[CrossRef][Medline]
HILL, W. G., 1974 Estimation of linkage disequilibrium in randomly mating populations. Heredity 33: 229–239.[Medline]
HOLSINGER, K. E., P. O. LEWIS and D. K. DEY, 2002 A Bayesian approach to inferring population structure from dominant markers. Mol. Ecol. 11: 1157–1164.[CrossRef][Medline]
KREMER, A., H. CARON, S. CAVERS, N. COLPAERT, G. GHEYSEN et al., 2005 Monitoring genetic diversity in tropical trees with multilocus dominant markers. Heredity 95: 274–280.[CrossRef][Medline]
KUANG, H., T. E. RICHARDSON, S. D. CARSON and B. C. BONGARTEN, 1998 An allele responsible for seedling death in Pinus radiata D. Don. Theor. Appl. Genet. 96: 640–644.[CrossRef]
LEWONTIN, R. C., 1964 The interaction of selection and linkage. I. General considerations; heterotic models. Genetics 49: 49–67.
LIU, T., R. J. TODHUNTER, Q. LU, L. SCHOETTINGER, H. Y. LI et al., 2006 Extent and distribution of zygotic linkage disequilibrium in canine. Genetics 174: 439–453.
LYNCH, M., and B. MILLIGAN, 1994 Analysis of population genetic structure with RAPD markers. Mol. Ecol. 3: 91–99.[Medline]
LYNCH, M., and B. WALSH, 1998 Genetics and Analysis of Quantitative Traits. Sinauer Associates, Sunderland, MA.
MILLER, A. J., and B. A. SCHAAL, 2006 Domestication and the distribution of genetic variation in wild and cultivated populations of the Mesoamerican fruit tree Spondias purpurea L. (Anacardiaceae). Mol. Ecol. 15: 1467–1480.[CrossRef][Medline]
NEI, M., 1987 Molecular Evolutionary Genetics. Columbia University Press, New York.
NIELSEN, D. M., M. D. EHM, D. V. ZAYKIN and B. S. WEIR, 2004 Effect of two- and three-locus linkage disequilibrium on the power to detect marker/phenotype associations. Genetics 168: 1029–1040.
PRITCHARD, J. K., and M. PRZEWORSKI, 2001 Linkage disequilibrium in humans: models and data. Am. J. Hum. Genet. 69: 1–14.[CrossRef][Medline]
RAFALSKI, A., and M. MORGANTE, 2004 Corn and humans: recombination and linkage disequilibrium in two genomes of similar size. Trends Genet. 20: 103–111.[CrossRef][Medline]
REICH, D. E., M. CARGILL, S. BOLK, J. IRELAND, P. C. SABETI et al., 2001 Linkage disequilibrium in the human genome. Nature 411: 199–204.[CrossRef][Medline]
REMINGTON, D. L., J. M. THORNSBERRY, Y. MATSUOKA, L. M. WILSON, S. R. WHITT et al., 2001 Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc. Natl. Acad. Sci. USA 98: 11479–11484.
SILBIGER, R. N., S. A. CHRIST, A. C. LEONARD, M. GARG, D. L. LATTIER et al., 1998 Preliminary studies on the population genetics of the central stoneroller (Campostoma anomalum) from the Great Miami River Basin, Ohio. Environ. Monit. Assess. 51: 481–495.[CrossRef]
STEPHENS, J. C., J. A. SCHNEIDER, D. A. TANGUAY, J. CHOI, T. ACHARYA et al., 2001 Haplotype variation and linkage disequilibrium in 313 human genes. Science 293: 489–493.
THIELLEMENT, H., N. BAHRMAN, C. DAMERVAL, C. PLOMION, M. ROSSIGNOL et al., 1999 Proteomics for genetic and physiological studies in plants. Electrophoresis 20: 2013–2026.[CrossRef][Medline]
TISHKOFF, S. A., and S. M. WILLIAMS, 2002 Genetic analysis of African populations: human evolution and complex disease. Nat. Rev. Genet. 3: 611–621.[CrossRef][Medline]
TISHKOFF, S. A., E. DIETZSCH, W. SPEED, A. J. PAKSTIS, K. CHEUNG et al., 1996 Global patterns of linkage disequilibrium at the CD4 locus and modern human origins. Science 271: 1380–1387.[Medline]
TISHKOFF, S. A., R. VARKONYI, N. CAHINHINAN, S. ABBES, G. ARGYROPOLOUS et al., 2001 Haplotype diversity and linkage disequilibrium at human G6PD: recent origin of alleles that confer malarial resistance. Science 293: 455–462.
VOS, P., R. HOGERS, M. BLEEKER, M. REIJANS, T. VAN DE LEE et al., 1995 AFLP: a new technique for DNA fingerprinting. Nucleic Acids Res. 23: 4407–4414.
WANG, Z. H., and R. L. WU, 2004 A statistical model for high-resolution mapping of quantitative trait loci determining human HIV-1 dynamics. Stat. Med. 23: 3033–3051.[CrossRef][Medline]
WEISS, K. M., and A. G. CLARK, 2002 Linkage disequilibrium and the mapping of complex human traits. Trends Genet. 18: 19–24.[CrossRef][Medline]
WILLIAMS, J. G. K., A. R. KUBELKI, K. J. LIVAK and S. V. TINGEY, 1990 DNA polymorphisms amplified by arbitrary primers are useful as genetic markers. Nucleic Acids Res. 18: 6531–6535.
YAN, G., J. ROMERO-SEVERSON, M. WALTON, D. D. CHADEE and D. W. SEVERSON, 1999 Population genetics of the yellow fever mosquito in Trinidad: comparisons of amplified fragment length polymorphism (AFLP) and restriction fragment length polymorphism (RFLP) markers. Mol. Ecol. 8: 951–963.[CrossRef][Medline]
ZHIVOTOVSKY, L. A., 1999 Estimating population structure in diploids with multilocus dominant DNA markers. Mol. Ecol. 8: 907–913.[CrossRef][Medline]
ZIVY, M., and D. DE VIENNE, 2000 Proteomics: a link between genomics, genetics and physiology. Plant Mol. Biol. 44: 575–580.[CrossRef][Medline]
Communicating editor: R. NIELSEN
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
-
All Versions of this Article:
genetics.106.068890v1
176/3/1811 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Li, Y.
- Articles by Wu, R.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Li, Y.
- Articles by Wu, R.



















