| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 177, 457-468, September 2007, Copyright © 2007
doi:10.1534/genetics.107.074054
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






,2
* Department of Crop and Soil Science, Oregon State University, Corvallis, Oregon 97331,
Advanta Seeds UK, Norfolk, PE31 8LS, United Kingdom,
Advanta Semillas, Balcarce Research Station, Argentina,
Center for Applied Genetic Technologies, The University of Georgia, Athens, Georgia 30602, ** Department of Biochemistry and Molecular Biology, The University of Nevada, Reno, Nevada 89557 and 
Department of Plant Biology, The University of Georgia, Athens, Georgia 30602
2 Corresponding author: Center for Applied Genetic Technologies, 111 Riverbend Road, The University of Georgia, Athens, GA 30602.
E-mail: sjknapp{at}uga.edu.
| ABSTRACT |
|---|
|
|
|---|
= 0.0094) than wild populations (
= 0.0128). Mean haplotype diversity was 0.74. When extraploted across the genome (
3500 Mbp), sunflower was predicted to harbor at least 76.4 million common SNPs among modern cultivar alleles. LD decayed more slowly in inbred lines than wild populations (mean LD declined to 0.32 by 5.5 kbp in the former, the maximum physical distance surveyed), a difference attributed to domestication and breeding bottlenecks. SNP frequencies and LD decay are sufficient in modern sunflower cultivars for very high-density genetic mapping and high-resolution association mapping.
SNP abundance and LD decay are highly variable in eukaryotic genomes and affected by natural, domestication, and breeding history, mating systems, mutation, migration, genomic rearrangements, recombination, and other factors (CHAPMAN and THOMPSON 2001; HUDSON 2001; BUCKLER and THORNSBERRY 2002; STUMPF 2002; GREENWOOD et al. 2004; RAFALSKI and MORGANTE 2004). Typically, SNPs are less abundant, and LD decays more slowly in autogamous than allogamous species, domesticated than wild genotypes, and inbred than outbred genotypes (CHING et al. 2002; NORDBERG et al. 2002; NORDBERG and TAVARE 2002; FLINT-GARCIA et al. 2003; RAFALSKI and MORGANTE 2004; SHIFMAN et al. 2003; INGVARSSON 2005). For example, SNPs are significantly more frequent in maize (Zea mays L.; 1 SNP/61 bp), a predominantly allogamous species, than soybean (Glycine max L.; 1 SNP/273 bp to 1 SNP/343 bp), a predominantly autogamous species (REMINGTON et al. 2001; TENAILLON et al. 2001, 2002; CHING et al. 2002; ZHU et al. 2003; VAN et al. 2005). Moreover, LD decays more slowly (persists over much longer tracts of DNA) in soybean (>50 kbp) than maize (400–1500 bp). LD decayed more rapidly in exotic outbred germplasm than elite inbred lines in maize, a difference attributed to the effects inbreeding and selection (CHING et al. 2002; RAFALSKI and MORGANTE 2004). The persistence of LD decreases the density of DNA marker loci needed for identifying phenotypic–genotypic associations, but decreases resolution (CARDON and BELL 2001; CARDON and ABECASIS 2003; RAFALSKI and MORGANTE 2004).
Sunflower (Helianthus annuus L.), a predominantly allogamous species, should display patterns of nucleotide diversity and LD similar to maize and other allogamous species. Genetic diversity in modern sunflower cultivars (elite oilseed inbred lines and hybrids) has been shaped by domestication and breeding, as well as the introgression of alleles from wild and exotic germplasm (migration) (CHERES and KNAPP 1998; TANG and KNAPP 2003; HARTER et al. 2004; BURKE et al. 2005). Domestication and breeding create population bottlenecks, decrease genetic diversity, and increase LD, whereas migration increases genetic diversity and decreases LD (CHING et al. 2002; RAFALSKI and MORGANTE 2004). The abundance and distribution of SNPs in elite oilseed inbred line alleles has only been reported for a few genic loci in sunflower (KOLKMAN et al. 2004; HASS et al. 2006; SCHUPPERT et al. 2006; TANG et al. 2006b), and LD has only been surveyed in Native American land races and other exotic cultivars and wild populations (LIU and BURKE 2006). KOLKMAN et al. (2004) found significant differences in SNP frequencies among acetohydroxyacid synthase alleles resequenced from inbred lines and wild populations, a pattern predicted from analyses of SSR diversity (TANG and KNAPP 2003). LIU and BURKE (2006) surveyed nucleotide diversity and LD in nine genic loci in wild populations and exotic germplasm accessions (Native American land races and prehybrid era open-pollinated confectionery and oilseed cultivars); only one elite inbred line allele (HA89) was resequenced. SNPs were twofold more abundant in wild populations (1 SNP/19.9 bp) than exotic germplasm accessions (1 SNP/38.8 bp), exotic alleles harbored half of the nucleotide diversity found in wild alleles, and LD decayed within
200 bp in wild alleles and
1100 bp in exotic alleles. Here, we report SNP frequencies, nucleotide diversity, and LD in elite sunflower inbred lines alleles resequenced from 82 previously mapped restriction fragment length polymorphism (RFLP) marker loci distributed throughout the sunflower genome (2n = 2x = 34) (BERRY et al. 1995; GEDIL et al. 2001; YU et al. 2002, 2003).
| MATERIALS AND METHODS |
|---|
|
|
|---|
e–15 (ALTSCHUL et al. 1990; ALTSCHUL and GISH 1996; MCGINNIS and MADDEN 2004). The probe insert sequences were used as templates for designing resequencing primers using Primer3 (http://frodo.wi.mit.edu) and manual selection. Forward and reverse primer sites were chosen as close as possible to opposite ends of the reference allele sequences so as to amplify the longest DNA fragments possible from each locus (supplemental Table 1). Genomic DNA fragments were amplified using long-distance PCR (LD-PCR) (BARNES 1994) in most cases and PCR in a few cases. PCRs and LD-PCRs were performed by adding 30–60 ng of genomic DNA to a 20-µl PCR mix containing 1x buffer, 2 mM MgSO4, 0.3 mM dNTPs, 0.3 µM of forward and reverse primers, 0.5 U of Platinum Taq DNA Polymerase High Fidelity (Invitrogen, Carlsbad, CA), and dH2O to a final volume of 20 µl. For LD-PCR, genomic DNAs were amplified using one cycle at 94° for 4 min, followed by 10 cycles at 94° for 10 sec, 58° for 1 min, and 68° for up to 12 min (1 min per kb), 25 cycles at 94° for 10 sec, 58° for 1 min, and 68° for up to 12 min plus 10 sec per cycle, and one cycle at 72° for 20 min; annealing temperatures ranged from 55° to 62°, and extension times ranged from 2 to 12 min. Genomic DNA amplicons were cloned using the Invitrogen TOPO TA-cloning method. We selected and single-pass sequenced a single clone for each genotype by amplicon combination from one or both ends at the University of Nevada, Reno Genomics Center on an Applied Biosystems Prism 3730 DNA Sequencer (Foster City, CA). By sequencing a single cloned amplicon, we acquired a single phase known allele from each genotype.
|
|
and
) were estimated for synonymous, nonsynonymous, and silent (synonymous and noncoding) sites, where
is the mean number of nucleotide differences per site between two allele sequences (NEI 1987), and
is the mean number of segregating sites (WATTERSON 1975; HALUSHKA et al. 1999). Haplotype diversity was estimated as described by NEI (1987). LD analyses were performed on RSAs >1 kbp in length harboring at least 10 polymorphic sites. The physical distances separating pairs of polymorphic sites between independent RSAs amplified from opposite ends of a locus were estimated from DNA fragment length estimates (supplemental Table 1 at http://www.genetics.org/supplemental/). Using DnaSP, we estimated the minimum number of recombination events (RM) in inbred line alleles using the four-gamete test (HUDSON and KAPLAN 1985), proportion of adjacent polymorphisms in perfect disequilibrium (B) (WALL 1999), and strength of LD between pairs of polymorphic sites (estimated as the squared allele frequency correlation, r2) (WEIR 1996). The decay of LD against physical distance was modeled using nonlinear regression methods described by REMINGTON et al. (2001). Briefly, SAS PROC NLIN (Cary, NC) was used to fit r2 estimates (pooled across loci) to a model of the expected level of r2 at drift-recombination equilibrium, allowing for a low level of mutation and finite sample size (see Appendix 2 of HILL and WEIR 1988). Although factors such as the nonindependence of linked sites and nonequilibrium populations can reduce the precision of such analyses and introduce bias, they are still useful for investigating the overall rate of decay of LD (see INGVARSSON 2005).
| RESULTS |
|---|
|
|
|---|
The 82 RFLP markers were known to be low-copy and polymorphic among elite inbred lines (BERRY et al. 1994, 1995). The primer pairs selected for allele resequencing produced amplicons ranging in length from 97 to
10,000 bp across loci (Figure 1; supplemental Table 1 at http://www.genetics.org/supplemental/). Of the 82 primer pairs, 77 produced paralog-specific amplicons and 31 of the 77 spanned introns, INDELs, or both (amplicon lengths are shown in supplemental Table 1). By sequencing cloned amplicons, a single phase-known allele was resequenced from each genotype (Table 2). Collectively, 1312 RSAs and 129 DNA sequence alignments were produced for 81 of the 82 loci; allele sequences could not be produced for the ZVG46 locus (GenBank accession nos. EF469941–EF462190; allele sequence alignments are displayed in supplemental Figure 1). Nucleotide polymorphisms were surveyed in 84 to 100 DNA sequences/genotype and 49.4 kbp of DNA sequence/genotype (Table 3). Nucleotide diversity analyses were performed on 107 DNA sequence alignments comprised of 6 to 10 inbred line allele sequences each from 71 of the 81 resequenced loci. The other 22 DNA sequence alignments were either comprised of 6 or fewer inbred line allele sequences, paralogous RSAs, or both specific and nonspecific RSAs.
|
|
The mean number of segregating sites was
= 0.0094, and the mean number of pairwise sequence differences was
= 0.0107 among RSAs (Table 3). Nucleotide diversity was twofold greater in noncoding than coding sequences, sixfold greater for SNPs (
= 0.0092) than INDELs (
= 0.0016), and greatest in introns (
= 0.01480). Nonsynonymous substitutions (
nonsyn = 0.0028) were sixfold less prevalent than synonymous substitutions (
syn = 0.0176), suggesting variability among loci has primarily been produced by purifying selection (Figure 2; Table 3).
nonsyn ranged from 0.0 to 0.055, and
syn ranged from 0.0 to 0.109 among RSAs (nucleotide diversity statistics for individual RSAs are shown in supplemental Table 2 at http://www.genetics.org/supplemental/). Only two RSAs had
nonsyn/
syn ratios >1.0 (
nonsyn/
syn = 1.12 for ZVG47 and 1.10 for ZVG80-R) (supplemental Table 2).
|
silent ranged from 0.0008 to 0.109 among RSAs, a 136-fold difference (Figure 3; supplemental Table 2 at http://www.genetics.org/supplemental/). RSAs on two linkage groups (6 and 15), as a whole, had significantly fewer silent substitutions than RSAs on the other 15 linkage groups.
silent ranged from 0.0029 for ZVG28-F to 0.0113 for ZVG27 on linkage group (LG) 6 and from 0.0022 for ZVG69-R to 0.0147 for ZVG70-R on LG 15.
|
0.125) were not counted so as to minimize false positives (sequencing errors) and avoid upwardly biasing SNP frequencies and downwardly biased SNP heterozygosities.
|
|
|
| DISCUSSION |
|---|
|
|
|---|
= 0.0094 in elite inbred lines vs.
= 0.0128 in wild progenitors (Table 3) (LIU and BURKE 2006). Surprisingly, nucleotide diversity was estimated to be 1.7-fold greater in elite inbred lines than primitive and early open-pollinated (OP) cultivars (
= 0.0056) (Tables 2 and 3) (LIU and BURKE 2006). While the latter estimate was based on data from a smaller number of genes, this finding suggests that the land races and early OP cultivars supplied only a fraction of the genetic diversity found in elite inbred lines. The germplasm underlying modern oilseed sunflower cultivars was not founded by direct selection in primitive and early OP cultivars alone, but through breeding in elite and exotic germplasm (CHERES and KNAPP 1998). Although the early history of sunflower breeding is incomplete, our data support the notion that genetic diversity in modern cultivars has been supplemented by the introgression of wild and exotic alleles. Because the sunflower domestication syndrome is complex, the number of loci under selection in wide hybrids in contemporary oilseed sunflower breeding programs is predicted to be large; at least 14 of the 17 chromosomes are known to harbor phenotypic and quantitative trait loci for domestication and confectionery traits and should be under strong selection in oilseed sunflower breeding programs (BURKE et al. 2002, 2005; GANDHI et al. 2005; TANG et al. 2006a). The introgression of wild alleles into modern oilseed sunflower inbred lines has produced a patchwork of elite and wild alleles. Unique haplotypes were found in one or more inbred lines for several of the loci sampled (Figure 5; supplemental Figure 1 at http://www.genetics.org/supplemental/). As noted earlier, sampling may partly underlie differences between the present analysis and the work of LIU and BURKE (2006). Here, we resequenced alleles from a sample of 81 previously mapped genic loci and performed analyses on 107 fragments amplified from 71 loci (BERRY et al. 1995; GEDIL et al. 2001), whereas LIU and BURKE (2006) resequenced alleles from a random sample of nine genic loci. Moreover, because RFLP markers for the former were known to be polymorphic among oilseed inbred lines (BERRY et al. 1994, 1995), the resequenced loci could be more polymorphic, as a whole, than a random sample of loci. As a point of comparison, SSRs revealed greater diversity in land races than oilseed inbred lines (TANG and KNAPP 2003; HARTER et al. 2004).
Nucleotide diversity in autogamous and allogamous plant species:
Nucleotide diversity in sunflower is slightly lower than maize (REMINGTON et al. 2001; TENAILLON et al. 2001, 2002; CHING et al. 2002; LIU and BURKE 2006; BUCKLER et al. 2006), two- to fivefold greater than other domesticated grasses (BUCKLER et al. 2001), eight- to 10-fold greater than soybean (ZHU et al. 2003; VAN et al. 2005), and several-fold greater than other autogamous plant species (KANAZIN et al. 2002; GARRIS et al. 2003; HAMBLIN et al. 2004). Observed SNP frequencies seem to be comparable in sunflower and maize inbred lines. SNP frequencies were 1/32 bp in noncoding and 1/63 bp in coding sequences in sunflower inbred lines (Table 3) and 1/31 bp in noncoding and 1/124 bp in coding sequences in maize inbred lines (CHING et al. 2002). SNP frequencies, however, are sensitive to the number of genotypes sampled (larger samples have a greater likelihood of capturing rare SNPs), and the studies referenced above differed widely in terms of sampling strategies. However, because
is roughly proportional to heterozygosity, the expected number of nucleotide differences between a randomly selected pair of alleles can be estimated. For sunflower, a randomly selected pair of elite alleles is expected to differ at 1 out of every 106 nucleotides (i.e., 1/0.0094 = 106.4), whereas corn is expected to differ at 1 out of every 105 nucleotides (TENAILLON et al. 2001), and soybean is expected to differ at 1 out of every 1,030 nucleotides (ZHU et al. 2003). Hence, SNP frequencies seem to be sufficient in the modern sunflower cultivars for the development of SNP genotyping assays for most loci and for very high density genetic mapping using highly parallel SNP genotyping methods (BOREVITZ et al. 2003; HAZEN and KAY 2003; WINZELER et al. 2003; WERNER et al. 2005; GUNDERSON et al. 2006; SYVANEN 2001, 2005).
SNP abundance in sunflower and other plant genomes:
The genic loci we sampled supply an estimate of the number of common SNPs in the sunflower genome. With 3500 Mbp of DNA in the nuclear genome (BAACK et al. 2005) and 1078 SNPs in the 49.4 kbp sample of DNA surveyed in the present study (Table 3), modern sunflower cultivars are predicted to harbor at least 76.4 million SNPs (3,500,000,000 bp/49,400 bp x 1078 SNPs). When translated into genetic distance, modern cultivars are predicted to harbor at least 54,571 SNPs/cM, assuming 1400 cM in the sunflower genome (TANG et al. 2002; YU et al. 2002, 2003). These estimates assume the loci sampled are typical of DNA as a whole in sunflower and do not account for rare SNPs below the threshold of detection in our study (fr < 0.125). If the inbred lines we selected under represent allelic diversity in modern cultivars, and the protein coding loci we selected are less polymorphic than noncoding DNA in sunflower, the number of common SNPs will be >76.4 million. Conversely, if the loci selected for resequencing are more polymorphic than the balance of the genome, 76.4 million may overestimate the number of common SNPs in modern cultivars. Cultivated soybean, which is significantly less polymorphic than cultivated sunflower, is predicted to harbor 4–5 million SNPs in 1115 Mbp of DNA (ZHU et al. 2003; YOON et al. 2007), whereas maize inbred lines, with 114 SNPs in 6935 bp of DNA (CHING et al. 2002), is predicted to harbor 41 million SNPs in 2500 Mbp of DNA. The number of SNPs in sunflower, per Mbp of DNA (21,800/Mbp), is estimated to be five- to sixfold greater than soybean (3587–4484/Mbp) and 1.3-fold greater than maize (16,400/Mbp). Hence, the predicted number of SNPs in cultivated and wild sunflower is on par with the most polymorphic plant species surveyed so far (BUCKLER et al. 2001; REMINGTON et al. 2001; TENAILLON et al. 2001, 2002; CHING et al. 2002; KANAZIN et al. 2002; GARRIS et al. 2003; HAMBLIN et al. 2004; ZHU et al. 2003; BUCKLER et al. 2006; LIU and BURKE 2006; VAN et al. 2005).
Nucleotide and haplotype diversity within and between heterotic groups:
Two wild alleles (ANN1238 and ANN1811) were resequenced to supply a benchmark for assessing differences in haplotype structure, SNP frequencies, and nucleotide and haplotype diversities between elite and wild sunflower alleles. Similar to maize (CHING et al. 2002), we identified a small number of distinct haplotypes (one to nine) among inbred line alleles, where intralocus SNPs comprising haplotypes were in LD (supplemental Figure 1 at http://www.genetics.org/supplemental/). Selection for seed yield and hybrid seed production traits has created broad female (B) and male (R) heterotic groups in sunflower (BERRY et al. 1994; GENTZBITTEL et al. 1994; HONGTRAKUL et al. 1997; CHERES and KNAPP 1998). Significant genetic diversity has apparently been preserved in a small number of highly divergent B- and R-line haplotypes in sunflower, where haplotype divergence is greater between than within heterotic groups (supplemental Figure 1). While heterotic groups seem to be much less sharply differentiated in sunflower than maize, patterns of genetic diversity and haplotype divergence seem to be similar within and between heterotic groups in both species (TENAILLON et al. 2001, 2002; YU et al. 2002, 2003; LIU et al. 2003; REIF et al. 2003; JUNG et al. 2004; CHING et al. 2002). By contrast, haplotypes seem to be unstructured in the wild progenitor of maize (WHITE and DOEBLEY 1999; LIU et al. 2003) and wild sunflower (SLABAUGH et al. 2003; TANG and KNAPP 2003; KOLKMAN et al. 2004; LIU and BURKE 2006). While we only sampled two wild alleles/locus, wild haplotypes for two-thirds of the loci were unique (Figure 5; supplemental Figure 1).
Heterozygosity and haplotype diversity:
SNPs and other biallelic DNA markers are, as a whole, less informative than mulitallelic RFLP and SSR markers; however, when multiple SNPs in haplotype blocks are genotyped, the informativeness of SNP haplotypes should be comparable to RFLP and SSR markers (CHING et al. 2002). The inbred lines selected for allele resequencing (Table 2) were predicted from pedigree and RFLP, AFLP, and SSR marker diversity analyses to broadly sample genetic diversity, capture a significant percentage of the nucleotide diversity in elite inbred lines, and to be minimally redundant (BERRY et al. 1994; GENTZBITTEL et al. 1994; CHERES and KNAPP 1998; GEDIL et al. 2001; YU et al. 2002, 2003; TANG and KNAPP 2003). SNP heterozygosities and haplotype diversities were therefore expected to be greater among the resequenced inbred line alleles than among a random sample of inbred line alleles (Figures 3 and 4; supplemental Figure 1 at http://www.genetics.org/supplemental/). The probability of observing RFLP or SSR polymorphisms between two inbred lines (hp) has been in the 0.32–0.53 range in several inbred line surveys in sunflower (BERRY et al. 1994; GENTZBITTEL et al. 1994; YU et al. 2002a,b; TANG and KNAPP 2003). The probability of observing different SNP haplotypes (ps) between two inbred lines was 0.57 in the present study and thus slightly greater than hp for RFLP and SSR markers (supplemental Figure 1). The difference could be an artifact of sampling differences; we selected inbred lines to minimize redundancy and maximize uniqueness, whereas several inbred lines within heterotic groups were sampled in previous RFLP and SSR diversity surveys, thereby increasing redundancy and decreasing heterozygosity. With deeper sampling, haplotype diversity should decrease, whereas the number of haplotypes should not substantially increase (CHING et al. 2002; ZHU et al. 2003; VAN et al. 2005); deeper sampling is predicted to identify less common alleles introgressed into elite inbred lines from exotic germplasm sources.
LD:
The rate of decay of LD affects the resolution of association mapping analyses and the density of DNA markers needed for identifying phenotype–genotype associations (JORDE 1995, 2000; BUCKLER and THORNSBERRY 2002; NORDBORG et al. 2002; CHING et al. 2002; RAFALSKI and MORGANTE 2004; BUCKLER et al. 2006). The rapid decay of LD in wild sunflower and maize alleles (CHING et al. 2002; LIU and BURKE 2006) facilitates very high-resolution association mapping; however, concomitantly high DNA marker densities are needed for discovering associations (RISCH 2000; CARDON and BELL 2001; JOHNSON et al. 2001; STUMPF 2002; GREENWOOD et al. 2004; WEIGEL and NORDBORG 2005; KIM et al. 2006). Lower DNA marker densities are needed for association mapping in species where LD persists over greater physical distances, although resolution decreases (CARDON and ABECASIS 2003). Our results indicate that LD persists over longer tracts of DNA in inbred lines than primitive and early open-pollinated cultivars and wild populations in sunflower (LIU and BURKE 2006). LD decayed to r2 = 0.1 by 200 bp in wild populations and 1100 bp in OP cultivars (LIU and BURKE 2006), but only decayed to 0.32 by 5500 bp in inbred lines in our study, the longest physical distance surveyed (Figure 6); analyses of longer tracts of DNA are needed to more thoroughly assess LD decay in inbred lines. While there was significant LD variability among loci, the slower decay in sunflower inbred lines can be attributed to population bottlenecks produced by inbreeding and artificial selection, a common phenomenon in domesticated species where intense selection has been practiced for many generations (BUCKLER et al. 2001; CHING et al. 2002; DOEBLEY et al. 2006). Whether analyses are done in domesticated or wild germplasm, very high DNA marker densities are needed for association mapping in sunflower, a species with ample diversity to support such analyses.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
1 Present address: Department of Plant Pathology, Cornell University, Ithaca, NY 14853. ![]()
| LITERATURE CITED |
|---|
|
|
|---|
ALTSCHUL, S. F., and W. GISH, 1996 Local alignment statistics. Meth. Enzymol. 266: 460–480.[Medline]
ALTSCHUL, S. F., W. GISH, W. MILLER, E. W. MYERS and D. J. LIPMAN, 1990 Basic local alignment search tool. J. Mol. Biol. 215: 403–410.[CrossRef][Medline]
AQUADRO C. F., V. BAUER DUMONT and F. A. REED, 2001 Genome-wide variation in the human and fruitfly: a comparison. Curr. Opin. Genet. Dev. 11: 627–634.[CrossRef][Medline]
BAACK, E. J., K. D. WHITNEY and L. H. RIESEBERG, 2005 Hybridization and genome size evolution: timing and magnitude of nuclear DNA content increases in Helianthus homoploid hybrid species. New Phytol. 167: 623–630.[CrossRef][Medline]
BARNES, W. M., 1994 PCR amplification of up to 35-kb DNA with high fidelity and high yield from bacteriophage templates. Proc. Natl. Acad. Sci. USA 91: 2216–2220.
BERRY, S. T., R. J. ALLEN, S. R. BARNES and P. D. S. CALIGARI, 1994 Molecular marker analysis of Helianthus annuus L. 1. Restriction fragment length polymorphisms between inbred lines of cultivated sunflower. Theor. Appl. Genet. 89: 435–441.
BERRY, S. T., A. J. LEON, C. C. HANFREY, P. CHALLIS, A. BURKHOLZ et al., 1995 Molecular marker analysis of Helianthus annuus L. 2. Contruction of an RFLP linkage map for cultivated sunflower. Theor. Appl. Genet. 91: 195–199.
BHATTRAMAKKI, D., M. DOLAN, M. HANAFEY, R. WINELAND, D. VASKE et al., 2002 Insertion-deletion polymorphisms in 3' regions of maize genes occur frequently and can be used as highly informative genetic markers. Plant Mol. Biol. 48: 539–547.[CrossRef][Medline]
BOREVITZ, J. O., D. LIANG, D. PLOUFFE, H. S. CHANG, T. ZHU et al., 2003 Large-scale identification of single-feature polymorphisms in complex genomes. Genome Res. 13: 513–523.
BUCKLER, E. S., and J. M. THORNSBERRY, 2002 Plant molecular diversity and applications to genomics. Curr. Opin. Plant Biol. 5: 107–111.[CrossRef][Medline]
BUCKLER, E. S., J. M. THORNSBERRY and S. KRESOVICH, 2001 Molecular diversity, structure and domestication of grasses. Genet. Res. 77: 213–218.[CrossRef][Medline]
BUCKLER, E. S., B. S. GAUT and M. D. MCMULLEN, 2006 Molecular and functional diversity of maize. Curr. Opin. Plant. Biol. 9: 172–176.[CrossRef][Medline]
BURKE, J. H., S. J. KNAPP and L. H. RIESEBERG, 2005 Genetic consequences of selection during the evolution of cultivated sunflower. Genetics 171: 1933–1940.
BURKE, J. M., S. TANG, S. J. KNAPP and L. H. RIESEBERG, 2002 Genetic analysis of sunflower domestication. Genetics 161: 1257–1267.
CARDON, L. R., and G. R. ABECASIS, 2003 Using haplotype blocks to map human complex loci. Trends Genet. 19: 135–140.[CrossRef][Medline]
CARDON, L. R., and J. I. BELL, 2001 Association study designs for complex diseases. Nat. Rev. Genet. 2: 91–99.[CrossRef][Medline]
CHAPMAN, N. H., and E. A. THOMPSON, 2001 Linkage disequilibrium mapping: the role of population history, size, and structure. Adv. Genet. 42: 413–437.[Medline]
CHERES, M., and S. J. KNAPP, 1998 Ancestral origins and genetic diversity of cultivated sunflower: coancestry analysis of public germplasm sources. Crop Sci. 38: 1476–1482.
CHING, A., K. S. CALDWELL, M. JUNG, M. DOLAN, O. S. SMITH et al., 2002 SNP frequency, haplotype structure and linkage disequilibrium in elite maize inbred lines. BMC Genet. 3: 19.[CrossRef][Medline]
COLLINS, F. S., L. D. BROOKS and A. CHAKRAVARTI, 1998 A DNA polymorphism discovery resource for research on human genetic variation. Genome Res. 8: 1229–1231.
DOEBLEY, J. F., B. S. GAUT and B. D. SMITH, 2006 The molecular genetics of crop domestication. Cell 127: 1309–1321.[CrossRef][Medline]
EWING, B., and P. GREEN, 1998 Basecalling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8: 186–194.
EWING, B., L. HILLIER, M. WENDL and P. GREEN, 1998 Basecalling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8: 175–185.
FLINT-GARCIA, S. A., J. M. THORNSBERRY and E. S. BUCKER, 2003 Structure of linkage disequilibrium in plants. Annu. Rev. Plant. Biol. 54: 357–374.[CrossRef][Medline]
GANDHI, S. D., A. F. HEESACKER, C. A. FREEMAN, J. ARGYRIS, K. BRADFORD et al., 2005 The self-incompatibility locus (S) and quantitative trait loci for self-pollination and seed dormancy in sunflower. Theor. Appl. Genet. 111: 619–629.[CrossRef][Medline]
GARRIS, A. J., S. R. MCCOUCH and S. KRESOVICH, 2003 Population structure and its effect on haplotype diversity and linkage disequilibrium surrounding the xa5 locus of rice (Oryza sativa L.). Genetics 165: 759–769.
GEDIL, M. A., C. WYE, S. BERRY, B. SEGERS, J. PELEMAN et al., 2001 An integrated restriction fragment length polymorphism–amplified fragment length polymorphism linkage map for cultivated sunflower. Genome 44: 213–221.[Medline]
GENTZBITTEL, L., Y. X. ZHANG, F. VEAR, B. GRIVEAU and P. NICOLAS, 1994 RFLP studies of genetic relationships among inbred lines of the cultivated sunflower, Helianthus annuus L.: evidence fore distinct restorer and maintainer germplasm pools. Theor. Appl. Genet. 92: 419–425.
GREENWOOD, T. A., B. K. RANA and N. J. SCHORK, 2004 Human haplotype block sizes are negatively correlated with recombination rates. Genome Res. 14: 1358–1361.
GUNDERSON, K. L., F. J. STEEMERS, H. REN, P. NG, L. ZHOU et al., 2006 Whole-genome genotyping. Methods Enzymol. 410: 359–376.[Medline]
HALUSHKA, M. K., J. B. FAN, K. BENTLEY, L. HSIE, N. SHEN et al., 1999 Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis. Nat. Genet. 22: 239–247.[CrossRef][Medline]
HAMBLIN, M. T., S. E. MITCHELL, G. M. WHITE, J. GALLEGO, R. KUKATLA et al., 2004 Comparative population genetics of the panicoid grasses: sequence polymorphism, linkage disequilibrium and selection in a diverse sample of Sorghum bicolor. Genetics 167: 471–483.
HARTER, A. V., K. A. GARDNER, D. FALUSH, D. L. LENTZ, R. A. BYE et al., 2004 Origin of extant domesticated sunflowers in eastern North America. Nature 430: 201–205.[CrossRef][Medline]
HASS, C., S. TANG, S. LEONARD, J. F. MILLER, M. TRABER et al., 2006 Three non-allelic epistatically interacting methyltransferase mutations produce novel tocopherol (vitamin E) profiles in sunflower. Theor. Appl. Genet. 113: 767–782.[CrossRef][Medline]
HAZEN, S. P., and S. A. KAY, 2003 Gene arrays are not just for measuring gene expression. Trends Plant Sci. 8: 413–416.[CrossRef][Medline]
HILL, W. G., and B. S. WEIR, 1988 Variances and covariances of squared linkage disequilibria in finite populations. Theor. Popul. Biol. 33: 54–78.[CrossRef][Medline]
HONGTRAKUL, V., G. HUESTIS and S. J. KNAPP, 1997 Amplified fragment length polymorphisms as a tool for DNA fingerprinting sunflower germplasm: genetic diversity among oilseed inbred lines. Theor. Appl. Genet. 95: 400–407.[CrossRef]
HUDSON, R. R., 2001 Linkage disequilibrium and recombination, pp. 309–324 in Handbook of Statistical Genetics, edited by D. J. BALDING, M. BISHOP and C. CANNINGS. John Wiley and Sons, Chichester, UK.
HUDSON, R. R., and N. L. KAPLAN, 1985 Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111: 147–164.
INGVARSSON, P. K., 2005 Nucleotide polymorphism and linkage disequilibrium within and among natural populations of European aspen (Populus tremula L., Salicaceae). Genetics 169: 945–953.
JOHNSON, G. C., L. ESPOSITO, B. J. BARRATT, A. N. SMITH, J. HEWARD et al., 2001 Haplotype tagging for the identification of common disease genes. Nat. Genet. 29: 233–237.[CrossRef][Medline]
JORDE, L. B., 1995 Linkage diseqilibrium as a gene-mapping tool. Am. J. Hum. Genet. 56: 11–14.[Medline]
JORDE, L. B., 2000 Linkage disequilibrium and the search for complex disease genes. Genome Res. 10: 1435–1444.
JUNG, M., A. CHING, D. BHATTRAMAKKI, M. DOLAN, S. TINGEY et al., 2004 Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm. Theor. Appl. Genet. 109: 681–689.[CrossRef][Medline]
KANAZIN, V., H. TALBERT, D. SEE, P. DECAMP, E. NEVO et al., 2002 Discovery and assay of single-nucleotide polymorphisms in barley (Hordeum vulgare). Plant. Mol. Biol. 48: 529–537.[CrossRef][Medline]
KIM, S., K. ZHAO, R. JIANG, J. MOLITOR, J. O. BOREVITZ et al., 2006 Association mapping with single-feature polymorphisms. Genetics 173: 1125–1133.
KOLKMAN, J. M., M. B. SLABAUGH, J. M. BRUNIARD, S. T. BERRY, S. B. BUSHMAN et al., 2004 Acetohydroxyacid synthase mutations conferring resistance to imidazolinone or sulfonylurea herbicides in wild sunflower biotypes. Theor. Appl. Genet. 109: 1147–1159.[CrossRef][Medline]
LINDBLAD-TOH K., E. WINCHESTER, M. DALY, D. WANG, J. N. HIRSCHHORN et al., 2000 Large-scale discovery and genotyping of single-nucleotide polymorphisms in the mouse. Nat. Genet. 24: 381–386.[CrossRef][Medline]
LIU, A., and J. M. BURKE, 2006 Patterns of nucleotide diversity in wild and cultivated sunflower. Genetics 173: 321–330.
LIU, K., M. GOODMAN, S. MUSE, J. S. SMITH, E. BUCKLER et al., 2003 Genetic structure and diversity among maize inbred lines as inferred from DNA microsatellites. Genetics 165: 2117–2128.
MCGINNIS, S., and T. L. MADDEN, 2004 BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res. 32: W20–W25.
MURRAY, M. G., and W. R. THOMPSON, 1980 Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 8: 4321–4325.
NEI, M., 1987 Molecular Evolutionary Genetics. Columbia University Press, New York.
NORDBORG, M., and S. TAVARE, 2002 Linkage disequilibrium: what history has to tell us. Trends Genet. 18: 83–90.[CrossRef][Medline]
NORDBORG, M., J. O. BOREVITZ, J. BERGELSON, C. C. BERRY, J. CHORY et al., 2002 The extent of linkage disequilibrium in Arabidopsis thaliana. Nature Genet. 30: 190–193.[CrossRef][Medline]
RAFALSKI, A, 2002a Applications of single nucleotide polymorphisms in crop genetics. Curr. Opinion Plant Biol. 5: 94–100.[CrossRef][Medline]
RAFALSKI, A., 2002b Novel genetic mapping tools in plants: SNPs and LD-based approaches. Plant Sci. 162: 329–333.
RAFALSKI, A., and M. MORGANTE, 2004 Corn and humans: recombination and linkage disequilibrium in two genomes of similar size. Trends Genet. 20: 103–111.[CrossRef][Medline]
REIF, J. C., A. E. MELCHINGER, X. C. XIA, M. L. WARBURTON, D. A. HOISINGTON et al., 2003 Use of SSRs for establishing heterotic groups in subtropical maize. Theor. Appl. Genet. 107: 947–957.[CrossRef][Medline]
REMINGTON, D. L., J. M. THORNSBERRY, Y. MATSUOKA, L. M. WILSON, S. R. WHITT et al., 2001 Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc. Natl. Acad. Sci. USA 98: 11479–11484.
RISCH, N. J., 2000 Searching for genetic determinants for the new millenium. Nature 405: 847–856.[CrossRef][Medline]
ROZAS, J., and R. ROZAS, 1999 DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15: 174–175.
ROZAS, J., J. C. SÁNCHEZ-DELBARRIO, X. MESSEGYER and R. ROZAS, 2003 DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19: 2496–2497.
SCHUPPERT, G. F., S. TANG, M. B. SLABAUGH and S. J. KNAPP, 2006 The sunflower high-oleic mutant Ol carries variable tandem repeats of FAD2–1, a seed-specific oleoyl-phosphatidyl choline desaturase. Mol. Breeding 17: 241–256.[CrossRef]
SHIFMAN, S., J. KUYPERS, M. KOKORIS, B. YAKIR and A. DARVASI, 2003 Linkage disequilibrium patterns of the human genome across populations. Hum. Mol. Genet. 12: 771–776.
SLABAUGH, M. B., J. K. YU, S. TANG, A. HEESACKER, X. HU et al., 2003 Haplotyping and mapping a large cluster of downy mildew resistance gene candidates in sunflower using multilocus intron fragment length polymorphisms. Plant Biotechnol. J. 1: 167–185.[CrossRef][Medline]
STUMPF, M. P., 2002 Haplotype diversity and the block structure of linkage disequilibrium. Trends Genet. 18: 226–228.[CrossRef][Medline]
SYVANEN, A. C., 2001 Accessing genetic variation: genotyping single nucleotide polymorphisms. Nat. Rev. Genet. 2: 930–942.[CrossRef][Medline]
SYVANEN, A. C., 2005 Toward genome-wide SNP genotyping. Nat. Genet. 37: S5–S10.[CrossRef][Medline]
TANG, S., and S. J. KNAPP, 2003 Microsatellites uncover extraordinary diversity in native American landraces and wild populations of cultivated sunflower. Theor. Appl. Genet. 106: 990–1003.[Medline]
TANG, S., J.-K. YU, M. B. SLABAUGH, D. K. SHINTANI and S. J. KNAPP, 2002 Simple sequence repeat map of the sunflower genome. Theor. Appl. Genet. 105: 1124–1136.[CrossRef][Medline]
TANG, S., A. LEON, W. C. BRIDGES and S. J. KNAPP, 2006a Quantitative trait loci for genetically correlated seed traits are tightly linked to branching and pericarp pigment loci in sunflower. Crop Sci. 46: 721–734.
TANG, S., C. HASS and S. J. KNAPP, 2006b Ty3/gypsy-like retrotransposon knockout of a 2-methyl-6-phytyl-1,4-benzoquinone methyltransferase is non-lethal, unmasks a cryptic paralogous mutation, and produces novel tocopherol (vitamin E) profiles in sunflower. Theor. Appl. Genet. 113: 783–799.[CrossRef][Medline]
TARAMINO, G., and S. TINGEY, 1996 Simple sequence repeats for germplasm analysis and mapping in maize. Genome 39: 277–287.[Medline]
TENAILLON, M., M. C. SAWKINS, A. D. LONG, R. L. GAUT, J. F. DOEBLEY et al., 2001 Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp. mays L.). Proc. Natl. Acad. Sci. USA 98: 9161–9166.
TENAILLON, M. I., M. C. SAWKINS, L. K. ANDERSON, S. M. STACK, J. DOEBLEY et al., 2002 Patterns of diversity and recombination along chromosome 1 of maize (Zea mays ssp. mays L.). Genetics 162: 1401–1413.
VAN, K., E. Y. HWANG, M. Y. KIM, H. J. PARK, S. H. LEE et al., 2005 Discovery of SNPs in soybean genotypes frequently used as the parents of mapping populations in the United States and Korea. J. Hered. 96: 529–535.
WALL, J. D., 1999 Recombination and the power of statistical tests of neutrality. Genet. Res. 74: 65–79.[CrossRef]
WATTERSON, G. A., 1975 On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7: 256–276.[CrossRef][Medline]
WEIGEL, D., and M. NORDBORG, 2005 Natural variation in Arabidopsis. How do we find the causal genes? Plant Physiol. 138: 567–568.
WEIR, B. S., 1996 Genetic Data Analysis II. Sinauer, Sunderland, MA.
WERNER, J. D., J. O. BOREVITZ, N. H. UHLENHAUT, J. R. ECKER, J. CHORY et al., 2005 FRIGIDA-independent variation in flowering time of natural A. thaliana accessions. Genetics 170: 1197–1207.