- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Zhu, Y. L.
- Articles by Cregan, P. B.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Zhu, Y. L.
- Articles by Cregan, P. B.
Single-Nucleotide Polymorphisms in Soybean
Y. L. Zhua,d, Q. J. Songa,e, D. L. Hytena, C. P. Van Tassellb, L. K. Matukumallia,f, D. R. Grimm1,a, S. M. Hyatta, E. W. Fickusa, N. D. Youngc, and P. B. Creganaa Soybean Genomics and Improvement Laboratory, U.S. Department of Agriculture-Agricultural Research Service, Beltsville, Maryland 20705,
b Animal Improvement Programs Laboratory and Gene Evaluation and Mapping Laboratory, U.S. Department of Agriculture-Agricultural Research Service, Beltsville, Maryland 20705,
c Department of Plant Pathology, University of Minnesota, St. Paul, Minnesota 55108,
d Department of Bioscience and Biotechnology, Nanchang University, Nanchang 330047, People's Republic of China,
e Agronomy Department, Nanjing Agricultural University, Nanjing 210095, People's Republic of China
f Bioinformatics and Computational Biology, SCS, George Mason University, Fairfax, Virginia 22030
Corresponding author: P. B. Cregan, Bldg. 006, Rm. 100, USDA, ARS, BARC-West, Beltsville, MD 20705., creganp{at}ba.ars.usda.gov (E-mail)
Communicating editor: A. H. D. BROWN
| ABSTRACT |
|---|
Single-nucleotide polymorphisms (SNPs) provide an abundant source of DNA polymorphisms in a number of eukaryotic species. Information on the frequency, nature, and distribution of SNPs in plant genomes is limited. Thus, our objectives were (1) to determine SNP frequency in coding and noncoding soybean (Glycine max L. Merr.) DNA sequence amplified from genomic DNA using PCR primers designed to complete genes, cDNAs, and random genomic sequence; (2) to characterize haplotype variation in these sequences; and (3) to provide initial estimates of linkage disequilibrium (LD) in soybean. Approximately 28.7 kbp of coding sequence, 37.9 kbp of noncoding perigenic DNA, and 9.7 kbp of random noncoding genomic DNA were sequenced in each of 25 diverse soybean genotypes. Over the >76 kbp, mean nucleotide diversity expressed as Watterson's
was 0.00097. Nucleotide diversity was 0.00053 and 0.00111 in coding and in noncoding perigenic DNA, respectively, lower than estimates in the autogamous model species Arabidopsis thaliana. Haplotype analysis of SNP-containing fragments revealed a deficiency of haplotypes vs. the number that would be anticipated at linkage equilibrium. In 49 fragments with three or more SNPs, five haplotypes were present in one fragment while four or less were present in the remaining 48, thereby supporting the suggestion of relatively limited genetic variation in cultivated soybean. Squared allele-frequency correlations (r2) among haplotypes at 54 loci with two or more SNPs indicated low genome-wide LD. The low level of LD and the limited haplotype diversity suggested that the genome of any given soybean accession is a mosaic of three or four haplotypes. To facilitate SNP discovery and the development of a transcript map, subsets of four to six diverse genotypes, whose sequence analysis would permit the discovery of at least 75% of all SNPs present in the 25 genotypes as well as 90% of the common (frequency >0.10) SNPs, were identified.
SINGLE DNA base differences between homologous DNA fragments plus small insertions and deletions (indels), collectively referred to as single-nucleotide polymorphisms (SNPs), have been shown to be the most abundant source of DNA polymorphisms in humans (![]()
![]()
![]()
![]()
![]()
![]()
The frequency and nature of SNPs in plants is beginning to receive considerable attention. A number of reports in Arabidopsis thaliana (L.) Heynh. and maize (Zea mays ssp. mays L.) have provided estimates of sequence diversity in these species. In soybean (Glycine max L. Merr.), which is an autogamous species, the analysis of DNA sequence variation has been mainly confined to single genes or DNA fragments with the goal of defining gene structure, function, or evolutionary relationships. ![]()
![]()
![]()
![]()
(![]()
(![]()
= 0.00085 (![]()
= 0.015 (![]()
As a consequence of linkage disequilibrium (LD), reduced genetic variability in the form of limited haplotype diversity is a frequent result. SNPs are a useful tool to quantify LD and the analysis of SNP haplotypes has been the focus of recent studies. ![]()
0.10). Generally consistent results were reported by ![]()
![]()
In plants there are limited data relating to genome-wide haplotype diversity, but haplotype structure can be inferred from analyses of the decline of LD. In maize, ![]()
![]()
![]()
250 kbp (
1 cM). Studies of haplotype diversity in soybean are limited. ![]()
In this report we assess the frequency of SNPs and SNP haplotype diversity in 143 DNA fragments. These fragments were derived from coding and noncoding DNA associated with the coding regions, as well as random genomic DNA of soybean, based upon sequence analysis of a group of 25 selected soybean genotypes representative of the genetic base of North American soybean. The rationale for making these determinations was (1) to permit a comparison of SNP frequency and haplotype diversity with other plant and animal species, (2) to provide a preliminary estimate of linkage disequilibrium in soybean, and (3) to develop a strategy for SNP discovery aimed at the development of a SNP-based soybean linkage map.
| MATERIALS AND METHODS |
|---|
Soybean plant material and DNA isolation:
Ancestors of North American soybean cultivars:
On the basis of pedigree analysis, ![]()
![]()
|
DNA isolation:
DNA was extracted from bulked leaf tissue of 3050 plants of each of the soybean genotypes by the method described by ![]()
Selection and testing of PCR primers:
From full-length genes:
A total of 90 full-length soybean genes were selected from GenBank to represent a range of functions (Table 2). Primers were designed using OLIGO primer design software (National Biolabs, St. Paul) with the goal of amplifying fragments of
500600 bp in length containing approximately equal amounts of coding and noncoding DNA. In several instances two sets of primers were selected to genes to obtain additional sequence data.
|
From cDNAs: A total of 88 soybean cDNAs were selected from GenBank (Table 2). Sequences including the poly(A) tail were preferentially selected so primers could be designed as close to the 3'-end of the transcript as possible. Primers were designed as described above with predicted amplicon lengths of 300500 bp. The rationale for the shorter predicted amplicon length was based upon the likelihood of an intervening intron(s). Additional information relating to primer sequences, fragment lengths, numbers of bases of coding and noncoding sequences, as well as the presence of SNPs in amplicons derived using PCR primers designed to complete both genes and cDNAs can be found at http://www.genetics.org/supplemental/ as Table S2.
From sequences containing mapped simple sequence repeats:
The development of a large number of soybean simple sequence repeat (SSR) loci (![]()
From bacterial artificial chromosome subclones:
The simple SSR marker BARC-Satt309 is closely linked to the rhg1 locus, which is reported to be the most important gene conditioning resistance to the soybean cyst nematode. Primers to BARC-Satt309 were used to identify a bacterial artificial chromosome (BAC) clone, UMN-K4, as previously described by ![]()
![]()
BLAST search of SSR flanking regions and BAC subclones: Each SSR-containing sequence and each BAC subclone were analyzed using BLASTN and TBLASTX against the nonredundant and the expressed sequence tag [EST (est)] databases to determine if any portions of the BAC subclones or SSR flanking sequence were coding regions.
Initial examination of PCR primers: All PCR primers were used to amplify genomic DNA of one or two soybean genotypes. In most cases the cultivar Lincoln was used, but in some instances either Minsoy or Noir 1 DNA was used as template. Amplification reactions used standard PCR reagents including 30 ng of genomic DNA template, 1.5 mM Mg2+, 0.15 µM of 3'- and 5'-end primers, 100 µM of each nucleotide, 1x PCR buffer (10 mM Tris-HCL pH 8.3, 50 mM KCl), and 2 units of Taq DNA polymerase in a total volume of 50 µl. PCR cycling conditions were as follows: 45 sec denaturation at 92°, 45 sec annealing at 50° (or higher depending upon optimal annealing indicated by OLIGO), and 45 sec extension at 68° for 32 cycles on a MJ Tetrad thermocycler (MJ Research, Watertown, MA). The products were analyzed on a 1.5% agarose gel stained with ethidium bromide. Those primer sets that produced what appeared to be a single product were selected for further testing. Those that produced no products or multiple products were further examined using lower annealing temperatures or higher Mg2+ (those giving no products) or higher annealing temperature or lower Mg2+ (those giving multiple products). After these analyses, the amplicons from those primer sets producing what appeared to be single amplicons were selected for sequence analysis.
Sequence analysis of PCR products amplified from genomic DNA:
After the initial determination that a set of PCR primers appeared to produce a single amplicon from genomic DNA, the PCR product was directly sequenced using one of the PCR primers with BigDye Terminator cycle sequencing as described above. The results of this sequence analysis determined if the PCR product was derived from a single locus or if it was the result of amplification from two or more homeologous regions. In those cases in which the sequence traces appeared to be derived from a single locus, analysis with AutoAssembler software (Perkin-Elmer, Applied Biosystems) was used to detect ambiguous base calls that appeared as "heterozygotes" that would not be anticipated from the sequence analysis of a homozygous soybean genotype. In most instances, such "heterozygotes" would indicate the presence of two or more paralogues.
Those primer sets that produced a single amplicon suitable for sequencing were used to amplify genomic DNA from each of the remaining 24 genotypes listed in Table 1. The sequence of each of these products was determined as described above. When necessary, products were sequenced from both ends to assure accurate sequence determination.
Single-nucleotide polymorphism discovery:
The sequence data from each amplicon were analyzed with PolyBayes SNP detection software (![]()
0.99) were resequenced and reanalyzed with PolyBayes. In no case was any type of tandem repeat variant considered to be a SNP.
Statistical analyses:
Nucleotide diversity (
and
):
Nucleotide diversity was estimated as
, the number of segregating sites (![]()
) as per ![]()
, the mean pairwise differences, and its standard deviation S(
) (![]()
and
for synonymous (silent nucleotide substitutions) and nonsynonymous (replacement) sites, the numbers of synonymous and nonsynonymous sites were calculated using DNASP sequence polymorphism software version 3.5 (![]()
Tajima's D:
![]()
Gene diversity:
Gene diversity (![]()
P2ij, where Pij is the frequency of the jth allele for ith locus summed across all alleles in the locus. In the case of the SNPs reported here, there were only two alleles at a locus.
Distribution of SNPs in coding and noncoding DNA:
To determine if SNPs were evenly distributed in the fragments assayed, theoretical SNP cumulative frequency distributions were calculated for SNPs in both coding and noncoding DNA on the basis of the assumption of uniform distribution. In the case of SNPs in coding regions, the cumulative frequency distribution of coding SNPs (cSNPs) was calculated assuming a uniform distribution of SNPs in these fragments. This distribution was compared with the actual cumulative frequency distribution in these fragments using a Kolmogorov-Smirnov (KS) test (![]()
SNP haplotype frequencies in sequenced fragments:
The number of haplotypes among the 25 genotypes in fragments containing two or more SNPs was determined by visual inspection. A permutation algorithm based on ![]()
LD in introductions from the Far East: For purposes of calculating LD only genotypes reported to be direct introductions from the Far East were analyzed. This was done to eliminate genotypes that would be anticipated to have reduced LD as a result of hybridization and subsequent recombination. LD was analyzed on the following three subsets of data using haplotypes determined from fragments containing two or more SNPs:
- Subclones derived from BAC clone UMN-K4: The haplotypes of BAC subclones were used in the calculation of squared allele-frequency correlations (r2;
WEIR 1996 ) with the multiple-allele option of Tassel 0.2 (http://statgen.ncsu.edu/buckler/software/TASSEL/TASSEL.htm;
REMINGTON et al. 2001 ). The significance of r2 (P < 0.05) was determined via permutation analysis using 1000 permutations.
- Loci on soybean linkage group G: The SNP haplotypes of SSR flanking regions on soybean linkage group G (
CREGAN et al. 1999A ) were used in the calculation of squared allele-frequency correlations as described above. The r2 values were plotted against the known genetic distances between loci to examine the relationship of genetic distance and LD.
- Remaining loci with undefined genome positions (genome-wide LD): Twelve of the 65 fragments that contained two or more SNPs were located in proximity to each other on linkage group G. Because of this known linkage relationship, only one of these fragments (flanking regions of SSR locus BARC-Satt309) was included in the analysis of genome-wide LD. Squared allele-frequency correlations were calculated as described above using haplotypes of the 54 remaining loci.
| RESULTS |
|---|
Nature and frequency of SNPs in soybean:
Sequence data for 25 soybean genotypes were obtained from fragments amplified using PCR primers derived from 66 complete GenBank genes and 50 cDNAs. In addition to the genes and cDNAs, sequence data were obtained from 13 BAC subclones and 15 SSR flanking regions. The BLASTN analysis indicated that one of the BAC subclones was homologous to a G. max aspartokinase-homoserine dehydrogenase gene (GenBank accession no.
AF049708). The remaining BAC and SSR flanking sequences were tentatively classified as noncoding. In total,
28.7 kbp of coding sequence, 9.3 kbp of 5'- and 3'-UTR, 22.9 kbp of intron, and 5.8 kbp perigenic genomic sequence as well as 9.7 kbp of random noncoding genomic sequence data were obtained for each of the 25 genotypes (Table 3). A total of 280 SNPs including 233 single-base changes and 47 indels were identified in 143 amplicons totaling
76.3 kbp of sequence. The mean frequency of the least common allele at the 280 loci was 0.23 with a mean gene diversity of 0.35. The distribution of the allele frequencies of the least common allele are shown in Table 4. A total of 212 SNPs (76%) could be considered common (frequency >0.10). Of the 233 single-base changes, transitions accounted for 112 (48%) and transversions for 121 (52%). ![]()
![]()
![]()
|
|
The mean nucleotide diversity (
) in the 76.3 kbp of sequence analyzed was 0.00097 (Table 3). The estimate of nucleotide diversity in coding DNA (
= 0.00053) was less than half that in noncoding sequence associated with genes (
= 0.00111). Nucleotide diversities in the UTRs, introns, and genomic sequence adjacent to genes were similar, ranging from
= 0.000870.00126. In random noncoding genomic sequence from BAC clones and SSR flanking regions nucleotide diversity of
= 0.00179 was numerically, but nonsignificantly, higher than that of the genomic DNA associated with genes.
Tajima's D was determined across loci in the various functional regions of genes and in genomic DNA (Table 3) to provide information on population structure such as genetic bottlenecks and expanding population size that would be anticipated to affect the entire genome rather than specific genes. Tajima's D values were generally positive although none was significant. D was significant at P < 0.10 in the 9.7 kbp of random noncoding genomic sequence derived from BAC clones and SSR flanking regions.
Polymorphisms in coding regions:
In the 28.7 kbp of coding sequence analyzed, the 57 cSNPs included 51 single-base changes and six indels. Of the 51 single-base changes, 13, 8, and 30 were detected in the first, second, and third codon positions, respectively. A total of 25 were synonymous (no alteration in amino acid) while 26 were nonsynonymous or replacement SNPs that included a single-base change in the third position in the start codon of the glycinin gene (GenBank accession no.
X52863) of PI 88788 that had been reported previously by ![]()
GAC (Gly Tyr
Asp). The remaining two indels, one insertion of a codon CCA and one deletion of two codons CGACCA, were found in GenBank accession nos.
X63198 and
M13759, respectively.
The DNASP analysis indicated that of the 28.7 kbp of coding sequence 22.1 kbp (77%) were nonsynonymous. Thus, about three-quarters of randomly occurring single-base changes in coding DNA would be anticipated to result in an amino acid alteration. However, of the 57 cSNPs, 32 were nonsynonymous, which included 26 single-base changes and 6 indels, while 25 were synonymous. The nucleotide diversity of synonymous changes,
= 0.00100, while not significantly greater than that of nonsynonymous changes,
= 0.00038, was 2.6-fold greater. The higher frequency of synonymous cSNPs suggests selection against mutations that result in an amino acid replacement.
Failure to obtain sequence data from genes and cDNAs:
High-quality sequence data were obtained from 65 of the 90 complete genes to which PCR primers were designed. The failure to obtain data from the remaining 25 resulted from either the failure of primers to amplify (4 cases) or the amplification of two or more products as determined via agarose gel electrophoresis (3 cases). The failure to obtain data from the remaining 18 genes was the result of sequence analyses that indicated heterogeneous template, as would be anticipated if members of a gene family or homeologous loci were amplified. In the case of cDNAs, high-quality sequence data for all 25 genotypes were obtained for only 50 of the 88 cDNAs for which primers were designed. The failure to obtain data from the remaining 38 was the result of failure to amplify a product (5 cases), the amplification of multiple products as determined by agarose gel electrophoresis (12 cases), and poor quality sequence data from what appeared to be a single PCR product on agarose (21 cases). As with the complete genes, this latter outcome generally appeared to result from multiple sequencing templates.
Heterogeneity of nucleotide diversity among DNA fragments:
The average length of the 143 amplicons analyzed for the presence of SNPs was 534 bp with a mean of 1.95 SNPs per amplicon. There was no sequence variation in 47 of the fragments while only 1 SNP was discovered in 30 of the 143 (Table 5), suggesting an uneven distribution of sequence variation in this sample of amplicons. The Kolmogorov-Smirnov test was done to compare the observed cumulative frequency distributions of SNPs in fragments with the theoretical distributions on the basis of the assumption of mutations being evenly distributed across the 143 fragments. In both coding and noncoding sequences, the observed and theoretical frequency distributions were determined to be significantly different (P < 0.01), indicating that there was heterogeneity in the nucleotide diversity of both the coding and noncoding DNA fragments included in this study.
|
SNP haplotypes and haplotype frequency:
The number of SNP haplotypes present among the 25 soybean genotypes was determined in each of the 66 fragments that contained two or more SNPs (Table 5). Gene diversity based upon haplotypes was 0.52. In only one case were more than four haplotypes observed among the 25 genotypes. In this instance five haplotypes were found. The permutation analyses of the 49 fragments with three or more SNPs indicated that 44 of the 49 fragments had an empirical probability of the limited number of haplotypes observed of
0.001. A total of 30 of the 49 fragments never had a single permutation randomly generated with as few haplotypes present as that observed in the original data. In the five fragments where probability did not exceed 0.001, allele frequencies at one or more SNP loci were maximally asymmetric. The analysis indicated a shortage of haplotypes in relation to the number that would be anticipated at linkage equilibrium.
LD in introductions from the Far East:
Among subclones of BAC UMN-K4:
Squared allele-frequency correlations (r2) were calculated among each of the seven subclones with two or more SNPs discovered in 16 genotypes that were direct introductions from Asia (Table 1). The ancestral cultivar Illini was eliminated from the analysis because it was determined to be identical at all SNP loci to A.K. (Harrow). Both Illini and A.K. (Harrow) were selected from the older cultivar A.K. and were anticipated to be similar. The mean r2 value for the 21 estimates of LD was 0.36 and 18 of the 21 estimates of r2 indicated significant LD (P < 0.05). The positions of the subclones in the 110-kbp BAC clone UMN-K4 BAC are not known; however, a simulation analysis using 1000 permutations indicated that seven 550-bp fragments drawn at random from a 110-kb BAC would span a region of at least 53.9 kbp (P > 0.95). Thus, significant r2 values among most subclones suggest that LD exists over a distance of
50 kbp in this region of the soybean genome.
Among loci on soybean linkage group G: The SNP haplotypes of flanking regions of seven SSR loci in soybean linkage group G were used to provide an initial estimate of the relationship of LD with genetic map distance. The loci cover a map distance of 12.5 cM. The mean r2 of the 21 pairwise estimates of LD was 0.14 and only four of the r2 values were significant (P < 0.05). The trend line developed from the plot of r2 against genetic map distance is presented in Fig 1. Although these data are limited, it appears that LD has significantly decayed at distances of 2.02.5 cM, which is roughly equivalent to 1.01.5 mbp.
|
Genome-wide LD: Squared allele-frequency correlations were calculated among the haplotypes of 54 loci with two or more SNPs. The mean r2 over all pairwise estimates was 0.091 with only 8.9% of the r2 values significant at P < 0.05. This result indicated a low level of genome-wide LD in the set of 16 soybean accessions used in the analysis.
A subset of genotypes with maximum SNP diversity:
If the 25 genotypes analyzed here are representative of North American cultivated soybean, the limited number of haplotypes suggested that SNP discovery might proceed with the sequence analysis of a relatively small selected set of genotypes. The three genotypes whose analysis would detect the largest proportion (71%) of all 280 SNPs and 83% of the 212 common SNPs discovered among all 25 genotypes were Peking, PI 209332, and Tokyo (Table 6). The addition of the fourth genotype, Noir 1, brought these figures to 78 and 91%, respectively. Two sets of 8 genotypes would permit the discovery of >90% of the total SNPs and 98% of the common SNPs. A total of 89% of the total and 99% of the common SNPs were polymorphic in the 14 soybean genotypes that contributed 80.5% of the allelic diversity present in North American soybeans. The 6 genotypes Minsoy, Noir 1, Archer, Peking, Evans, and PI 209332 represent the parents of recombinant inbred line (RIL) mapping populations available in our laboratory (University of Utah Minsoy x Noir 1 and Archer x Minsoy and University of Minnesota Evans x Peking and Evans x PI 209332). The SNPs detected in these 6 genotypes included 85% of the total and 93% of the common SNPs. Totals of 83 and 89% of the total and common SNPs could be mapped in at least one of the aforementioned RIL populations, respectively.
|
| DISCUSSION |
|---|
Nucleotide diversity:
The results of this survey provide the first extensive sampling of DNA sequence diversity in cultivated soybean. The initial estimate suggests that mean nucleotide diversity is much lower in soybean (
= 0.00097) than in the wild plant A. thaliana. Numerous reports of sequence variation in individual Arabidopsis genes (![]()
![]()
![]()
![]()
![]()
= 0.0096) 10-fold greater than that in soybean. This calculation was based on >14 kbp of sequence from each of 25 inbreds and exotic landraces. The level of sequence diversity in an inbreeding species is expected to be lower than that in an outcrossing species because of smaller effective population size (![]()
![]()
![]()
= 0.0017) and LEAFY (
= 0.0033; calculated from ![]()
![]()
The ratio of synonymous to nonsynonymous changes in soybean (2.6) was somewhat lower than the ratio of 4.8 reported in maize (![]()
![]()
![]()
A notable difference between sequence diversity in soybean vs. reports in humans was the relative levels of nucleotide diversity in coding and noncoding sequences. In the reports of ![]()
![]()
![]()
![]()
![]()
![]()
![]()
Limited haplotype diversity:
The small number of haplotypes observed in our data suggests that the genetic base of cultivated soybean is built upon a small group of progenitor genotypes. This may be the result of a small number of domestication events from the wild relative G. soja. Alternatively, the limited haplotype diversity observed here may be only the result of the narrow genetic base of North American soybean germplasm or of limited variability in G. soja. A number of reports have documented the small group of progenitor genotypes that form the genetic base of North American soybean germplasm (COMMITTEE ON GENETIC VULNERABILITY OF MAJOR CROPS 1972; ![]()
Linkage disequilibrium in soybean:
Our data indicated that over relatively short distances of perhaps 50 kbp there is little decay in LD in soybean. This conclusion is based upon limited data from a set of subclones derived from one BAC clone that is 110 kbp in length. This finding is in marked contrast to reports in maize indicating that LD, as estimated by r2, decayed to values <0.10 within 1500 bp (![]()
![]()
![]()
![]()
The second estimate of LD reported here was also based upon limited data derived from the SNPs discovered in seven SSR flanking regions on soybean linkage group G. These data are quite variable as evidenced by large deviations from the trend line developed from the plot of the squared allele-frequency correlations on genetic map distance (Fig 1). Nonetheless, LD as estimated by r2 decays to <0.10 at genetic map distances >2.5 cM. Recent work by ![]()
250 kbp. These authors found a generally similar level of LD decay across the Arabidopsis genome. Reports from other species have noted wide variation in LD decay in different genome regions. For example, highly variable rates of LD decline were observed among different maize genes (![]()
![]()
![]()
Our assessment of genome-wide LD used haplotypes at 54 loci that we assumed were distributed across the soybean genome. There was no reason to suggest that randomly selected genes and cDNAs would derive from only one or a few linkage groups. This analysis, like those of localized LD, used the subset of 16 Asian soybean introductions that were not derivatives of modern breeding programs and therefore artificial hybridization and recombination had not contributed to the dissipation of genome-wide LD. Among the 66 loci with two or more SNPs there was an average of 3.1 haplotypes. The lack of genome-wide LD coupled with the limited haplotype diversity suggests that the cultivated soybean genome is a mosaic of a limited number of haplotypes that may be the result of recombination among three or four ancestral haplotypes. Some natural outcrossing does occur in soybean despite its autogamous nature. The progeny of these rare outcrosses might have one or more distinctive features that would cause them to be the target of selection. Such cycles of outcrossing and selection could result in substantial recombination over a period of >3000 years since the estimated time of the domestication of the soybean, which probably took place in China during the Shang Dynasty (ca. 17001100 BC) or earlier (![]()
![]()
![]()
A soybean transcript map:
One of the objectives of this research was to develop a strategy for SNP discovery aimed at the development of a SNP-based soybean linkage map. The large amount of soybean EST sequence data is a resource that may be useful for in silico SNP discovery as well as for the design of sequence-tagged sites (STSs) for SNP discovery via resequencing. The mapping of these SNPs would create a transcript map with candidate genes to associate with quantitative trait loci. Information on nucleotide diversity, the rate of success with which STSs can be developed from EST and genomic sequence, and SNP distribution allow an estimate of the feasibility of creating such a map. Of the 178 primer sets designed to complete genes and cDNAs, 115 yielded a sequence-tagged site from which sequence data were obtained. To a great extent, the failure to convert primer sets into STSs was the result of the amplification of multiple sites. Previous reports of genome duplication in soybean suggest the occurrence of tetraploidization as well as other duplication events (![]()
![]()
A total of 216 SNPs were discovered in the 66,634 bases of DNA analyzed in the 116 gene-derived STSs (GenBank genes + cDNAs + 1 BAC subclone; Table 3) or a rate of 3.24 SNPs/kbp in these 25 soybean genotypes. The average length of the 116 gene fragments was 574 bp and at the rate of 3.24 SNPs/kbp, one would anticipate 1.86 SNPs per fragment. Under these circumstances, a rough estimate of the probability of finding at least one SNP in a 574-bp fragment is [1 - (0.99676)574] = 0.776 and one would anticipate 90 of the 116 STSs to contain at least 1 SNP. However, at least 1 SNP was discovered in only 74 of the gene or perigenic DNA fragments analyzed. The heterogeneous distribution of SNPs was detected by the KS tests and is a phenomenon that is not unique to soybean. Similar evidence of wide differences in nucleotide diversity of genes and gene fragments has been reported in maize (![]()
![]()
![]()
![]()
![]()
A strategy for SNP discovery in soybean:
While genome duplication and heterogeneity of nucleotide diversity across fragments will negatively impact the likelihood of successfully discovering sequence variation in a particular DNA fragment, the knowledge that nucleotide diversity is more than twofold greater in noncoding perigenic DNA than in coding sequence suggests that SNP discovery should focus on these noncoding regions. The increasing availability of 3'-UTR data in soybean will be useful in this regard. Another approach to maximizing the usefulness of the large soybean EST database is an intron prediction protocol being used in SNP discovery in cattle (Bos taurus; ![]()
10 kbp of random genomic sequence, the level of sequence diversity was higher, although nonsignificantly higher than that in the noncoding DNA associated with genes. This suggests that the BAC-end sequence will be a good source of data for SNP discovery as will SSR flanking regions.
The data reported here from a diverse set of soybean genotypes indicate the feasibility of large-scale SNP discovery in soybean and also provide guidance for such an effort. Discovery needs to focus, when possible, on primer design to noncoding sequence, where nucleotide diversity is expected to be greatest. Primer testing may be expedited by heteroduplex analysis of a homozygous genotype to eliminate those primer sets that amplify multiple (and heterogeneous) amplicons. Putative locus-specific primer sets can then be used to amplify genomic DNA of a pool of diverse genotypes followed by heteroduplex analysis to identify SNP-containing fragments. Heteroduplex analysis for SNP discovery using denaturing HPLC is well established (![]()
![]()
The identification of a small set of soybean genotypes in which sequence diversity is maximized will enhance the efficiency of SNP discovery. Sequence analysis of the six genotypes Minsoy, Noir 1, Archer, Evans, Peking, and PI 209332 (Table 6) is likely to identify most sequence variants in North American soybean germplasm. Likewise, most of these variants will segregate in at least one of the RIL mapping populations available in our laboratory. These readily available populations are well characterized and have well-developed molecular genetic maps. An important and unanswered question is the utility of SNPs discovered in North American germplasm in a wider range of cultivated and wild soybean germplasm. If North American genotypes represent a unique soybean subpopulation in terms of sequence and haplotype diversity, then the strategy suggested here for SNP discovery will need to be modified. If we are to successfully mine germplasm using the power of association analysis it is important to have at least a basic understanding of the variability of the target germplasm for which these analyses are intended.
| FOOTNOTES |
|---|
1 Present address: Abbott Laboratories, Abbott Park, IL 60064. ![]()
| ACKNOWLEDGMENTS |
|---|
The authors thank Joann Mudge for her assistance in the identification of BAC clone UMN-K4. The authors thank the United Soybean Board (USB grants 9222 and 1243) for support of this research.
Manuscript received May 7, 2002; Accepted for publication December 12, 2002.
| LITERATURE CITED |
|---|
AGUADE, M., 2001 Nucleotide sequence variation at two genes of the phenylpropanoid pathway, the FAH1 and F3H genes, in Arabidopsis thaliana.. Mol. Biol. Evol. 18:1-9.
CARDON, L. R. and J. I. BELL, 2001 Association study designs for complex diseases. Nat. Rev. Genet. 2:91-99.[Medline]
CARGILL, M., D. ALTSHULER, J. IRELAND, P. SKLAR, and K. ARDLIE et al., 1999 Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nat. Genet. 22:231-238.[Medline]
CHURCHILL, G. A. and R. W. DOERGE, 1994 Empirical threshold values for quantitative trait mapping. Genetics 138:963-971.[Abstract]
COLLINS, F. S., L. D. BROOKS, and A. CHARKRAVARTI, 1998 A DNA polymorphism discovery resource for research on human genetic variation. Genome Res. 8:1229-1231.
COMMITTEE ON GENETIC VULNERABILITY OF MAJOR CROPS, 1972 Genetic Vulnerability of Major Crops. National Academy of Science, Washington, DC.
COOPER, D. N., B. A. SMITH, H. J. COOKE, S. NIEMANN, and J. SCHMIDTKE, 1985 An estimate of unique DNA sequence heterozygosity in the human genome. Hum. Genet. 69:201-205.[Medline]
CORYELL, V. H., H. JESSEN, J. M. SCHUPP, D. WEBB, and P. KEIM, 1999 Allele-specific hybridization markers for soybean. Theor. Appl. Genet. 98:690-696.
CREGAN, P. B., T. JARVIK, A. L. BUSH, R. C. SHOEMAKER, and K. G. LARK et al., 1999a An integrated genetic linkage map of the soybean genome. Crop Sci. 39:1464-1490.
CREGAN, P. B., J. MUDGE, E. W. FICKUS, L. F. MAREK, and D. DANESH et al., 1999b Targeted isolation of simple sequence repeat markers through the use of bacterial artificial chromosomes. Theor. Appl. Genet. 98:919-928.
CREGAN, P. B., J. MUDGE, E. W. FICKUS, D. DANESH, and R. DENNY et al., 1999c Two simple sequence repeat markers to select for soybean cyst nematode resistance conditioned by the rhg1 locus. Theor. Appl. Genet. 99:811-818.
DELANNAY, X., D. M. RODGERS, and R. G. PALMER, 1983 Relative genetic contributions among ancestral lines to North American soybean cultivars. Crop Sci. 23:944-949.
GIBBONS, J. D., 1976 Nonparametric Methods for Quantitative Analysis, pp. 5677. Holt, Rinehart & Winston, New York.
GIZLICE, Z., T. E. CARTER, and J. W. BURTON, 1994 Genetic base for North American public soybean cultivars released between 1947 and 1988. Crop Sci. 34:1143-1151.
HALUSHKA, M. K., J. B. FAN, K. BENTLEY, L. HSIE, and N. SHEN et al., 1999 Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis. Nat. Genet. 22:239-247.[Medline]
HANFSTINGL, U., A. BERRY, E. A. KELLOGG, J. T. COSTA, and W. RUDIGER et al., 1994 Haplotypic divergence coupled with lack of diversity at the Arabidopsis thaliana alcohol dehydrogenase locus: Roles for both balancing and directional selection? Genetics 138:811-828.[Abstract]
HYMOWITZ, T., 1990 Soybeans: the success story, pp.159163 in Advances in New Crops, edited by J. JANICK and J. SIMON. Timber Press, Portland, OR.
JIN, L., P. A. UNDERHILL, P. J. OEFNER, and L. L. CAVALLI-SFORZA, 1995 Systematic search for polymorphisms in the human genome using denaturing high-performance liquid chromatography (DHPLC). Am. J. Hum. Genet. 57(Suppl.):A26.
KAWABE, A. and N. T. MIYASHITA, 1999 DNA variation in the basic chitinase locus (ChiB) region of the wild plant Arabidopsis thaliana.. Genetics 153:1445-1453.
KAWABE, A., K. YAMANE, and N. T. MIYASHITA, 2000 DNA polymorphism at the cytosolic phosphoglucose isomerase (PgiC) locus of the wild plant Arabidopsis thaliana.. Genetics 156:1339-1347.
KEIM, P., T. OLSON, and R. SHOEMAKER, 1988 A rapid protocol for isolating soybean DNA. Soybean Genet. Newsl. 15:150-152.
KRUGLYAK, L., 1997 The use of a genetic map of biallelic markers in linkage studies. Nat. Genet. 17:21-24.[Medline]
KUITTINEN, H. and M. AGUADE, 2000 Nucleotide variation at the CHALCONE ISOMERASE locus in Arabidopsis thaliana.. Genetics 155:863-872.
KWOK, P.-Y., Q. DENG, H. ZAKERI, and D. A. NICKERSON, 1996 Increasing the information content of STS-based genome maps: identifying polymorphisms in mapped STSs. Genomics 31:123-126.[Medline]
LINDBLAD-TOH, K., E. WINCHESTER, M. J. DALY, D. G. WANG, and J. N. HIRSCHHORN et al., 2000 Large-scale discovery and genotyping of single-nucleotide polymorphisms in the mouse. Nat. Genet. 24:381-385.[Medline]
MARTH, G. T., I. KORF, M. D. YANDELL, R. T. YEH, and Z. GU et al., 1999 A general approach to single-nucleotide polymorphism discovery. Nat. Genet. 23:452-456.[Medline]
MORIYAMA, E. N. and J. R. POWELL, 1996 Intraspecific nuclear DNA variation in Drosophila. Mol. Biol. Evol. 13:261-277.[Abstract]
NORDBORG, M., B. CHARLESWORTH, and D. CHARLESWORTH, 1996 Increased levels of polymorphism surrounding selectively maintained sites in highly selfing species. Proc. R. Soc. Lond. Ser. B 263:1033-1039.
NORDBORG, M., J. O. BOREVITZ, J. BERGELSON, C. C. BERRY, and J. CHORY et al., 2002 The extent of linkage disequilibrium in Arabidopsis thaliana.. Nat. Genet. 30:190-193.[Medline]
OLSEN, K. M., A. WOMACK, A. R. GARRETT, J. I. SUDDITH, and M. D. PURUGGANAN, 2002 Contrasting evolutionary forces in the Arabidopsis thaliana floral developmental pathway. Genetics 160:1641-1650.
PATIL, N., A. J. BERNO, D. A. HINDS, W. A. BARRETT, and J. M. DOSHI et al., 2001 Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21. Science 294:1669-1670.
POLLAK, E., 1987 On the theory of partially inbreeding finite populations. I. Partial selfing. Genetics 117:353-360.
PURUGGANAN, M. D. and J. I. SUDDITH, 1999 Molecular population genetics of floral homeotic loci: departures from the equilibrium-neutral model at the APETALA3 and PISTILLATA genes of Arabidopsis thaliana.. Genetics 151:839-848.
REICH, D. E., M. CARGILL, S. BOLK, J. IRELAND, and P. C. SABETI et al., 2001 Linkage disequilibrium in the human genome. Nature 411:199-204.[Medline]
REMINGTON, D. L, J. M. THORNSBERRY, Y. MATSUOKA, L. M. WILSON, and S. R. WHITT et al., 2001 Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc. Natl. Acad. Sci. USA 98:11479-11484.
ROZAS, J. and R. ROZAS, 1999 DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15:174-175.
SCALLON, B. J., C. D. DICKINSON, and N. C. NIELSEN, 1987 Characterization of a null-allele for the Gy4 glycinin gene from soybean. Mol. Gen. Genet. 208:107-113.
SHOEMAKER, R. C., K. POLZIN, J. LABATE, J. SPECHT, and E. C. BRUMMER et al., 1996 Genome duplication in soybean (Glycine subgenus soja). Genetics 144:329-338.[Abstract]
STEPHENS, J. C., J. A. SCHNEIDER, D. A. TANGUAY, J. CHOI, and T. ACHARYA et al., 2001 Haplotype variation and linkage disequilibrium in 313 human genes. Science 293:489-493.
STONE, R. T., W. M. GROSSE, E. CASAS, T. P. SMITH, and J. W. KEELE et al., 2002 Use of bovine EST data and human genomic sequences to map 100 gene-specific bovine markers. Mamm. Genome 13:211-215.[Medline]
TAILLON-MILLER, P., E. E. PIERNOT, and P.-Y. KWOK, 1999 Efficient approach to unique single-nucleotide polymorphism discovery. Genome Res. 9:499-505.
TAJIMA, F., 1983 Evolutionary relationship of DNA sequences in finite populations. Genetics 105:437-460.
TAJIMA, F., 1989 Statistical nethods for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595.
TENAILLON, M. I., M. C. SAWKINS, A. D. LONG, R. L. GAUT, and J. F. DOEBLEY et al., 2001 Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp. mays L.). Proc. Natl. Acad. Sci. USA 98:9161-9166.
WANG, D. G., J. B. FAN, C. J. SIAO, A. BERNO, and P. YOUNG et al., 1998 Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science 280:1077-1082.
WATTERSON, G. A., 1975 On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7:256-276.[Medline]
WEIR, B. S., 1990 Genetic Data Analysis Methods for Discrete Genetic Data. Sinauer Associates, Sunderland, MA.
WEIR, B. S., 1996 Genetic Data Analysis II. Sinauer Associates, Sunderland, MA.
XUE, Z. T., M. L. XU, W. SHEN, N. L. ZHUANG, and W. M. HU et al., 1992 Characterization of a Gy4 glycinin gene from soybean Glycine max cv. Forrest. Plant Mol. Biol. 18:897-908.
ZAKHAROVA, E. S., S. M. EPISHIN, and Y. P. VINETSKI, 1989 An attempt to elucidate the origin of cultivated soybean via comparison of nucleotide sequences encoding glycinin B4 polypeptide of cultivated soybean, Glycine max, and its presumed wild progenitor, Glycine soja.. Theor. Appl. Genet. 78:852-856.
ZHU, T., L. SHI, J. J. DOYLE, and P. KEIM, 1995 A single nuclear locus phylogeny of soybean based on DNA sequence. Theor. Appl. Genet. 90:991-999.
This article has been cited by other articles:
![]() |
D. L. Hyten, J. R. Smith, R. D. Frederick, M. L. Tucker, Q. Song, and P. B. Cregan Bulked Segregant Analysis Using the GoldenGate Assay to Locate the Rpp3 Locus that Confers Resistance to Soybean Rust in Soybean Crop Sci., January 1, 2009; 49(1): 265 - 271. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Kaczorowski, K.-S. Kim, B. W. Diers, and M. E. Hudson Microarray-Based Genetic Mapping Using Soybean Near-Isogenic Lines and Generation of SNP Markers in the Rag1 Aphid-Resistance Interval The Plant Genome, November 1, 2008; 1(2): 89 - 98. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Gaitan-Solis, I.-Y. Choi, C. Quigley, P. Cregan, and J. Tohme Single Nucleotide Polymorphisms in Common Bean: Their Discovery and Genotyping Using a Multiplex Detection System The Plant Genome, November 1, 2008; 1(2): 125 - 134. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Tian, L.-C. Luo, S. Ge, and Z.-Y. Zhang Clear Genetic Structure of Pinus kwangtungensis (Pinaceae) Revealed by a Plastid DNA Fragment with a Novel Minisatellite Ann. Bot., July 1, 2008; 102(1): 69 - 78. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Xia, Y. Tsubokura, M. Hoshi, M. Hanawa, C. Yano, K. Okamura, T. A. Ahmed, T. Anai, S. Watanabe, M. Hayashi, et al. An Integrated High-density Linkage Map of Soybean with RFLP, SSR, STS, and AFLP Markers Using A Single F2 Population DNA Res, January 11, 2008; (2008) dsm027v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. D. Beavis, F. D. Schilkey, and S. M. Baxter Translational Bioinformatics: At the Interface of Genomics and Quantitative Genetics Crop Sci., December 18, 2007; 47(Supplement_3): S-32 - S-43. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Yamasaki, S. I. Wright, and M. D. McMullen Genomic Screening for Artificial Selection during Domestication and Improvement in Maize Ann. Bot., October 1, 2007; 100(5): 967 - 973. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Arunyawat, W. Stephan, and T. Stadler Using Multilocus Sequence Data to Assess Population Structure, Natural Selection, and Linkage Disequilibrium in Wild Tomatoes Mol. Biol. Evol., October 1, 2007; 24(10): 2310 - 2322. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Smith Pedigree Background Changes in U.S. Hybrid Maize between 1980 and 2004 Crop Sci., September 1, 2007; 47(5): 1914 - 1926. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Kolkman, S. T. Berry, A. J. Leon, M. B. Slabaugh, S. Tang, W. Gao, D. K. Shintani, J. M. Burke, and S. J. Knapp Single Nucleotide Polymorphisms and Linkage Disequilibrium in Sunflower Genetics, September 1, 2007; 177(1): 457 - 468. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Missaoui, B. K. Ha, D. V. Phillips, and H.R. Boerma Single Nucleotide Polymorphism Detection of the Rcs3 Gene for Resistance to Frogeye Leaf Spot in Soybean Crop Sci., July 30, 2007; 47(4): 1681 - 1690. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-K. Ha, R. S. Hussey, and H. R. Boerma Development of SNP Assays for Marker-Assisted Selection of Two Southern Root-Knot Nematode Resistance QTL in Soybean Crop Sci., July 16, 2007; 47(S2): S-73 - S-82. [Abstract] [Full Text] [PDF] |
||||
![]() |
I.-Y. Choi, D. L. Hyten, L. K. Matukumalli, Q. Song, J. M. Chaky, C. V. Quigley, K. Chase, K. G. Lark, R. S. Reiter, M.-S. Yoon, et al. A Soybean Transcript Map: Gene Distribution, Haplotype and Single-Nucleotide Polymorphism Analysis Genetics, May 1, 2007; 176(1): 685 - 696. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Hyten, I.-Y. Choi, Q. Song, R. C. Shoemaker, R. L. Nelson, J. M. Costa, J. E. Specht, and P. B. Cregan Highly Variable Patterns of Linkage Disequilibrium in Multiple Soybean Populations Genetics, April 1, 2007; 175(4): 1937 - 1944. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Stracke, T. Presterl, N. Stein, D. Perovic, F. Ordon, and A. Graner Effects of Introgression and Recombination on Haplotype Structure and Linkage Disequilibrium Surrounding a Locus Encoding Bymovirus Resistance in Barley Genetics, February 1, 2007; 175(2): 805 - 817. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Heuertz, E. De Paoli, T. Kallman, H. Larsson, I. Jurman, M. Morgante, M. Lascoux, and N. Gyllenstrand Multilocus Patterns of Nucleotide Diversity, Linkage Disequilibrium and Demographic History of Norway Spruce [Picea abies (L.) Karst] Genetics, December 1, 2006; 174(4): 2095 - 2105. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Hyten, Q. Song, Y. Zhu, I.-Y. Choi, R. L. Nelson, J. M. Costa, J. E. Specht, R. C. Shoemaker, and P. B. Cregan Impacts of genetic bottlenecks on soybean genome diversity PNAS, November 7, 2006; 103(45): 16666 - 16671. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Simko, K. G. Haynes, and R. W. Jones Assessment of Linkage Disequilibrium in Potato Genome With Single Nucleotide Polymorphism Markers Genetics, August 1, 2006; 173(4): 2237 - 2245. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Liu and J. M. Burke Patterns of Nucleotide Diversity in Wild and Cultivated Sunflower Genetics, May 1, 2006; 173(1): 321 - 330. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Walling, R. Shoemaker, N. Young, J. Mudge, and S. Jackson Chromosome-Level Homeology in Paleopolyploid Soybean (Glycine max) Revealed Through Integration of Genetic and Chromosome Maps Genetics, March 1, 2006; 172(3): 1893 - 1900. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Breseghello and M. E. Sorrells Association Mapping of Kernel Size and Milling Quality in Wheat (Triticum aestivum L.) Cultivars Genetics, February 1, 2006; 172(2): 1165 - 1177. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Messina, J. W. Jones, K. J. Boote, and C. E. Vallejos A Gene-Based Model to Simulate Soybean Development and Yield Responses to Environment Crop Sci., January 24, 2006; 46(1): 456 - 466. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Shultz, D. Kurunam, K. Shopinski, M. J. Iqbal, S. Kazi, K. Zobrist, R. Bashir, S. Yaegashi, N. Lavu, A. J. Afzal, et al. The Soybean Genome Database (SoyGD): a browser for display of duplicated, polyploid, regions and sequence tagged sites on the integrated physical and genetic maps of Glycine max Nucleic Acids Res., January 1, 2006; 34(suppl_1): D758 - D765. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Vroh Bi, M. D. McMullen, H. Sanchez-Villeda, S. Schroeder, J. Gardiner, M. Polacco, C. Soderlund, R. Wing, Z. Fang, and E. H. Coe Jr. Single Nucleotide Polymorphisms and Insertion-Deletions for Genetic Markers and Anchoring the Maize Fingerprint Contig Physical Map Crop Sci., December 2, 2005; 46(1): 12 - 21. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. V. Krutovsky and D. B. Neale Nucleotide Diversity and Linkage Disequilibrium in Cold-Hardiness- and Wood Quality-Related Candidate Genes in Douglas Fir Genetics, December 1, 2005; 171(4): 2029 - 2041. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Yamasaki, M. I. Tenaillon, I. Vroh Bi, S. G. Schroeder, H. Sanchez-Villeda, J. F. Doebley, B. S. Gaut, and M. D. McMullen A Large-Scale Screen for Artificial Selection in Maize Identifies Candidate Agronomic Loci for Domestication and Crop Improvement PLANT CELL, November 1, 2005; 17(11): 2859 - 2872. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. D. Bilyeu and P. R. Beuselinck Genetic Divergence between North American Ancestral Soybean Lines and Introductions with Resistance to Soybean Cyst Nematode Revealed by Chloroplast Haplotype J. Hered., September 1, 2005; 96(5): 593 - 599. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Van, E.-Y. Hwang, M. Y. Kim, H. J. Park, S.-H. Lee, and P. B. Cregan Discovery of SNPs in Soybean Genotypes Frequently Used as the Parents of Mapping Populations in the United States and Korea J. Hered., September 1, 2005; 96(5): 529 - 535. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Isidore, B. Scherrer, B. Chalhoub, C. Feuillet, and B. Keller Ancient haplotypes resulting from extensive molecular rearrangements in the wheat A genome have been maintained in species of three different ploidy levels Genome Res., April 1, 2005; 15(4): 526 - 536. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ramirez, M. A. Graham, L. Blanco-Lopez, S. Silvente, A. Medrano-Soto, M. W. Blair, G. Hernandez, C. P. Vance, and M. Lara Sequencing and Analysis of Common Bean ESTs. Building a Foundation for Functional Genomics Plant Physiology, April 1, 2005; 137(4): 1211 - 1227. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. R. Brown, G. P. Gill, R. J. Kuntz, C. H. Langley, and D. B. Neale Nucleotide diversity and linkage disequilibrium in loblolly pine PNAS, October 19, 2004; 101(42): 15255 - 15260. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Sundstrom, M. T. Webster, and H. Ellegren Reduced Variation on the Chicken Z Chromosome Genetics, May 1, 2004; 167(1): 377 - 385. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Stacey, L. Vodkin, W. A. Parrott, and R. C. Shoemaker National Science Foundation-Sponsored Workshop Report. Draft Plan for Soybean Genomics Plant Physiology, May 1, 2004; 135(1): 59 - 70. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Zhu, Y. L.
- Articles by Cregan, P. B.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Zhu, Y. L.
- Articles by Cregan, P. B.












