| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 171, 2029-2041, December 2005, Copyright © 2005
doi:10.1534/genetics.105.044420
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
,2
* Institute of Forest Genetics, Pacific Southwest Research Station, U.S. Department of Agriculture Forest Service, Davis, California 95616 and
Department of Plant Sciences, University of California, Davis, California 95616
2 Corresponding author: Institute of Forest Genetics, Pacific Southwest Research Station, USDA Forest Service, Department of Plant Sciences, University of California, 1 Shields Ave., Davis, CA 95616.
E-mail: dbneale{at}ucdavis.edu
| ABSTRACT |
|---|
|
|
|---|
= 0.00655 ± 0.00082 on average, respectively. The nonsynonymous (replacement) nucleotide substitutions were almost five times less frequent than synonymous ones and substitutions in noncoding regions. LD decayed relatively slowly but steadily within genes. Haploblock analysis was used to define haplotype tag SNPs (htSNPs). These data will help to select SNPs for association mapping, which is already in progress.
Frost damage can negatively affect the annual growth of Douglas fir trees, particularly in the spring when new needle tissue is delicate and vulnerable. Fall frosts can damage actively elongating shoots in the autumn and adversely affect growth the following spring. Therefore, fall and spring cold hardiness are important adaptive traits in Douglas fir that show high genetic variation in common garden studies and vary among populations from environmentally diverse locations (reviewed in WHEELER et al. 2005).
Quantitative trait loci (QTL) mapping studies have confirmed these observations and have allowed us to begin dissecting these complex traits (JERMSTAD et al. 2001a,b, 2003; WHEELER et al. 2005). Several genomic regions responsible for genetic control of growth rhythm and cold-hardiness traits were found, but QTL mapping does not reveal which individual genes are responsible for these effects.
Association mapping is a powerful population genomic approach that unlike QTL mapping can identify individual genes and alleles that are responsible for phenotypic differences in adaptive traits (NEALE and SAVOLAINEN 2004). However, limited genetic resources and the large genome of Douglas fir prevent a full genome scan. Instead, we plan to carry out a candidate gene-based association mapping using single-nucleotide polymorphisms (SNPs) (REBBECK et al. 2004). SNPs are excellent markers for association mapping of genes controlling complex traits (e.g., BROOKES 1999; RAFALSKI 2002; CARLSON et al. 2004). However, to carry out association mapping it is necessary first to discover SNPs in candidate genes of interest and to study their variation. To achieve our goals we (1) developed a list of candidate genes for adaptive traits on the basis of data available from other plant species, (2) found their homologs or orthologs among Douglas fir genomic and EST sequences, (3) designed single-gene-specific primers to amplify single-gene PCR products, (4) sequenced them, (5) performed SNP discovery, (6) analyzed their diversity and LD, and (7) selected SNPs for association mapping. These steps are described in this article for a set of 18 genes that included late embryogenesis abundant protein genes, dehydrins, and other cold-induced and wood quality-related genes. The studied genes are mostly unlinked and represent a wide variety of protein-coding genes. Therefore, they are likely to reflect general genome variation in Douglas fir.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
500 genes and proteins that included cold acclimation, cold induced, cold resistant, chaperones, cryoprotectins, calmodulins, some dehydrins, LEA, and other cold-hardiness-related candidate genes and proteins (e.g., CLOSE 1997; PALVA and HEINO 1998; THOMASHOW 1998, 1999, 2001; WANNER and JUNTTILA 1999; SEKI et al. 2001, 2002; FOWLER and THOMASHOW 2002; NOGUEIRA et al. 2003; PROVART et al. 2003; RABBANI et al. 2003; BROWSE and LANGE 2004; COOK et al. 2004). Then, using BLASTX, BLASTN, and TBLASTX tools they were compared with all available Douglas fir sequences submitted to GenBank, including most of the
11,700 ESTs obtained recently from Douglas fir seedlings in our laboratory (http://staff.vbi.vt.edu/estap). The highly homologous Douglas fir sequences that matched sequences in our database of cold-resistance-related genes and proteins were used to design PCR primers for sequencing. For this study we preferably selected those genes that were also positional candidates that collocated with cold-hardiness QTL in a previous study (WHEELER et al. 2005). A number of candidate genes were previously mapped by RFLP analysis using cDNAs as hybridization probes (JERMSTAD et al. 1998). Most of the positional candidates were good expressional and functional candidates. For comparison, we also included three wood quality-related genes that were recently studied in loblolly pine, Pinus taeda (BROWN et al. 2004a). The details on the list of 18 candidate genes used in this study are presented in Table 1.
|
Nucleotide diversity analysis:
Haplotypes were directly inferred from sequencing PCR products amplified in haploid megagametophytes from the SNP discovery panel. Multiple sequence alignments were analyzed using the DNA sequence polymorphism (DNASP) software version 4.0 (ROZAS et al. 2003). Insertions and deletions (indels) were excluded from all estimates. Haplotype diversity (Hd) was computed using Equation 8.4 in NEI (1987), except that n was used instead of 2n. Nucleotide diversity was estimated by
W from the number of polymorphic segregating (S) sites (WATTERSON 1975, Equation 1.4a, but on a base pair basis; NEI 1987, Equation 10.3) and by
(NEI 1987, Equations 10.5 or 10.6, but on a per gene basis). Heterogeneity of
W among loci was assessed by using a likelihood-ratio test in which the probability of the observed number of segregating sites in a sample was calculated under the null hypothesis of a common, genomewide 4Neµ (Pg) and the alternative hypothesis of locus-specific values of 4Neµ (Pl), where an average for 18 genes
W was considered as a genomewide estimate. These probabilities were based on all (silent and nonsynonymous) segregating sites and were calculated for each gene by using the computer simulations that are implemented in DNASP. Simulations were based on the coalescent process for a neutral infinite-sites model and assumed a large constant population size (HUDSON 1991). Then, the likelihood-ratio test statistic 2 ln(Pl/Pg) was calculated for each gene. Under certain assumptions this statistic is distributed as a
2 with m 1 d.f., where m is equal to the number of loci (see also BROWN et al. 2004a).
Neutrality tests:
Neutrality test statistics D (TAJIMA 1989, Equation 38), D*, and F* (FU and LI 1993, pp. 700 and 702, respectively) were calculated and tested using 10,000 simulations to test the hypothesis that mutations in the gene are selectively neutral (KIMURA 1983). If a population sample fits the infinite-sites model,
and
W have equal expectations. TAJIMA (1989) developed the D-test statistic, which is
W divided by the standard deviation of this difference. The difference between
and
W (Tajima's D) reflects the degree of nonequilibrium conditions in the genetic history of the population. The D*-test statistic is based on the differences between the number of singletons (mutations appearing only once among the sequences) and the total number of mutations. The F*-test statistic is based on the differences between the number of singletons and the average number of nucleotide differences between pairs of sequences. Significantly negative values for these statistics are consistent with negative (purifying) selection and can also indicate a recent selective sweep of a linked mutation, whereas significantly positive values are consistent with positive, balancing, or diversifying selection for two or more alleles (KREITMAN 2000). To find regions under selection within genes the distributions of the D-, D*-, and F*-statistics were studied along the gene sequences using a sliding window with a window length and step size of 100 and 25 sites, respectively. Coalescence simulations without recombination were used to test deviations of the observed
- and
W-estimates from average values and the significance of the D-, D*-, and F*-statistics (HUDSON 1991).
The nonsynonymous (dN; amino acid replacing) to synonymous (dS; no amino acid replacing) substitution ratio is a strong indicator of selection (LI 1997). The dN/dS ratio measures the magnitude and direction of selective pressure on a gene sequence, with ratios = 1, <1, and >1 indicating neutral evolution, negative selection, and positive selection, respectively. The average number of potentially nonsynonymous and synonymous substitution sites, estimates of the number of nonsynonymous (dN) and synonymous (dS) substitutions per site, variance and *standard errors, and Z-test [
] for neutrality (dN = dS) were computed using the molecular evolutionary genetics analysis (MEGA) software version 3.0 (http://www.megasoftware.net; KUMAR et al. 2004) and the distance-based modified Nei-Gojobori method (NEI and GOJOBORI 1986) with the Jukes-Cantor model (JUKES and CANTOR 1969) and bootstrap based on 1000 replicates (NEI and KUMAR 2000).
Analysis of LD and haploblock structure within genes:
LD descriptive statistics r2 (HILL and ROBERTSON 1968) and D' (LEWONTIN 1964) were calculated using TASSEL (http://www.maizegenetics.net/bioinformatics/tasselindex.htm) and DNASP software. When more than two alleles were present at a locus, a weighted average of D' or r2 was calculated (FARNIR et al. 2000). If there were only two alleles at both loci, then a one-sided Fisher's exact test was calculated to determine the significance of LD. If there were more than two alleles, then permutations were used to calculate the proportion of permuted gamete distributions that are less probable than the observed gamete distribution under the null hypothesis of independence (WEIR 1996). Only parsimony informative sites were included in the analysis of the LD decay within genes over distance. LD between genes was analyzed using alleles with a frequency of
15% for all genes, except the ERD15-like gene, for which alleles were less frequent.
Selection of htSNPs for association mapping:
To select haplotype tag SNPs (htSNPs) for association mapping, within-gene haploblock structure and haplotype coverage were studied using HaploblockFinder (ZHANG and JIN 2003; http://cgi.uc.edu/
kzhang), SNPtagger (XIAYI and CARDON 2003; http://www.well.ox.ac.uk/
xiayi/haplotype/index.html), and SNPCherryPicker (HARRIS et al. 2003) software. These programs use different approaches and criteria to select htSNPs, and, therefore, we believe that they complement each other, and their combined use helps us select the consensus set of htSNPs.
Population structure:
Using MEGA a consensus neighbor-joining (NJ) tree (SAITOU and NEI 1987) was reconstructed for all 28 Douglas fir samples on the basis of the 1000 Jukes-Cantor pairwise distance matrices (JUKES and CANTOR 1969) calculated from the bootstrap-generated multiple-nucleotide alignments for all 18 genes combined. The FST statistic (WEIR 1996), which measures the genetic variance among populations divided by the total genetic variance of the entire population, was used to quantify the degree of genetic differentiation between population samples from the six regions included in the SNP discovery panel using the ARLEQUIN ver. 2.0 software (EXCOFFIER et al. 2004; http://lgb.unige.ch/arlequin). The analysis of molecular variance (AMOVA) approach implemented in Arlequin (EXCOFFIER et al. 1992) is essentially similar to other approaches based on analyses of variance of gene frequencies, but it takes into account the number of mutations between haplotypes. The sample differentiation was also tested, using an exact test based on haplotype frequencies (RAYMOND and ROUSSET 1995; GOUDET et al. 1996). The nearest-neighbor statistic (Snn) that measures how often the "nearest neighbors" (in sequence space) of sequences are from the same locality in geographic space was used to test for population differentiation among six regions and two states, from which samples were collected, as described in HUDSON (2000).
| RESULTS |
|---|
|
|
|---|
843 bp (Table 1). In total, 15,183 bp of genomic DNA for 18 genes or 441,664 bp considering all samples were sequenced. Indels were found in 12 sequences, with the average number of 2.7 indels per sequence and the average length of 14.8 bp per indel. However, if the TBE gene with numerous large indels is excluded from analysis, the average numbers are 1.9 indels per sequence and 4.2 bp per indel. The average numbers of exons and introns were 1.7 and 0.9 per sequence, respectively, for all 18 partially and completely sequenced genes, or 2.2 and 1.2 per gene for 6 genes that were sequenced completely. The exon and intron sizes varied greatly, with the average lengths of 281 and 246 bp for all genes, respectively, or 292 and 402 bp for 6 genes that were sequenced completely. However, if the TBE gene, which had uncommonly large introns, is excluded from analysis, then the average lengths of exons and introns based on the five completely sequenced genes become 258 and 246 bp, respectively, which are very similar to the values based on all 18 genes.
Nucleotide diversity:
Four hundred SNPs were found in 18 genes, or 1 SNP for every 46 bp (Table 2). With the exception of 6 trinucleotide SNPs, all segregating sites had only two alternative nucleotides. Almost one-third of SNPs were singletons, and most SNPs (349) were either synonymous or in noncoding regions. Haplotype diversity was very high with Hd = 0.827 ± 0.043 and an average number of 11 different haplotypes per gene (Table 3). The total nucleotide diversities were also relatively high with
= 0.00655 ± 0.00082 and
W = 0.00702 ± 0.00269 on average. The estimates of nucleotide diversity,
and
W, varied significantly across loci (P < 0.00006 in the heterogeneity test) with values as low as
= 0.00237 in the 4CL2 gene and
W = 0.00229 in the formin-like gene and almost six to seven times higher in the LEA-EMB11-like gene, where
= 0.01378 and
W = 0.01594. Coalescence simulations also showed that the EF1A, TBE, AT1, and LEA-EMB11-like genes had statistically higher than average values for haplotype number (h) and diversity (Hd), while the formin-like gene had the lowest values for h, Hd, and
W. In general,
and
W were similar with a tendency for
W to be slightly higher than
, apparently as a result of an excess of low-frequency SNPs (supplemental Figure 1S at http://www.genetics.org/supplemental/). Due to this, the neutrality test statistics tended to be negative (Table 3).
|
|
= 0.00210 vs.
= 0.01055 and
W = 0.00261 vs.
W = 0.01132 for nonsynonymous vs. silent substitutions, respectively.
|
The neutrality of sequence polymorphism was also assessed using the ratio of nonsynonymous (dN) to synonymous (dS) nucleotide substitutions. Six genes had a dN/dS ratio significantly <1 (supplemental Table 3S at http://www.genetics.org/supplemental/). Only the 4CL1 gene had dN/dS > 1, but it was not statistically significant.
LD, haploblock structure within genes, and selection of htSNPs for association mapping:
A considerable amount of LD was found within sequences. A total of 4349 pairwise comparisons were estimated for parsimony informative sites among pairs of sites within 18 genes. Almost one-third of them (1316) showed LD statistically significant by a Fisher's exact test, which remained significant for 326 pairs even after Bonferroni correction. Figure 2 shows LD estimates (r2) plotted against the pairwise distances between parsimony SNPs within all 18 genes. The LD declined linearly as distances between sites increased, but a fair amount of LD remained, even for pairs separated by >500 bp. There were a few significant LDs for tightly linked or even for unlinked genes (supplemental Figure 3S at http://www.genetics.org/supplemental/), but none of them remained significant after Bonferroni correction. Depending on the threshold values used to define a block, from 1 up to 58 haploblocks per gene were revealed (supplemental Table 4S at http://www.genetics.org/supplemental/). These thresholds included a minimal LD value, minimal frequency of the SNP allele to be included, minimal chromosome and haplotype coverage, and htSNP coverage. However, using reasonable thresholds, there were approximately
23 haploblocks per gene (except very long genes such as TBE) that could be genotyped with approximately four to five SNPs per gene on average.
|
| DISCUSSION |
|---|
|
|
|---|
Average nucleotide diversity in Douglas fir was higher than that in human and soybean, but lower than that in maize, and similar to that in Drosophila (Table 5). The similarity to Drosophila is not completely surprising given that both Douglas fir and Drosophila have large population sizes and high outcrossing rates. Compared with other conifers, Douglas fir has higher levels of diversity than do loblolly pine (BROWN et al. 2004a; NEALE and SAVOLAINEN 2004; S. C. GONZÁLEZ-MARTÍNEZ, E. ERSOZ, G. R. BROWN, N. C. WHEELER and D. B. NEALE, unpublished results), Scots pine (P. sylvestris) (DVORNYK et al. 2002; GARCÍA-GIL et al. 2003), and sugi (KADO et al. 2003). Potentially, a difference in mutation rates and/or in historic effective population sizes (Ne) between Douglas fir and pines could explain the difference in the level of nucleotide diversity observed in Douglas fir and loblolly pine. Unfortunately, we are unaware of any direct estimates of mutation rate at the nucleotide level in pines vs. Douglas fir. Indirect estimates that are inferred from observed nucleotide differentiation between closely related pine species are based on assumptions of the neutral model as well as on the rough assumptions of divergence time and Ne. These estimates can be highly biased and produce a circular argument. Unfortunately, paleobotanical data are also very incomplete and highly inconclusive. Therefore, there are no unambiguous data that would suggest that Douglas firs have maintained large Ne during the Holocene or Pleistocene, while pines have not. However, more importantly, our study, as well as other conifer studies cited above, revealed manyfold difference in estimates of
and
W between different genes, which highlights the problems of comparing variation among species when estimates are based on one or a few loci (e.g., DVORNYK et al. 2002; GARCÍA-GIL et al. 2003; INGVARSSON 2005). Comparisons among species should be either based on many loci or, ideally, restricted to orthologous loci.
|
- and
W-values were similar in this study. Both
- and
W-values estimate the equilibrium neutral parameter
= 4Neµ for autosomal loci, a central parameter in population genetic models for the balance between mutation and random genetic drift, where Ne is the effective population size and µ is the neutral mutation rate per nucleotide site. This parameter summarizes the rate at which processes of mutation and random genetic drift generate and maintain variation within a gene, assuming that natural selection has not been operating. Although the number of segregating sites does not represent all the information in the sample, under the neutral infinite-sites model the frequency spectrum of sites is determined by
, which in turn is estimated by S. Violations of the assumptions of the infinite-sites model will lead to biases in the estimate of
. The similarity of
- and
W-values shows that those violations were not significant. Nevertheless, negative values of the neutrality test statistics (Tajima's D, D*, and F*) and dN/dS < 1 in most studied genes suggested that they are mainly under negative selection or reflect a recent population expansion. However, it is difficult, if not impossible, to distinguish between population growth and selection, if only intraspecific polymorphism is studied. The frequency spectrum can be different for different genes, depending on the combined effect of many factors, such as mutation, population size, recombination rate, gene conversion, and selection intensity. Comparison of intraspecific and interspecific polymorphism in orthologous genes between closely related species can help to detect or confirm genes under selection (HUDSON et al. 1987; MCDONALD and KREITMAN 1991; KREITMAN 2000).
The rate of decay of LD with distance is a critical factor that affects the success of association mapping on the basis of SNPs in candidate genes. If LD affects large regions or genomic blocks, then association with phenotypic traits would be easier to detect, but it would be more difficult to assign it to the particular candidate gene or quantitative trait nucleotide (QTN). If LD decays quickly, then the associations found between a particular SNP and phenotypic trait would be more likely to be causative rather than due to linkage with other unknown genes. LD is a result of the interplay of many factors, such as mutation and recombination rates, mating system, selection, population size, structure, and history. The intragenic recombination that affects LD within genes was estimated in this study, but not presented and discussed here because we believe that the limited sample size was insufficient for its reliable estimation. The estimation of recombination requires that considerably larger segments of contiguous DNA be sequenced, and more data should be collected to fully address this problem. LD varies greatly in different species ranging from 2001500 bp in maize up to 50100 kb in Arabidopsis (see RAFALSKI and MORGANTE 2004, Table 2, for review). Our data indicate that LD decayed >50% over relatively short segments (from r2 =
0.25 to
0.10 within 2000 bp, Figure 2). These data confirmed recent studies in loblolly pine and suggest that conifers may have LD at the lower end of the spectrum (BROWN et al. 2004a; S. C. GONZÁLEZ-MARTÍNEZ, E. ERSOZ, G. R. BROWN, N. C. WHEELER and D. B. NEALE, unpublished results), making these species potentially very amenable for candidate gene vs. genomewide-based studies (NEALE and SAVOLAINEN 2004). Unlike candidate gene-based association studies the genomewide scans depend more on strong LD over long regions in the genome. However, it should be noted that this study was not specifically designed to address LD in the genome, but rather within genes, and many distal pairwise comparisons are underrepresented because studied sequences were relatively short.
A few significant LDs were found between tightly linked or even between unlinked genes (supplemental Figure 3S at http://www.genetics.org/supplemental/), although none of them remained significant after Bonferroni correction. Nevertheless, these associations could be a sign of either population substructure or strong epistatic interactions between genes. The latter one is especially likely for the EF1 and 60SRPL31a genes, because both are involved in ribosomal biosynthesis, and for the F3H1 and F3H2 genes that are apparently involved in the same metabolic pathway.
Association mapping requires careful selection of SNPs for genotyping. Our data will help us select the most informative and potentially useful htSNPs in 18 candidate genes for association mapping. We developed a complex approach that takes into account all available data to increase the likelihood of detecting associations. The polymorphic SNPs that were discovered in this study in coding regions, which cause nonsynonymous substitutions, mark haploblocks, and are under positive selection, are the best candidates for the association mapping study that is now in progress.
Connecting phenotype with genotype is the fundamental aim of genetics (BOTSTEIN and RISCH 2003). The candidate gene-based association studies are considered as one of the best approaches to connect complex phenotypes with genotypes (PFLIEGER et al. 2001; BOTSTEIN and RISCH 2003; NEALE and SAVOLAINEN 2004; REBBECK et al. 2004). This study proved that Douglas fir meets the most important conditions for candidate gene-based association studies such as high phenotypic variation, high SNP polymorphism in candidate genes, and moderate LD. The lack of population subdivision observed in the SNP discovery panel will also facilitate association mapping. However, it is too early to make a conclusion about population structure. The further study of a much larger sample of
1300 trees from an association study will provide more data on population structure.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
| LITERATURE CITED |
|---|
|
|
|---|
AAGAARD, J. E., K. V. KRUTOVSKII and S. H. STRAUSS, 1998a RAPDs and allozymes exhibit similar levels of diversity and differentiation among populations and races of Douglas-fir. Heredity 81: 6978.[CrossRef]
AAGAARD, J. E., K. V. KRUTOVSKII and S. H. STRAUSS, 1998b RAPD markers of mitochondrial origin exhibit lower population diversity and higher differentiation than RAPDs of nuclear origin in Douglas fir. Mol. Ecol. 7: 801812.[CrossRef]
AITKEN, S. N., and W. T. ADAMS, 1997 Spring cold hardiness under strong genetic control in Oregon populations of coastal Douglas-fir. Can J. For. Res. 27: 17731778.[CrossRef]
ANEKONDA, T. S., W. T. ADAMS, S. N. AITKEN, D. B. NEALE, K. D. JERMSTAD et al., 2000 Genetics of cold-hardiness in a cloned full-sib family of coastal Douglas-fir. Can. J. For. Res. 30: 837840.[CrossRef]
ARANDA, M. A., M. ESCALER, D. WANG and A. J. MAULE, 1996 Induction of HSP70 and polyubiquitin expression associated with plant virus replication. Proc. Natl. Acad. Sci. USA 93: 1528915293.
BOREVITZ, J. O., and M. NORDBORG, 2003 The impact of genomics on the study of natural variation in Arabidopsis. Plant Physiol. 132: 718725.
BOTSTEIN, D., and N. RISCH, 2003 Discovering genotypes underlying human phenotypes: past successes for Mendelian disease, future approaches for complex disease. Nat. Genet. 33(Suppl.): 228237.
BROOKES, A. J., 1999 The essence of SNPs. Gene 234: 177186.[CrossRef][Medline]
BROWN, G. R., D. L. BASSONI, G. P. GILL, J. R. FONTANA, N. C. WHEELER et al., 2003 Identification of quantitative trait loci influencing wood property traits in loblolly pine (Pinus taeda L.). III. QTL verification and candidate gene mapping. Genetics 164: 15371546.
BROWN, G. R., G. P. GILL, R. J. KUNTZ, C. H. LANGLEY and D. B. NEALE, 2004a Nucleotide diversity and linkage disequilibrium in loblolly pine. Proc. Natl. Acad. Sci. USA 101: 1525515260.
BROWN, G. R., G. P. GILL, R. J. KUNTZ, J. A. BEAL, D. NELSON et al., 2004b Associations of candidate gene single nucleotide polymorphisms with wood property phenotypes in loblolly pine. Plant & Animal Genome XII Conference, San Diego, January 1014, 2004 (http://www.intl-pag.org/pag/12/abstracts/W22_PAG12_98.html).
BROWSE, J., and B. M. LANGE, 2004 Counting the cost of a cold-blooded life: metabolomics of cold acclimation. Proc. Natl. Acad. Sci. USA 101: 1499614997.
CAMPBELL, R. K., and F. C. SORENSEN, 1978 Effect of test environment on expression of clines and on delimitation of seed zones in Douglas-fir. Theor. Appl. Genet. 51: 233246.[CrossRef]
CAMPBELL, R. K., and A. I. SUGANO, 1975 Phenology of bud burst in Douglas-fir related to provenance, photoperiod, chilling and flushing temperature. Bot. Gaz. 136: 290298.[CrossRef]
CARGILL, M., D. ALTSHULER, J. IRELAND, P. SKLAR, K. ARDLIE et al., 1999 Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nat. Genet. 22: 231238.[CrossRef][Medline]
CARLSON, C. S., M. A. EBERLE, L. KRUGLYAK and D. A. NICKERSON, 2004 Mapping complex disease loci in whole-genome association studies. Nature 429: 446452.[CrossRef][Medline]
CLOSE, T. J., 1997 Dehydrins: a commonality in the response of plants to dehydration and low temperature. Physiol. Plant. 100: 291296.[CrossRef]
COOK, D., S. FOWLER, O. FIEHN and M. F. THOMASHOW, 2004 A prominent role for the CBF cold response pathway in configuring the low-temperature metabolome of Arabidopsis. Proc. Natl. Acad. Sci. USA 101: 1524315248.
DONG, J.-Z., and D. I. DUNSTAN, 1996 Expression of abundant mRNAs during somatic embryogenesis of white spruce [Picea glauca (Moench) Voss]. Planta 199: 459466.[Medline]
DVORNYK, V., A. SIRVIÖ, M. MIKKONEN and O. SAVOLAINEN, 2002 Low nucleotide diversity at the pal1 locus in the widely distributed Pinus sylvestris. Mol. Biol. Evol. 19: 179188.
DUBOS, C., G. LE PROVOST, D. POT, F. SALIN, C. LALANE et al., 2003 Identification and characterization of water-stress-responsive genes in hydroponically grown maritime pine (Pinus pinaster) seedlings. Tree Physiol. 23: 169179.[Medline]
EWING, B., and P. GREEN, 1998 Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8: 186194.
EWING, B., L. HILLIER, M. WENDL and P. GREEN, 1998 Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8: 175185.
EXCOFFIER, L., P. E SMOUSE and J. M. QUATTRO, 1992 Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics 131: 479491.[Abstract]
EXCOFFIER, L., S. SCHNEIDER and D. ROESSLI, 2004 ARLEQUIN ver 2.0: a software for population genetics data analysis (http://lgb.unige.ch/arlequin).
FARNIR, F., W. COPPIETERS, J.-J. ARRANZ, P. BERZI, N. CAMBISANO et al., 2000 Extensive genome-wide linkage disequilibrium in cattle. Genome Res. 10: 220227.
FEDER, M. E., and T. MITCHELL-OLDS, 2003 Evolutionary and ecological functional genomics. Nat. Rev. Genet. 4: 649655.[CrossRef]
FLINT-GARCÍA, S. A., J. M. THORNSBERRY and E. S. BUCKLER, IV, 2003 Structure of linkage disequilibrium in plants. Annu. Rev. Plant. Biol. 54: 357374.[CrossRef][Medline]
FOWLER, S., and M. F. THOMASHOW, 2002 Arabidopsis transcriptome profiling indicates that multiple regulatory pathways are activated during cold acclimation in addition to the CBF cold response pathway. Plant Cell 14: 16751690.
FU, Y.-X., and W.-H. LI, 1993 Statistical tests of neutrality of mutations. Genetics 133: 693709.[Abstract]
GARCÍA-GIL, M. R., M. MIKKONEN and O. SAVOLAINEN, 2003 Nucleotide diversity at two phytochrome loci along a latitudinal cline in Pinus sylvestris. Mol. Ecol. 12: 11951206.[CrossRef][Medline]
GLAZIER, A. M., J. H. NADEAU and T. J. AITMAN, 2002 Finding genes that underlie complex traits. Science 298: 23452349.
GOLDSTEIN, D. B., and M. E. WEALE, 2001 Population genomics: linkage disequilibrium holds the key. Curr. Biol. 11: R576R579.[CrossRef][Medline]
GORDON, D., C. ABAJIAN and P. GREEN, 1998 Consed: a graphical tool for sequence finishing. Genome Res. 8: 195202.
GORDON, D., C. DESMARAIS and P. GREEN, 2001 Automated finishing with autofinish. Genome Res. 11: 614625.
GOUDET, J., M. RAYMOND, T. DE MEEÜS and F. ROUSSET, 1996 Testing differentiation in diploid populations. Genetics 144: 19331940.[Abstract]
HALUSHKA, M. K., J. B. TAN, K. BENTLEY, L. HSIE, N. P. SHEN et al., 1999 Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis. Nat. Genet. 22: 239247.[CrossRef][Medline]
HARRIS, M., J. M. MARTIN, J. F. PEDEN and C. J. RAWLINGS, 2003 SNP cherry picker: maximizing the chance of finding an association with a disease SNP. Bioinformatics 19: 21412143.
HERTZBERG, M., H. ASPEBORG, J. SCHRADER, A. ANDERSSON, R. ERLANDSSON et al., 2001 A transcriptional roadmap to wood formation. Proc. Natl. Acad. Sci. USA 98: 1473214737.
HILL, W. G., and A. ROBERTSON, 1968 Linkage disequilibrium in finite populations. Theor. Appl. Genet. 38: 226231.[CrossRef]
HUDSON, R. R., 1991 Gene genealogies and the coalescent process. Oxf. Surv. Evol. Biol. 7: 144.
HUDSON, R. R., 2000 A new statistic for detecting genetic differentiation. Genetics 155: 20112014.
HUDSON, R. R., M. KREITMAN and M. AGUADE, 1987 A test of neutral molecular evolution based on nucleotide data. Genetics 116: 153159.
INGVARSSON, P. K., 2005 Nucleotide polymorphism and linkage disequilibrium within and among natural populations of European aspen (Populus tremula L., Salicaceae). Genetics 169: 945953.
JERMSTAD, K. D., D. L. BASSONI, N. C. WHEELER and D. B. NEALE, 1998 A sex-averaged linkage map in coastal Douglas-fir (Pseudotsuga menziesii [Mirb.] Franco) based on RFLP and RAPD markers. Theor. Appl. Genet. 97: 762770.[CrossRef]
JERMSTAD, K. D., D. L. BASSONI, K. S. JECH, N. C. WHEELER and D. B. NEALE, 2001a Mapping of quantitative trait loci controlling adaptive traits in coastal Douglas-fir. I. Timing of vegetative bud flush. Theor. Appl. Genet. 102: 11421151.[CrossRef]
JERMSTAD, K. D., D. L. BASSONI, N. C. WHEELER, T. S. ANEKONDA, S. N. AITKEN et al., 2001b Mapping of quantitative trait loci controlling adaptive traits in coastal Douglas-fir. II. Spring and fall cold-hardiness. Theor. Appl. Genet. 102: 11521158.[CrossRef]
JERMSTAD, K. D., D. L. BASSONI, K. S. JECH, G. A. RITCHIE, N. C. WHEELER et al., 2003 Mapping of quantitative trait loci controlling adaptive traits in coastal Douglas fir. III. QTL by environment interactions. Genetics 165: 14891506.
JUKES, T. H, and C. R. CANTOR, 1969 Evolution of protein molecules, pp. 21132 in Mammalian Protein Metabolism, edited by H. N. MUNRO. Academic Press, New York.
KADO, T., H. YOSHIMARU, Y. TSUMURA and H. TACHIDA, 2003 DNA variation in a conifer, Cryptomeria japonica (Cupressaceae sensu lato). Genetics 164: 15471559.
KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, UK.
KIYOSUE, T., K. YAMAGUCHI-SHINOZAKI and K. SHINOZAKI, 1994 ERD15, a cDNA for a dehydration-induced gene from Arabidopsis thaliana. Plant Physiol. 106: 1707.[CrossRef][Medline]
KREITMAN, M., 2000 Methods to detect selection in populations with applications to the human. Annu. Rev. Genomics Hum. Genet. 1: 539559.[CrossRef][Medline]
KRUTOVSKY, K. V., M. TROGGIO, G. R. BROWN, K. D. JERMSTAD and D. B. NEALE, 2004 Comparative mapping in the Pinaceae. Genetics 168: 447461.
KUMAR, S., K. TAMURA and M. NEI, 2004 MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief. Bioinform. 5: 150163.
KUWABARA, C., D. TAKEZAWA, T. SHIMADA, T. HAMADA, S. FUJIKAWA et al., 2002 Abscisic acid- and cold-induced thaumatin-like protein in winter wheat has an antifungal activity against snow mould, Microdochium nivale. Physiol. Plant. 115: 101110.[CrossRef][Medline]
LEWONTIN, R. C., 1964 The interaction of selection and linkage. I. General considerations: heterotic models. Genetics 49: 4967.
LI, P., and W. T. ADAMS, 1993 Genetic control of bud phenology in pole-size trees and seedlings of coastal Douglas-fir. Can. J. For. Res. 23: 10431051.
LI, W.-H., 1997 Molecular Evolution. Sinauer, Sunderland, MA.
LUIKART, G., P. R. ENGLAND, D. TALLMON, S. JORDAN and P. TABERLET, 2003 The power and promise of population genomics: from genotyping to genome typing. Nat. Rev. Genet. 4: 981994.[Medline]
MCDONALD, J. H., and M. KREITMAN, 1991 Adaptive protein evolution at the Adh locus in Drosophila. Nature 351: 652654.[CrossRef][Medline]
MERKLE, S. A., and W. T. ADAMS, 1987 Pattern of allozyme variation within and among Douglas-fir breeding zones in southwest Oregon. Can. J. For. Res. 17: 402407.
MORAN, G. F., and W. T. ADAMS, 1989 Microgeographical patterns of allozyme differentiation in Douglas-fir from southwest Oregon. For. Sci. 35: 315.
MORIYAMA, E. N., and J. R. POWELL, 1996 Intraspecific nuclear DNA variation in Drosophila. Mol. Biol. Evol. 13: 261277.[Abstract]
NEALE, D. B., and O. SAVOLAINEN, 2004 Association genetics of complex traits in conifers. Trends Plant Sci. 9: 325330.[CrossRef][Medline]
NEI, M., 1987 Molecular Evolutionary Genetics. Columbia University Press, New York.
NEI, M., and T. GOJOBORI, 1986 Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol. Biol. Evol. 3: 418426.[Abstract]
NEI, M., and S. KUMAR, 2000 Molecular Evolution and Phylogenetics. Oxford University Press, New York.
NOGUEIRA, F. T. S., V. E. DE ROSA, JR., M. MENOSSI, E. C. ULIAN and P. ARRUDA, 2003 RNA expression profiles and data mining of sugarcane response to low temperature. Plant Physiol. 132: 18111824.
NORDBORG, M., and H. INNAN, 2002 Molecular population genetics. Curr. Opin. Plant Biol. 5: 6973.[CrossRef][Medline]
PALVA, E. T., and P. HEINO, 1998 Molecular mechanism of plant cold acclimation and freezing tolerance, pp. 314 in Plant Cold Hardiness, edited by P. H. LI and T. H. H. CHEN. Plenum, New York.
PFLIEGER, S., V. LEFEBVRE and M. CAUSSE, 2001 The candidate gene approach in plant genetics: a review. Mol. Breed. 7: 275291.[CrossRef]
POT, D., L. MCMILLAN, C. ECHT, G. LE PROVOST, P. GARNIER-GERE et al., 2005 Nucleotide variation in genes involved in wood formation in two pine species. New Phytol. 167: 101112.[CrossRef][Medline]
PROVART, N. J., P. GIL, W. CHEN, B. HAN, H.-S. CHANG et al., 2003 Gene expression phenotypes of Arabidopsis associated with sensitivity to low temperatures. Plant Physiol. 132: 893906.
RABBANI, M. A., K. MARUYAMA, H. ABE, M. A. KHAN, K. KATSURA et al., 2003 Monitoring expression profiles of rice genes under cold, drought, and high-salinity stresses and abscisic acid application using cDNA microarray and RNA gel-blot analyses. Plant Physiol. 133: 17551767.
RAFALSKI, A. J., 2002 Novel genetic mapping tools in plants: SNPs and LD-based approaches. Plant Sci. 162: 329333.[CrossRef]
RAFALSKI, A., and M. MORGANTE, 2004 Corn and humans: recombination and linkage disequilibrium in two genomes of similar size. Trends Genet. 20: 103111.[CrossRef][Medline]
RAYMOND, M., and F. ROUSSET, 1995 An exact test for population differentiation. Evolution 49: 12801283.[CrossRef]
REBBECK, T. R., M. SPITZ and X. WU, 2004 Assessing the function of genetic variants in candidate gene association studies. Nat. Rev. Genet. 5: 589597.[Medline]
REHFELDT, G. E., 1983 Genetic variability within Douglas-fir populations: implications for tree improvement. Silvae Genet. 32: 914.
REHFELDT, G. E., 1989 Ecological adaptations in Douglas-fir (Pseudotsuga menziesii var. glauca): a synthesis. For. Ecol. Manage. 28: 203215.
ROZAS, J., J. C. SÁNCHEZ-DELBARRIO, X. MESSEGUER and R. ROZAS, 2003 DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19: 24962497.
SAITOU, N., and M. NEI, 1987 The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4: 406425.[Abstract]
SCHLÖTTERER, C., 2002 Towards a molecular characterization of adaptation in local populations. Curr. Opin. Genet. Dev. 12: 683687.[CrossRef][Medline]
SCHMID, K. J., S. RAMOS-ONSINS, H. RINGYS-BECKSTEIN, B. WEISSHAAR and T. MITCHELL-OLDS, 2005 A multilocus sequence survey in Arabidopsis thaliana reveals a genome-wide departure from a neutral model of DNA sequence polymorphism. Genetics 169: 16011615.
SEKI, M., M. NARUSAKA, H. ABE, M. KASUGA, K. YAMAGUCHI-SHINOZAKI et al., 2001 Monitoring the expression pattern of 1300 Arabidopsis genes under drought and cold stresses by using a full-length cDNA microarray. Plant Cell 13: 6172.
SEKI, M., M. NARUSAKA, J. ISHIDA, T. NANJO, M. FUJITA et al., 2002 Monitoring the expression profiles of 7000 Arabidopsis genes under drought, cold and high-salinity stresses using a full-length cDNA microarray. Plant J. 31: 279292.[CrossRef][Medline]
STEINER, K. C., 1979 Variation in bud-burst timing among populations of interior Douglas-fir. Silvae Genet. 28: 7679.
TAJIMA, F., 1989 Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123: 585595.
TENAILLON, M. I., M. C. SAWKINS, A. D. LONG, R. L. GAUT, J. F. DOEBLEY et al., 2001 Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp mays L.). Proc. Natl. Acad. Sci. USA 98: 91619166.
THOMASHOW, M. F., 1998 Role of cold-responsive genes in plant freezing tolerance. Plant Physiol. 118: 17.
THOMASHOW, M. F., 1999 Plant cold acclimation: freezing tolerance genes and regulatory mechanisms. Annu. Rev. Plant Physiol. Plant Mol. Biol. 50: 571599.[CrossRef]
THOMASHOW, M. F., 2001 So what's new in the field of plant cold acclimation? Lots! Plant Physiol. 125: 8993.
VIARD, F., Y. A. EL-KASSABY and K. RITLAND, 2001 Diversity and genetic structure in populations of Pseudotsuga menziesii (Pinaceae) at chloroplast microsatellite loci. Genome 44: 336344.[Medline]
WANNER, L. A., and O. JUNTTILA, 1999 Cold-induced freezing tolerance in Arabidopsis. Plant Physiol. 120: 391400.
WATTERSON, G. A., 1975 On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7: 256276.[CrossRef][Medline]
WEIR, B. S., 1996 Genetic Data Analysis II. Sinauer Associates, Sunderland, MA.
WHEELER, N. C., K. D. JERMSTAD, K. V. KRUTOVSKY, S. N. AITKEN, G. T. HOWE et al., 2005 Mapping of quantitative trait loci controlling adaptive traits in coastal Douglas-Fir. IV. Cold-hardiness QTL verification and candidate gene mapping. Mol. Breed. 15: 145156.[CrossRef]
XIAYI, K., and L. R. CARDON, 2003 Efficient selective screening of haplotype tag SNPs. Bioinformatics 19: 287288.
ZHANG, K., and L. JIN, 2003 HaploBlockFinder: haplotype block analyses. Bioinformatics 19: 13001301.
ZHU, Y. L., Q. J. SONG, D. L. HYTEN, C. P. VAN TASSELL, L. K. MATUKUMALLI et al., 2003 Single-nucleotide polymorphisms in soybean. Genetics 163: 11231134.
ZWICK, M. E., D. J. CUTLER and A. CHAKRAVARTI, 2000 Patterns of genetic variation in Mendelian and complex traits. Annu. Rev. Genomics Hum. Genet. 1: 387407.[CrossRef]