Genetics, Vol. 164, 1547-1559, August 2003, Copyright © 2003

DNA Variation in a Conifer, Cryptomeria japonica (Cupressaceae sensu lato)

Tomoyuki Kadoa, Hiroshi Yoshimarub, Yoshihiko Tsumurab, and Hidenori Tachidaa
a Department of Biology, Faculty of Sciences, Kyushu University, Fukuoka 810-8560, Japan
b Forestry and Forest Products Research Institute, Kukizaki, Ibaraki 305-8687, Japan

Corresponding author: Hidenori Tachida, Faculty of Sciences, Kyushu University, Ropponmatsu, Fukuoka, 810-8560, Japan., htachscb{at}mbox.nc.kyushu-u.ac.jp (E-mail)

Communicating editor: M. AGUADÉ


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

We investigated the nucleotide variation of a conifer, Cryptomeria japonica, and the divergence between this species and its closest relative, Taxodium distichum, at seven nuclear loci (Acl5, Chi1, Ferr, GapC, HemA, Lcyb, and Pat). Samples of C. japonica were collected from three areas, Kantou-Toukai, Hokuriku, and Iwate. No apparent geographic differentiation was found among these samples. However, the frequency spectrum of the nucleotide polymorphism revealed excesses of intermediate-frequency variants, which suggests that the population was not panmictic and a constant size in the past. The average nucleotide diversity, {pi}, for silent sites was 0.00383. However, values of {pi} for silent sites vary among loci. Comparisons of polymorphism to divergence among loci (the HKA test) showed that the polymorphism at the Acl5 locus was significantly lower. We also observed a nearly significant excess of replacement polymorphisms at the Lcyb locus. These results suggested possibilities of natural selection acting at some of the loci. Intragenic recombination was detected only once at the Chi1 locus and was not detected at the other loci. The low level of population recombination rate, 4Nr, seemed to be due to both low level of recombination, r, and small population size, N.


RECENT progress in DNA sequencing techniques enables us to study details of genetic variation and divergence. Studies of polymorphism and divergence across several genes are still limited to a few model organisms, however. It is important to study genetic variation in various species in diverse taxa, since the effects of evolutionary factors are expected to differ among species. In the past, genetic variation in many plant species has been measured by electrophoresis. HAMRICK and GODT 1990 Down, HAMRICK and GODT 1996 Down summarized these allozyme data, paying attention to characteristics of the species such as life cycle, pollination system, seed dispersal, and mating system. They showed that these characteristics were indeed related to the amount and pattern of genetic variation.

In plants, nucleotide variation in nuclear genes has been studied well in herbaceous plants, especially in Arabidopsis and its relatives. These studies have provided us interesting information about these plants' past population structures and natural selection (e.g., INNAN et al. 1996 Down; FILATOV and CHARLESWORTH 1999 Down; KAWABE and MIYASHITA 1999 Down; STAHL et al. 1999 Down; KUITTINEN and AGUADE 2000 Down; SAVOLAINEN et al. 2000 Down; AGUADE 2001 Down); however, there has been only one such published report about woody plants (DVORNYK et al. 2002 Down). Woody plants usually have a longer generation time, and thus the relative effects of mutation and selection may be different from those in herbaceous plants.

In this study, we surveyed nucleotide variation in a conifer, Cryptomeria japonica (called sugi in Japanese) and its divergence from Taxodium distichum var. distichum (bald cypress, distributed in the southeast parts of the United States). These species belong to Cupressaceae sensu lato, and the phylogenetic relationship of the family has been inferred from plastid DNA sequences (GADEK et al. 2000 Down; KUSUMI et al. 2000 Down). Natural forests of C. japonica are distributed from Aomori Prefecture (40°42'N) to Yaku Island (30°15'N) in the Japanese Archipelago (HAYASHI 1960 Down), although the populations are presently discontinuous and scattered in limited areas due to past human exploitation. This species has a long generation time and sometimes lives more than a thousand years. It is mostly outbreeding and wind pollinated, like other gymnosperms. HAMRICK and GODT 1990 Down, HAMRICK and GODT 1996 Down reported that species having such characteristics showed low genetic differentiation. Indeed, low genetic differentiation among local populations (Fst ~0.05) was reported using allozymes and cleaved amplified polymorphic sequence (CAPS) markers in this species (TOMARU et al. 1994 Down; TSUMURA and TOMARU 1999 Down). While low genetic differentiation among populations has been observed, geographical variation of morphological traits (MURAI 1947 Down) and terpene components (YASUE et al. 1987 Down) has been known to occur in this species. These studies suggested that the species was mainly divided into two groups, called Omote sugi (Pacific Ocean side type) and Ura sugi (Japan Sea side type). The discordance between morphological differentiation and genetic differentiation is an unsolved problem needing explanation in this species (TOMARU et al. 1994 Down).

One of the advantages of studying C. japonica is that some information about the past population structure and mutation rate is available. First, TSUKADA 1982 Down, surveying fossil pollen data all over Japan, suggested that C. japonica retreated to a few refugia during the last glacial period and expanded its range rapidly after that. Retreat to refugia is not confined to the last glacial period. MIYOSHI et al. 1999 Down showed that emergences of the pollens of the species corresponded to several interglacial periods extending back to 400,000 years ago, although only one location was examined in the study. Second, since this species is one of the most important timber species in Japan, humans have been exploiting it for thousands of years. As a result, natural forests of C. japonica have been reduced and now are distributed in limited areas. In addition, plantations were instituted several hundred years ago (TODA 1973 Down; OHBA 1993 Down). More recently, a massive plantation was begun after 1945. Currently this species occupies >45% of the artificial forests in Japan (one-eighth of the area of Japan). And finally, KUSUMI et al. 2002 Down studied the molecular evolution of 11 nuclear loci in C. japonica and its related species. They found a large amount of variation in synonymous substitution rates among loci. Such knowledge will be helpful in the analysis of diversity patterns.

In this study, we investigated nucleotide variation at seven nuclear loci in three populations of C. japonica and the divergence from its closest relative, T. distichum. By analyzing variation at multiple loci from the same set of individuals, we aimed to separate locus-specific patterns from the patterns found across loci and assess the effects of selection, population structure, mutation, and recombination on the molecular evolution of this group of conifers.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Samples:
Since this species has been massively planted after 1945, we had to use care in our sampling. We collected seeds from the plus trees that have been kept in local seed orchards. The plus trees are elite trees that have favorable characteristics such as fast growth, pathogen resistance, and freeze resistance. The plus trees from which the seeds were taken mostly came from local artificial forests where seedlings have been used for plantation. Since the plus trees were generated before 1945, our samples are considered to represent the artificial forests before the massive plantation started after 1945. Samples were collected from three areas, Kantou-Toukai, Hokuriku, and Iwate, and the samples were considered to represent the Pacific Sea side (Omote sugi), the Japan Sea side (Ura sugi), and a northern part of the Pacific Sea side (Omote sugi) types, respectively (see Fig 1). Sixteen plus trees were chosen from each area, and seeds were taken from each tree. In some cases different seeds from the same mother tree were used for different loci.



View larger version (22K):
In this window
In a new window
Download PPT slide
 
Figure 1. Locations of origins of the plus trees. Locations of the forests from which the plus trees were collected are indicated by circles.

Loci:
We investigated the nucleotide variation of 7 nuclear loci. Five of the 7 loci were chosen from the 11 loci studied by KUSUMI et al. 2002 Down. Primers for PCR amplification were those used for sequenced-tagged-site markers (TSUMURA et al. 1997 Down; our unpublished results) or were designed from the expressed sequence tag database by UJINO-IHARA et al. 2000 Down. On the basis of homology, the 7 loci were considered to encode spermine synthase [Acaulis 5 (Acl5)], class I chitinase (Chi1), ferredoxin (Ferr), cytosolic glyceraldehyde-3-phosphate dehydrogenase (GapC), glutamyl-tRNA reductase (HemA), lycopene ß-cyclase (Lcyb), and phosphoribosylanthranilate transferase (Pat), respectively. The Acl5 consisted of 10 exons (1-281, 370-436, 526-620, 710-812, 896-984, 1091-1247, 2214-2289, 2405-2463, 2555-2691, and 2775-2850), nine introns, and 3' untranslated regions (UTR). The Chi1 consisted of a 5' UTR (1-296), 3 exons (297-735, 845-992, and 1093-1468), and two introns. The Ferr consisted of an exon (1-345). The GapC consisted of 5 exons (1-25, 217-314, 1246-1388, 1475-1558, and 1660-1749) and five introns. The HemA consisted of an exon (1-1047). The Lcyb consisted of an exon (1-1575) and a 3' UTR. The Pat consisted of an exon (1-573) and a 3' UTR.

Molecular methods:
Haploid genomic DNA was isolated from megagametophytes of each tree of C. japonica using the DNeasy plant mini kit (QIAGEN, Valencia, CA). In gymnosperms, megagametophytes are of maternal origin and haploid. Since the DNA samples are haploid, direct sequencing is straightforward and haplotypes can be easily determined. Seeds were sown on wet sterilized paper in a plastic plate. Seed coats and embryos were removed after germination. Using a pestle, we crushed a fresh megagametophyte in a 1.5-ml tube with Buffer AP1 of the DNeasy plant mini kit and then followed the manufacturer's instructions for use of the kit. In total, ~100 ng of DNA was obtained from a single seed. In addition to the primers used by KUSUMI et al. 2002 Down, we designed primers for PCR amplifications and for cycle sequencing reactions. All primers used for PCR amplifications are listed in Table 1. The PCR products were purified using Geneclean II (Bio101) or QiaQuick PCR purification kit (QIAGEN). Products were directly sequenced for both strands on ABI Prism 377 or ABI Prism 3100 automatic sequencers using BigDye terminator cycle sequencing ready reaction kit ver. 2 or ver. 3 (PE Biosystems, Foster City, CA). For sequencing of the Acl5, Chi1, and Lcyb of T. distichum, we used one of the diploid DNA samples of TSUMURA et al. 1999 Down. We cloned PCR products amplified by PyroBest DNA polymerase (TaKaRa) into a pGEM T-easy vector (Promega, Madison, WI). We sequenced more than six clones and chose the most frequent haplotype for analyses. We used published data by KUSUMI et al. 2002 Down for the Ferr, GapC, HemA, and Pat sequences of T. distichum (GenBank accession nos. AB075538, AB076715, AB075574, and AB076584).


 
View this table:
In this window
In a new window

 
Table 1. PCR primers used to amplify the seven loci

Mapping of the loci:
Linkage maps of C. japonica have been constructed using CAPS and other molecular markers (MUKAI et al. 1995 Down; NIKAIDO et al. 2000 Down; IWATA et al. 2001 Down). The Ferr and Pat loci had been mapped previously, but the other five loci had not, so we mapped those loci by the pseudo-testcross method (GRATTAPAGLIA and SEDEROFF 1994 Down). The segregation data were analyzed together with other markers using JoinMap (CPRO-DLO). The Acl5 and HemA loci could not be mapped because of the lack of suitable polymorphisms. The Chi1 and Lcyb were mapped onto the same linkage group, and the map distance between them was ~75 cM. The Ferr, GapC, and Pat were mapped to different linkage groups, respectively.

Data analysis:
DNA sequences were aligned manually. Analyses of sequence data (estimators of nucleotide variation, recombination rate, and gene flow; test of linkage disequilibrium; statistics for tests of neutrality; and coalescent simulations) were performed using DnaSP ver. 3.53 (ROZAS and ROZAS 1999 Down). We wrote a coalescent simulation program to obtain the P values of a test for detecting population growth, R2 (RAMOS-ONSINS and ROZAS 2002 Down), by modifying the program described by HUDSON 1990 Down. We used sequences of T. distichum, one of the closest relatives of C. japonica, to perform the Hudson-Kreitman-Aguadé (HKA) test (HUDSON et al. 1987 Down) and McDonald-Kreitman (MK) test (MCDONALD and KREITMAN 1991 Down). Phylogenetic networks were drawn manually and using Network ver. 3.0 (BANDELT et al. 1999 Down). Permutation tests for population subdivision among three areas were performed using programs described by HUDSON 2000 Down.

In addition, we tested goodness of fit of the observed frequency spectrum to that expected under the neutrality and panmixis with constant population size. Consider a polymorphic site with two types of nucleotides. Let i be the number of the rarer variants at this site, and designate the sample configuration of the site by [i:n-i] where n is the sample size. On the basis of sample configurations, we categorized polymorphic sites into three classes consisting of low-frequency (1:47 to 8:40), intermediate-frequency (9:39 to 16:32), and high-frequency (17:33 to 24:24) polymorphisms. We define as a test statistic, where Sj is the number of polymorphic sites that were categorized into the jth class and j is its expected value. A coalescent simulation program modified from that by HUDSON 1990 Down was made assuming the neutrality, panmixis, and constant population size. No recombination within a gene was assumed. A total of 10,000 replicate runs were executed to estimate j and obtain the distribution of {chi}2 under the null hypothesis of the neutrality, panmixis, and constant size. For the simulation, we used the observed number of segregating sites, S, at each locus to condition the number of mutations and assumed that loci are unlinked.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Forty-eight alleles (16 for each area) were sequenced for each locus. We call the partial sequences at respective loci by their putative locus names. The polymorphic sites are summarized in Fig 2. While almost all of the polymorphic sites had two variants, two sites at the Lcyb locus (positions 972 and 1353) had, respectively, three variants each. They were excluded from the analysis of linkage disequilibrium and estimation of the minimum number of recombination events. We used the number of mutations, but not the number of segregating sites, for the estimations and tests. In addition, we excluded gaps in the following analyses. Haplotype networks were constructed on the basis of the sequences, and they are shown in Fig 3.



View larger version (59K):
In this window
In a new window
Download PPT slide
 
Figure 2. A summary of polymorphic sites. The notations N, S, 5, 3, I, and G in Types denote nonsynonymous, synonymous, 5' UTR, 3' UTR, intron, and gap sites, respectively. For nonsynonymous polymorphic sites (type N), the majority and minority amino acids are shown below N using one-letter abbreviations. Asterisks indicate singleton sites. x, consensus sequence is (CTT)5 and + denotes (CTT)7 (from 2351 to 2370 at the Acl5); y, consensus sequence is (AG)3 and - denotes (AG)2 (from 2531 to 2536 at the Acl5); z, consensus sequence is 7-bp deletion and + denotes GTCTCGA (from 1607 to 1613 at the GapC).



View larger version (18K):
In this window
In a new window
Download PPT slide
 
Figure 3. Haplotype networks. Numbers in circles denote numbers of haplotypes. Italic numbers on the branches denote nucleotide substitutions between connected haplotypes, and branches without numbers have single substitutions.

Population subdivision:
Values of Fst estimated by the method of HUDSON et al. 1992 Down are summarized in Table 2. Estimated Fst values range from -0.05587 to 0.1681, and their averages over loci between any pairs of the areas are <4%. Results of the test utilizing Snn (HUDSON 2000 Down) among three areas are also summarized in Table 2. Snn statistics were significant at the 5% level in three of the comparisons between two areas at the Acl5 and GapC loci. Although a few of the comparisons showed statistical significance, we found no clear tendency of population subdivision across loci. In addition, the averages of Fst across loci were low (Table 2). Hence, we regard the present samples of C. japonica as homogeneous and pool the data of the three populations in the following discussion.


 
View this table:
In this window
In a new window

 
Table 2. Summary of Fst and the P values of Snn statistics for population subdivisions

Nucleotide variation:
Statistics of nucleotide variation, S, {pi}, and {theta}w, are summarized in Table 3. The average of the nucleotide diversity, {pi}sil, at silent sites over all loci was 0.00383. However, values of {pi}sil varied among loci. The values of {pi}sil ranged from 0.00017 at the HemA to 0.00813 at the Chi1. Values of {theta}w at silent sites, {theta}w_sil, were 0.00303 on the average and ranged from 0.00092 at the HemA to 0.00644 at the Chi1. Coalescent simulations were performed to obtain the probability that {pi}sil is smaller or larger than the observed value at each locus when the parameter {theta} is assumed to be the estimated average of {pi}sil per site across loci, 0.00383, using DnaSP ver. 3.53. Let {pi} and {theta} be the corresponding values for the sequenced region of each gene. We obtained P({pi} <= 0.00066|{theta} = 0.00383) = 0.0016 at the Acl5, P({pi} > 0.00813|{theta} = 0.00383) = 0.0501 at the Chi1, and P({pi} <= 0.00017|{theta} = 0.00383) = 0.0507 at the HemA.


 
View this table:
In this window
In a new window

 
Table 3. Summary of nucleotide variations at the seven loci

Total divergence, K, silent divergence, Ksil, and replacement divergence, Krep, per site between C. japonica and T. distichum are summarized in Table 4.


 
View this table:
In this window
In a new window

 
Table 4. Summary of divergence between C. japonica and T. distichum

Tests of neutrality:
Values of Fu and Li's F* and D* (FU and LI 1993 Down) and Tajima's D (TAJIMA 1989A Down) are summarized in Table 5. No significant deviation from the neutral expectation was found. Three test statistics, Fs, rg, and R2, for detecting population growth are also summarized in Table 5 (HARPENDING 1994 Down; FU 1997 Down; RAMOS-ONSINS and ROZAS 2002 Down). No significant deviations were found.


 
View this table:
In this window
In a new window

 
Table 5. Summary of the results of neutrality tests at the seven loci

The frequency spectrum obtained from all polymorphic sites of the seven loci is shown in Fig 4. There were excesses of intermediate-frequency variants. The observed value of {chi}2 was 14.268 and P({chi}2 >= 14.268) = 0.0122 under the neutral and panmictic model with constant size. The average and variance of {chi}2 under the null hypothesis were 3.000 and 9.810, respectively.



View larger version (26K):
In this window
In a new window
Download PPT slide
 
Figure 4. The frequency spectrum of the nucleotide polymorphisms obtained from all polymorphic sites at the seven loci. Broken line shows expected values based on a coalescent simulation assuming the neutrality, random mating, and constant size (see text).

P values of the multilocus HKA tests were 0.0620 at all sites and 0.1850 at silent sites. The results of multiple HKA tests are summarized in Table 6. Significantly lower polymorphisms were found at Acl5. A nearly significant excess of nonsynonymous polymorphisms was found at the Lcyb locus by the MK test (Table 7). This observation is supported by the low probability of finding the ratios of polymorphisms and fixed mutations if we compare the nonsynonymous variation at the locus against the synonymous or silent variation at all loci (Table 7).


 
View this table:
In this window
In a new window

 
Table 6. Summary of P values of multiple HKA tests


 
View this table:
In this window
In a new window

 
Table 7. Summary of MK tests

Recombination:
We tested the significance of linkage disequilibria for all pairs of polymorphic sites within and between loci, excluding sites with only singleton variation. While no pairs of sites that were located in different loci showed significant levels of linkage disequilibria after the Bonferroni correction, there were high levels of linkage disequilibria between pairs of sites within each locus (Fig 5). The minimum number of recombination events within each locus, RM, was estimated using HUDSON and KAPLAN's (1985) method (Table 8). Values of RM were one at the Chi1 locus and zero at the other six loci. Estimates of the population recombination rate, C (=4Nr), by HUDSON 1987 Down are summarized in Table 8. However, the values of C may be overestimated as WALL 2000 Down suggested. For example, if we use the estimated value of C = 0.0037 at the GapC locus, we obtain P(RM <= 0|{theta} = 0.00624, 4Nr = 0.0037) = 0.0091 using a coalescent simulation where the observed pairwise difference at the GapC was used as an estimate of the {theta} value. The estimate of RM was zero at this locus.



View larger version (29K):
In this window
In a new window
Download PPT slide
 
Figure 5. Statistical significance of linkage disequilibria among informative sites. There were no informative sites at the HemA.


 
View this table:
In this window
In a new window

 
Table 8. Summary of statistics for intragenic recombinations at the seven loci


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

DNA samples:
In this study, we used DNA samples from artificial forests. Since our samples were obtained from plus trees that have been propagated locally by seedlings and that were planted before the massive plantations started after 1945, we consider them to represent old artificial forests and hopefully to reflect natural populations. Indeed, TOMARU et al. 1994 Down reported that the genetic diversity among plus trees measured by protein electrophoresis was similar to that in natural populations. However, humans have been planting this species for several hundred years and effects of old human plantations are difficult to assess. As discussed below, a past reduction of population size was indicated from our data and this polymorphism pattern may be specific to samples from artificial forests. To examine this issue, we need to investigate samples from natural populations and we are currently analyzing diploid DNAs obtained from natural forests.

Amount of DNA variation:
The mean of the silent nucleotide diversity, {pi}sil, across all loci was 0.00383. This value is higher than that in humans (e.g., ~0.001; NACHMAN et al. 1998 Down), similar to that in Scots pine (0.0049 at the pal1; DVORNYK et al. 2002 Down), and lower than those in Drosophila (e.g., 0.0134 in Drosophila melanogaster and 0.0377 in D. simulans; MORIYAMA and POWELL 1996 Down) and Arabidopsis (0.01 in amplified fragment length polymorphism data of MIYASHITA et al. 1999 Down). However, the values varied among loci and ranged from 0.00017 to 0.00813. The Ksil values between C. japonica and T. distichum also varied and ranged from 0.0452 to 0.1352. KUSUMI et al. 2002 Down also reported large variation of synonymous substitution rates among 11 loci in Cupressaceae, which includes C. japonica.

Population structures:
HAMRICK and GODT 1990 Down, HAMRICK and GODT 1996 Down, summarizing allozyme data from many plant species, concluded that outbreeding and wind-pollinated species generally have low levels of genetic differentiation among local populations. The observed low values of Fst are consistent with this general observation. Similar results were also obtained in natural populations of this species using allozyme and CAPS markers (TOMARU et al. 1994 Down; TSUMURA and TOMARU 1999 Down). Almost the whole range of C. japonica was surveyed in these two studies. Thus, the present population of C. japonica seems to be nearly panmictic.

However, excesses of intermediate-frequency variants were observed across loci and this pattern could not be explained by the panmictic model with constant population size. Thus, we need to consider some past demographic events to explain the observation. One hint comes from the tendency of Tajima's D across loci. If we exclude the data of Acl5 and HemA where we found very low levels of variability, all Tajima's D values were positive (Table 5). This pattern appears if population size was recently reduced (TAJIMA 1989B Down, TAJIMA 1995 Down). This pattern would persist even if an expansion occurred afterward until singleton mutations accumulate in the population. In this case, we expect to observe excesses of intermediate-frequency variants. Therefore, one explanation for the excesses of intermediate-frequency variants is that there was a reduction of population size in a recent past. Fossil pollen data suggest that there was a size reduction in this species during the last glacial period that persisted ~20,000 years (TSUKADA 1982 Down; TAKAHARA 1998 Down). Also, artificial forests were grown using seeds from natural populations at some points in the past and certainly have experienced size reductions. Currently, we cannot specify when the reduction occurred or what caused it but surveys of linkage disequilibria among suitable markers may give information on the time frame of this event.

Intragenic recombination:
The recombination rate is also an important parameter affecting patterns of DNA polymorphism. Low levels of recombination make hitchhiking and background selection more effective (MAYNARD SMITH and HAIGH 1974 Down; KAPLAN et al. 1989 Down; BEGUN and AQUADRO 1992 Down; CHARLESWORTH et al. 1993 Down). To compare recombination rates among species, we need good estimates of C = 4Nr. However, as WALL 2000 Down pointed out, estimators of C are biased and have very large variances when parameters {theta} and C are small (say, in the case of {theta} <= 3 as in our case). Both lengths of sequences and the amount of variation here seem not enough to estimate C. Therefore, here we compare the minimum number of recombination events, RM, per informative site. In C. japonica, only one intragenic recombination event was detected at the Chi1 locus. In other species, intragenic recombination events were detected more frequently. Values of RM per informative site at seven loci in Arabidopsis thaliana (0.08–0.15, KUITTINEN and AGUADE 2000 Down) are larger than those in C. japonica (the highest value being 0.0526 at Chi1). Note that A. thaliana is mostly selfing and thus seems to have very low effective recombination rates. Therefore, the population recombination rate, C = 4Nr, of C. japonica seems to be very low. This indicates that either the recombination rate between adjacent sites is low or the population size in C. japonica is small compared to that in A. thaliana.

In fact, the recombination rate between adjacent sites in C. japonica is lower than that in A. thaliana. The genome size of C. japonica is 22.09 pg per diploid genome (HIZUME et al. 2001 Down), corresponding to ~105 Mb per haploid genome. Its genome size is 100 times larger than that of A. thaliana. On the other hand, estimates of the total map distance of C. japonica range from 1109.1 cM (IWATA et al. 2001 Down) to 1992.3 cM (NIKAIDO et al. 2000 Down), depending on the types of markers used. Even if we take the larger value of 2000 cM, the total map distance of C. japonica is only 3 or 4 times larger than that of A. thaliana (~600 cM; http://nasc.nott.ac.uk/newrimap.html). The recombination rate between adjacent sites is estimated to be about one-thirtieth of that in A. thaliana on the average.

Recently, DVORNYK et al. 2002 Down reported nucleotide variation at the pal1 locus in a conifer, Pinus sylvestris (Scots pine). Interestingly, while the level of nucleotide variation in Scots pine was similar to the average in C. japonica, the estimate of RM from their data was 2, and RM per informative site at the pal1 locus in Scots pine was 0.25, which is much higher than the equivalent measurements in C. japonica. Pines are known to have even larger genome sizes than C. japonica, and the total map distance is reported to be similar to that of C. japonica (CHAGNE et al. 2002 Down). To explain the different levels of population recombination parameter and the similar levels of nucleotide diversity in Scots pine and C. japonica, we invoke the smaller population size and higher mutation rate in C. japonica. While the distribution of C. japonica is restricted to Japan, Scots pine is widely distributed in the Eurasian continent and thus is expected to have a larger population size than C. japonica. KUSUMI et al. 2002 Down reported that the average synonymous substitutions per site among Taxodioideae including C. japonica and Sequoioideae, whose putative common ancestor Parataxodium appeared from the Cretaceous (ARNOLD and LOWTHER 1955 Down), was 0.387 while that between two Pinus species that diverged in the Cretaceous was 0.0357 (Table 6 of DVORNYK et al. 2002 Down). Hence, the mutation rate in Pinus is about one-tenth of that of Cupressaceae. Although we need more data in both species groups to draw any definite conclusion, the contrasting patterns found in the two species groups of conifers seem to reflect differences of genetic and demographic parameters.

Possibility of natural selections:
Significantly low levels of polymorphisms were observed at the Acl5 locus (Table 6). The pattern may be explained by a recent selective sweep at or near this locus (MAYNARD SMITH and HAIGH 1974 Down; KAPLAN et al. 1989 Down). Negative values of D, D*, and F* statistics are also consistent with this hypothesis (Table 5). The Acl5 is homologous to the ACAULIS5 gene of A. thaliana, which encodes spermine synthase (HANZAWA et al. 2000 Down). Spermine and other polyamines have been implicated in various plant growth and developmental processes (KAKKAR and SAWHNEY 2002 Down). Indeed acaulis5 mutants of A. thaliana showed defects in the elongation of stem internode (HANZAWA et al. 2000 Down) but there have been no reports on the physiological roles of this gene in conifers. The indication of a selective sweep makes it worthwhile to investigate the physiological roles of this gene, for example, by measuring levels of expression of the gene and amounts of spermine in various tissues of C. japonica.

Excesses of replacement polymorphisms at the Lcyb locus were observed (Table 7). Although a nearly neutral mutation model with constant population size produces this pattern (TACHIDA 2000 Down), replacement variants observed at this locus were not rare and this is inconsistent with the expectations of the model. We considered a few possibilities that may explain the pattern. First, the pattern may indicate that this locus became a pseudogene in C. japonica. However, the {pi}rep/{pi}sil ratio at this locus was 0.52, which indicates that this locus is still under the selective constraint (Table 4). Second, this gene might have experienced a recent relaxation of purifying selection. We note that five of the eight replacement polymorphic sites were found in the first 200 bp of the gene (Fig 2), which are not well conserved among species (data not shown). If nonsynonymous substitutions in this region are nearly neutral (OHTA 1992 Down) and if a reduction of population size occurred recently in C. japonica as discussed previously, we may observe many replacement polymorphisms with intermediate-to-high variant frequencies in this region. Currently this explanation seems most plausible for the observation but we need to examine polymorphisms of this gene in other related species to evaluate its validity.


*  FOOTNOTES

Sequence data from this article have been deposited with the DDBJ/EMBL/GenBank Data Libraries under accession nos. AB096270, AB096608, AB075538, AB076715, AB075574, and AB076584. Back


*  ACKNOWLEDGMENTS

We thank M. Takahashi, Y. Moriguchi, H. Yomogida, S. Ito, M. Saito, and T. Kondo for their help collecting samples. We also thank N. Tani for his help in mapping. We thank M. Aguadé and two anonymous reviewers for their comments. This work was partially supported by grants from the Program for the Promotion of Basic Research Activities for Innovative Biosciences (PROBRAIN) and Uehara Memorial Foundation.

Manuscript received July 11, 2002; Accepted for publication April 10, 2003.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

AGUADÉ, M., 2001  Nucleotide sequence variation at two genes of the phenylpropanoid pathway, the FAH1 and F3H genes, in Arabidopsis thaliana.. Mol. Biol. Evol. 18:1-9.[Abstract/Free Full Text]

ARNOLD, C. A. and J. C. LOWTHER, 1955  A new Cretaceous conifer from northern Alaska. Am. J. Bot. 42:522-528.

BANDELT, H. J., P. FORSTER, and A. ROHL, 1999  Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 16:37-48.[Abstract]

BEGUN, D. J. and C. F. AQUADRO, 1992  Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster.. Nature 356:519-520.[Medline]

CHAGNE, D., C. LALANNE, D. MADUR, S. KUMAR, and J. M. FRIGERIO et al., 2002  A high density genetic map of maritime pine based on AFLPs. Ann. For. Sci. 59:627-636.

CHARLESWORTH, B., T. H. MORGAN, and D. CHARLESWORTH, 1993  The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303.[Abstract]

DVORNYK, V., A. SIRVIO, M. MIKKONEN, and O. SAVOLAINEN, 2002  Low nucleotide diversity at the pal1 locus in the widely distributed Pinus sylvestris.. Mol. Biol. Evol. 19:179-188.[Abstract/Free Full Text]

FILATOV, D. A. and D. CHARLESWORTH, 1999  DNA polymorphism, haplotype structure and balancing selection in the Leavenworthia PgiC locus. Genetics 153:1423-1434.[Abstract/Free Full Text]

FU, Y. X., 1997  Statistical tests of neutrality for DNA samples from a population. Genetics 147:915-925.[Abstract]

FU, Y. X. and W. H. LI, 1993  Statistical tests of neutrality of mutations. Genetics 133:693-709.[Abstract]

GADEK, P. A., D. L. ALPERS, M. M. HESLEWOOD, and C. J. QUINN, 2000  Relationships within Cupressaceae sensu lato: a combined morphological and molecular approach. Am. J. Bot. 87:1044-1057.[Abstract/Free Full Text]

GRATTAPAGLIA, D. and R. SEDEROFF, 1994  Genetic linkage maps of Eucalyptus grandis and Eucalyptus urophylla using a pseudo-test-cross: mapping strategy and RAPD markers. Genetics 137:1121-1137.[Abstract]

HAMRICK, J. L., and M. J. W. GODT, 1990 Allozyme diversity in plant species, pp. 43–63 in Plant Population Genetics, Breeding and Genetic Resources, edited by A. H. D. BROWN, M. T. CLEGG, A. L. KAHLER and B. S. WEIR. Sinauer Associates, Sunderland, MA.

HAMRICK, J. L. and M. J. W. GODT, 1996  Effects of life history traits on genetic diversity in plant species. Philos. Trans. R. Soc. Lond. B 351:1291-1298.

HANZAWA, Y., T. TAKAHASHI, A. J. MICHAEL, D. BURTIN, and D. LONG et al., 2000  ACAULIS5, an Arabidopsis gene required for stem elongation, encodes a spermine synthase. EMBO J. 19:4248-4256.[Medline]

HARPENDING, H. C., 1994  Signature of ancient population growth in a low-resolution mitochondrial DNA mismatch distribution. Hum. Biol. 66:591-600.[Medline]

HAYASHI, Y., 1960 Taxonomical and Phytogeographical Study of Japanese Conifers. Norin-Shuppan, Tokyo (in Japanese).

HIZUME, M., T. KONDO, F. SHIBATA, and R. ISHIZUKA, 2001  Flow cytometric determination of genome size in the Taxodiaceae, Cupressaceae sensu stricto and Sciadopityaceae. Cytologia 66:307-311.

HUDSON, R. R., 1987  Estimating the recombination parameter of a finite population model without selection. Genet. Res. 50:245-250.[Medline]

HUDSON, R. R., 1990 Gene genealogy and the coalescent process, pp. 1–44 in Oxford Surveys in Evolutionary Biology, edited by P. H. HARVEY and L. PARTRIDGE. Oxford University Press, New York.

HUDSON, R. R., 2000  A new statistic for detecting genetic differentiation. Genetics 155:2011-2014.[Abstract/Free Full Text]

HUDSON, R. R. and N. L. KAPLAN, 1985  Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111:147-164.[Abstract/Free Full Text]

HUDSON, R. R., M. KREITMAN, and M. AGUADÉ, 1987  A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159.[Abstract/Free Full Text]

HUDSON, R. R., M. SLATKIN, and W. P. MADDISON, 1992  Estimation of levels of gene flow from DNA sequence data. Genetics 132:583-589.[Abstract]

INNAN, H., F. TAJIMA, R. TERAUCHI, and N. T. MIYASHITA, 1996  Intragenic recombination in the Adh locus of the wild plant Arabidopsis thaliana.. Genetics 143:1761-1770.[Abstract]

IWATA, H., T. UJINO-IHARA, K. YOSHIMURA, K. NAGASAKA, and Y. MUKAI et al., 2001  Cleaved amplified polymorphic sequence markers in sugi, Cryptomeria japonica D. Don, and their locations on a linkage map. Theor. Appl. Genet. 103:881-895.

KAKKAR, R. J. and V. K. SAWHNEY, 2002  Polyamine research in plants—a changing perspective. Physiol. Plant. 116:281-292.

KAPLAN, N. L., R. R. HUDSON, and C. H. LANGLEY, 1989  "Hitchhiking effect" revisited. Genetics 123:887-899.[Abstract/Free Full Text]

KAWABE, A. and N. T. MIYASHITA, 1999  DNA variation in the basic chitinase locus (ChiB) region of the wild plant Arabidopsis thaliana.. Genetics 153:1445-1453.[Abstract/Free Full Text]

KUITTINEN, H. and M. AGUADÉ, 2000  Nucleotide variation at the CHALCONE ISOMERASE locus in Arabidopsis thaliana.. Genetics 155:863-872.[Abstract/Free Full Text]

KUSUMI, J., Y. TSUMURA, H. YOSHIMARU, and H. TACHIDA, 2000  Phylogenetic relationships in Taxodiaceae and Cupressaceae sensu stricto based on matK gene, chlL gene, trnL-trnF IGS region, and trnL intron sequences. Am. J. Bot. 87:1480-1488.[Abstract/Free Full Text]

KUSUMI, J., Y. TSUMURA, H. YOSHIMARU, and H. TACHIDA, 2002  Molecular evolution of nuclear genes in Cupressaceae, a group of conifer trees. Mol. Biol. Evol. 19:736-747.[Abstract/Free Full Text]

MAYNARD SMITH, J. and J. HAIGH, 1974  The hitch-hiking effect of a favorable gene. Genet. Res. 23:23-35.[Medline]

MCDONALD, J. H. and M. KREITMAN, 1991  Adaptive protein evolution at the Adh locus in Drosophila.. Nature 351:652-654.[Medline]

MIYASHITA, N. T., A. KAWABE, and H. INNAN, 1999  DNA variation in the wild plant Arabidopsis thaliana revealed by amplified fragment length polymorphism analysis. Genetics 152:1723-1731.[Abstract/Free Full Text]

MIYOSHI, N., T. FUJIKI, and Y. MORITA, 1999  Palynology of a 250-m core from Lake Biwa: a 430,000-year record of glacial-interglacial vegetation change in Japan. Rev. Paleobot. Palynol. 104:267-283.

MORIYAMA, E. N. and J. R. POWELL, 1996  Intraspecific nuclear DNA variation in Drosophila.. Mol. Biol. Evol. 13:261-277.[Abstract]

MUKAI, Y., Y. SUYAMA, Y. TSUMURA, T. KAWAHARA, and H. YOSHIMARU et al., 1995  A linkage map for sugi (CRIPTOMERIA-JAPONICA) based on RFLP, RAPD and isozyme loci. Theor. Appl. Genet. 90:835-840.

MURAI, S., 1947 Major forest tree species in the Tohoku region and their varietal problems, pp. 131–151 in Kokudo Saiken Zourin Gijutsu Kouenshu, edited by AOMORI-RINYU-KYOUKAI, Aomoririnyu-kyoukai, Aomori (in Japanese).

NACHMAN, M. W., V. L. BAUER, S. L. CROWELL, and C. F. AQUADRO, 1998  DNA variability and recombination rate at X-linked loci in humans. Genetics 150:1133-1141.[Abstract/Free Full Text]

NIKAIDO, A. M., T. UJINO, H. IWATA, K. YOSHIMURA, and H. YOSHIMURA et al., 2000  AFLP and CAPS linkage maps of Cryptomeria japonica.. Theor. Appl. Genet. 100:825-831.

OHBA, K., 1993 Clonal forestry with sugi (Cryptomeria japonica), pp. 66–90 in Clonal Forestry II, Conservation and Application, edited by M. R. AHUJA and W. J. LIBBY. Springer, New York.

OHTA, T., 1992  The nearly neutral theory of molecular evolution. Annu. Rev. Syst. Ecol. 23:263-286.

RAMOS-ONSINS, S. E. and J. ROZAS, 2002  Statistical properties of new neutrality tests against population growth. Mol. Biol. Evol. 19:2092-2100.[Abstract/Free Full Text]

ROZAS, J. and R. ROZAS, 1999  DnaSP version 3.0: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15:174-175.[Abstract/Free Full Text]

SAVOLAINEN, O., C. H. LANGLEY, B. P. LAZZARO, and H. FREVILLE, 2000  Contrasting patterns of nucleotide polymorphism at the alcohol dehydrogenase locus in the outcrossing Arabidopsis lyrata and the selfing Arabidopsis thaliana.. Mol. Biol. Evol. 17:645-655.[Abstract/Free Full Text]

STAHL, E. A., G. DWYER, R. MAURICIO, M. KREITMAN, and J. BERGELSON, 1999  Dynamics of disease resistance polymorphism at the Rpm1 locus of Arabidopsis.. Nature 400:667-671.[Medline]

TACHIDA, H., 2000  Molecular evolution in a multisite nearly neutral mutation model. J. Mol. Evol. 50:69-81.[Medline]

TAJIMA, F., 1989a  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595.[Abstract/Free Full Text]

TAJIMA, F., 1989b  The effect of change in population size on DNA polymorphism. Genetics 123:597-601.[Abstract/Free Full Text]

TAJIMA, F., 1995  Effect of non-random sampling on the estimation of parameters in population genetics. Genet. Res. 66:267-276.[Medline]

TAKAHARA, H., 1998 Sugi-rin no hensen, pp. 207–223 in Nippon Rettou Syokusei-si, edited by Y. YASUDA and N. MIYOUSHI. Asakura-Shoten, Tokyo (in Japanese).

TODA, R., 1973 Tree breeding, pp. 1–45 in History of Forestry, edited by JAPAN FORESTRY ASSOCIATION. Japan Forestry Association, Tokyo (in Japanese).

TOMARU, N., Y. TSUMURA, and K. OHBA, 1994  Genetic variation and population differentiation in natural populations of Cryptomeria japonica.. Plant Species Biol. 9:191-199.

TSUKADA, M., 1982  Cryptomeria japonica: glacial refugia and late-glacial and postglacial migration. Ecology 63:1091-1105.

TSUMURA, Y. and N. TOMARU, 1999  Genetic diversity of Cryptomeria japonica using co-dominant markers based on sequenced-tagged sites. Theor. Appl. Genet. 98:396-404.

TSUMURA, Y., Y. SUYAMA, K. YOSHIMURA, N. SHIRATO, and Y. MUKAI, 1997  Sequence-tagged-sites (STSs) of cDNA clones in Cryptomeria japonica and their evaluation as molecular markers in conifers. Theor. Appl. Genet. 94:764-772.

TSUMURA, Y., N. TOMARU, Y. SUYAMA, and S. BACCHUS, 1999  Genetic diversity and differentiation of Taxodium in the southeastern United States using cleaved amplified polymorphic sequences. Heredity 83:229-238.

UJINO-IHARA, T., K. YOSHIMURA, Y. UGAWA, H. YOSHIMARU, and K. NAGASAKA et al., 2000  Expression analysis of ESTs derived from the inner bark of Cryptomeria japonica.. Plant Mol. Biol. 43:451-457.[Medline]

WALL, J. D., 2000  A comparison of estimators of the population recombination rate. Mol. Biol. Evol. 17:156-163.[Abstract/Free Full Text]

YASUE, M., K. OGIYAMA, S. SUTO, H. TSUKAHARA, and F. MIYAHARA et al., 1987  Geographical differentiation of natural Cryptomeria stands analyzed by diterpene hydrocarbon constituents of individual trees. J. Jpn. For. Soc. 69:152-156.




This article has been cited by other articles:


Home page
J HeredHome page
H. Chen, P. L. Morrell, M. de la Cruz, and M. T. Clegg
Nucleotide Diversity and Linkage Disequilibrium in Wild Avocado (Persea americana Mill.)
J. Hered., March 14, 2008; (2008) esn016v1.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
E. Eveno, C. Collada, M. A. Guevara, V. Leger, A. Soto, L. Diaz, P. Leger, S. C. Gonzalez-Martinez, M. T. Cervera, C. Plomion, et al.
"Contrasting Patterns of Selection at Pinus pinaster Ait. Drought Stress Candidate Genes as Revealed by Genetic Differentiation Analyses"
Mol. Biol. Evol., February 1, 2008; 25(2): 417 - 437.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. Zhou, K. Zeng, W. Wu, X. Chen, Z. Yang, S. Shi, and C.-I Wu
Population Genetics of Speciation in Nonmodel Organisms: I. Ancestral Polymorphism in Mangroves
Mol. Biol. Evol., December 1, 2007; 24(12): 2746 - 2754.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. Heuertz, E. De Paoli, T. Kallman, H. Larsson, I. Jurman, M. Morgante, M. Lascoux, and N. Gyllenstrand
Multilocus Patterns of Nucleotide Diversity, Linkage Disequilibrium and Demographic History of Norway Spruce [Picea abies (L.) Karst]
Genetics, December 1, 2006; 174(4): 2095 - 2105.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. Liu and J. M. Burke
Patterns of Nucleotide Diversity in Wild and Cultivated Sunflower
Genetics, May 1, 2006; 173(1): 321 - 330.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
X.-F. Ma, A. E. Szmidt, and X.-R. Wang
Genetic Structure and Evolutionary History of a Diploid Hybrid Pine Pinus densata Inferred from the Nucleotide Variation at Seven Gene Loci
Mol. Biol. Evol., April 1, 2006; 23(4): 807 - 816.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. C. Gonzalez-Martinez, E. Ersoz, G. R. Brown, N. C. Wheeler, and D. B. Neale
DNA Sequence Variation and Selection of Tag Single-Nucleotide Polymorphisms at Candidate Genes for Drought-Stress Response in Pinus taeda L.
Genetics, March 1, 2006; 172(3): 1915 - 1926.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Lynch
The Origins of Eukaryotic Gene Structure
Mol. Biol. Evol., February 1, 2006; 23(2): 450 - 468.
[Abstract] [Full Text] [PDF]