Genetics, Vol. 166, 1887-1895, April 2004, Copyright © 2004

Evolution of DMY, a Newly Emergent Male Sex-Determination Gene of Medaka Fish

Jianzhi Zhanga
a Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109

Corresponding author: Jianzhi Zhang, University of Michigan, 3003 Natural Science Bldg., 830 North University Ave., Ann Arbor, MI 48109., jianzhi{at}umich.edu (E-mail)

Communicating editor: S. YOKOYAMA


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The Japanese medaka fish Oryzias latipes has an XX/XY sex-determination system. The Y-linked sex-determination gene DMY is a duplicate of the autosomal gene DMRT1, which encodes a DM-domain-containing transcriptional factor. DMY appears to have originated recently within Oryzias, allowing a detailed evolutionary study of the initial steps that led to the new gene and new sex-determination system. Here I analyze the publicly available DMRT1 and DMY gene sequences of Oryzias species and report the following findings. First, the synonymous substitution rate in DMY is 1.73 times that in DMRT1, consistent with the male-driven evolution hypothesis. Second, the ratio of the rate of nonsynonymous nucleotide substitution (dN) to that of synonymous substitution (dS) is significantly higher in DMY than in DMRT1. Third, in DMRT1, the dN/dS ratio for the DM domain is lower than that for non-DM regions, as expected from the functional importance of the DM domain. But in DMY, the opposite is observed and the DM domain is likely under positive Darwinian selection. Fourth, only one characteristic amino acid distinguishes all DMY sequences from all DMRT1 sequences, suggesting that a single amino acid change may be largely responsible for the establishment of DMY as the male sex-determination gene in medaka fish.


MOST animals have two sexes. However, whether an undifferentiated embryonic gonad eventually develops into a testis or an ovary is determined by genetic, environmental, or both factors, depending on the species concerned. For placental mammals, in which females have two X chromosomes and males have one X and one Y chromosome, male sex is determined by a Y-linked gene SRY (BERTA et al. 1990 Down; SINCLAIR et al. 1990 Down). It has been proposed that SRY originated by divergence from its X-linked homologous gene SOX3, probably due to the lack of recombination between the two genes (STEVANOVIC et al. 1993 Down; FOSTER and GRAVES 1994 Down). It is difficult to delineate the initial steps that led to the emergence of SRY, because SRY likely originated before the separation of placental and marsupial mammals (FOSTER et al. 1992 Down; FOSTER and GRAVES 1994 Down) about 170 million years ago (MYA; KUMAR and HEDGES 1998 Down). The identification of the male sex-determination gene DMY in the Japanese medaka fish Oryzias latipes (MATSUDA et al. 2002 Down; NANDA et al. 2002 Down) provides an excellent opportunity for such a study of the origin of sex-determination genes. Similar to the situation in mammals, O. latipes has XX/XY sex chromosomes. But, in contrast to the small size of mammalian Y, medaka Y is identical in morphology to X. Because the Y chromosome usually undergoes evolutionary degeneration when it no longer recombines with X (NEI 1970 Down; CHARLESWORTH and CHARLESWORTH 2000 Down), the observation in medaka suggests a recent origin of its Y as well as its sex-determination system. In fact, the Y-specific region in medaka originated from transposition of an autosomal region and it spans ~280 kb, containing only one functional gene, DMY (NANDA et al. 2002 Down). A naturally occurring insertion in DMY that leads to the truncation of the gene results in XY females (MATSUDA et al. 2002 Down). A second mutation that reduces the level of DMY expression also results in a high frequency of XY females (MATSUDA et al. 2002 Down). These observations demonstrate the necessity for DMY in male sex determination and strongly suggest it to be the medaka counterpart of the mammalian SRY gene (MATSUDA et al. 2002 Down; NANDA et al. 2002 Down). A recent survey of several different strains, however, identified some male medaka fish that do not possess DMY, suggesting the possibility of other contributing factors in male sex determination (NANDA et al. 2003 Down). DNA sequence analysis shows that DMY is a paralog of an autosomal gene DMRT1, which encodes a conserved zinc-finger transcription factor that is found in mammals, birds, reptiles, fruitflies, and nematodes (ZARKOWER 2001 Down; MATSUDA et al. 2002 Down; NANDA et al. 2002 Down). Phylogenetic and evolutionary analyses further indicated that the duplication event was recent, as the DMY gene is found only in two closely related species (O. latipes and O. curvinotus), but is absent from other species of the genus Oryzias so far examined (KONDO et al. 2003 Down; MATSUDA et al. 2003 Down). In this work, I analyzed the publicly available DNA sequences of the DMY and DMRT1 genes from Oryzias in an attempt to address the following questions. First, when did the duplication occur? Second, is the mutation rate in DMY higher than that in DMRT1? This question is raised because DMY is located on Y, which is always in males, while DMRT1 is in males half the time and in females half the time. The hypothesis of male-driven evolution asserts that the mutation rate is higher in males than in females due to more rounds of cell divisions in the male germ line (LI et al. 2002 Down). Third, does DMY exhibit a high rate of protein evolution? I raise this question because the mammalian male sex-determination gene SRY shows a rapid pace of sequence evolution among species of primates and rodents (TUCKER and LUNDRIGAN 1993 Down; WHITFIELD et al. 1993 Down). Fourth, is there any difference in amino acid substitution pattern between DMY and DMRT1, as such a difference may provide information on the critical regions of the protein that confers the new function of DMY? Finally, how many amino acid substitutions may be responsible for DMY's new role in sex determination?


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The DMY and DMRT1 gene sequences from O. latipes HNI strain, O. latipes Carbio strain, O. curvinotus, and the DMRT1 sequence of O. celebensis were obtained from GenBank. The GenBank accession numbers are listed in the legend to Fig 1. The protein sequences were aligned using CLUSTAL X (THOMPSON et al. 1997 Down) and the DNA sequences were then aligned following the protein alignment. It was found that there is a one-nucleotide deletion near the end of the coding region in the O. latipes HNI DMY gene, which results in a frameshift and earlier termination of the open reading frame (see Fig 1). The nucleotide sequence alignment was thus manually adjusted by incorporating this deletion, and the sequence region that is downstream of this deletion is not used in the evolutionary rate analysis because of the difficulty in assigning synonymous and nonsynonymous differences. Gene trees were reconstructed using the neighbor-joining method (SAITOU and NEI 1987 Down) implemented in MEGA2 (KUMAR et al. 2001 Down), as well as the likelihood method (FELSENSTEIN 1981 Down) implemented in PAUP* (SWOFFORD 1998 Down). Several different distance measures or substitution models (Jukes-Cantor, Kimura's two parameter, Tajima-Nei, and Tamura-Nei; NEI and KUMAR 2000 Down) were used. The bootstrap test (FELSENSTEIN 1985 Down; 2000 replications for neighbor joining and 200 replications for likelihood) was used to examine the reliability of the reconstructed trees. Four-cluster analysis was performed by the PHYLTEST program (RZHETSKY et al. 1995 Down). Ancestral gene sequences were reconstructed for interior nodes by the distance-based Bayesian method (ZHANG and NEI 1997 Down). Numbers of synonymous (s) and nonsynonymous (n) substitutions on tree branches were counted as in ZHANG et al. 1997 Down. A maximum likelihood method (YANG 1998 Down) was used to analyze changes in w = dN/dS in the evolution of DMY and DMRT1 genes. Here dN and dS refer to the numbers of nonsynonymous and synonymous substitutions per site, respectively. The likelihood method (YANG et al. 2000 Down) was also used for testing positive selection on individual sites.



View larger version (17K):
In this window
In a new window
Download PPT slide
 
Figure 1. Protein sequence alignment of DMY and DMRT1 genes of medaka fish (Oryzias species). The DM domain is boxed. Amino acid substitutions in DMY that occurred in the common ancestor of O. latipes and O. curvinotus (see tree b of Fig 2) are indicated by an asterisk. Sequences downstream of the vertical line are not used in the evolutionary rate analyses because of a frameshifting deletion in O. latipes (HNY) DMY. Carbio and HNI are two strains of O. latipes. The sequences are obtained from GenBank, with the accession numbers as follows: O. latipes (Carbio) DMY, AY129240; O. latipes (HNI) DMY, AY129241; O. curvinotus DMY, AB091695; O. latipes (Carbio) DMRT1, AF319994; O. latipes (HNI) DMRT1, AY157712; O. curvinotus DMRT1, AB091696; and O. celebensis DMRT1, AY239587.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Phylogenetic relationships of DMRT1 and DMY genes of Oryzias species:
The DMY gene was found only in O. latipes (HNI and Carbio strains) and O. curvinotus, but not in other species of the Oryzias genus so far examined (KONDO et al. 2003 Down; MATSUDA et al. 2003 Down). This would suggest that the gene duplication that gave rise to DMY occurred in the common ancestor of O. latipes and O. curvinotus after this ancestor diverged from other Oryzias species (MATSUDA et al. 2003 Down). To verify this hypothesis, I first aligned the DMY and DMRT1 sequences (Fig 1) and then reconstructed a gene tree by the neighbor-joining method (Fig 2A). The tree shows the clustering of DMY sequences from the two strains of O. latipes. However, the DMY gene of O. curvinotus does not cluster with those of O. latipes. Rather, it clusters with O. curvinotus DMRT1. Nevertheless, this grouping has only 50% bootstrap support, suggesting a possibility that it may not reflect the true relationships among the sequences. A likelihood analysis resulted in the same tree with slightly different bootstrap numbers (Fig 2A). From these analyses, it appears that the DMY genes from the two strains of O. latipes form a cluster (with 100% bootstrap support) and the DMRT1 genes from O. latipes form another cluster (with >97% bootstrap support). It is the relations of these two clusters and the DMY and DMRT1 genes of O. curvinotus that are in question. I then used a distance-based four-cluster analysis, which is particularly suitable for evaluating the relationships among four monophyletic groups (RZHETSKY et al. 1995 Down). This analysis shows that none of the three possible groupings of the above four clusters is significantly better than the other groupings (P > 0.2; Fig 2 legend). That is, there is not sufficient phylogenetic information in the data that can resolve the relationships among the four clusters. It is well known that sex-determination mechanisms are highly variable among animals (ZARKOWER 2001 Down). It is thus very unlikely that an identical sex-determination system evolved twice in two closely related species by independent duplications of the same gene. Therefore, it is parsimonious to assume that DMY originated only once in Oryzias, as depicted in Fig 2B. Further analyses are based mainly on this assumption. However, to examine whether the results obtained are robust, I also repeated all the analyses without using the two sequences of O. curvinotus, as the tree becomes well resolved when these sequences are excluded (Fig 2A).



View larger version (34K):
In this window
In a new window
Download PPT slide
 
Figure 2. Phylogenetic relationships of seven DMY and DMRT1 gene sequences of medaka fish. (a) The same tree generated by the neighbor-joining and likelihood methods. The neighbor-joining tree was reconstructed with KIMURA's (1980) two-parameter distance, whereas the likelihood tree was reconstructed under the general-reversible model with variable rates of substitutions across sites (plus a class of invariant sites). Bootstrap percentages from neighbor-joining and likelihood analyses are shown on interior branches. (b and c) Alternative trees when all possible relations among groups A, B, C, and D are considered. In a four-cluster analysis (see text), tree a is found to be better than tree c, which is in turn better than tree b; but none of the differences among the three trees are statistically significant (P > 0.2).

A higher rate of synonymous substitution in DMY than in DMRT1:
To study the rate of nucleotide substitution in the evolution of DMY and DMRT1, I inferred the ancestral gene sequences on all interior nodes of the tree shown in Fig 3, which has the same topology as the tree of Fig 2B. Because the sequences are relatively closely related, this inference is expected to be reliable. In fact, the computed average posterior probability is >99% for all the ancestral sequences. I then counted the numbers of synonymous and nonsynonymous substitutions on each tree branch (Fig 3A). There are a total of 31.6 synonymous substitutions in DMY, but only 18.3 in DMRT1 for all the branches since the duplication event. The difference between the two numbers is statistically significant (P = 0.03, one-tailed z-test), indicating a higher rate of synonymous substitution in DMY than in DMRT1. Because synonymous changes are more or less neutral and are generally immune to selection (except in the case of weak selection on codon usage in certain organisms), the above result suggests a higher mutation rate in DMY than in DMRT1. Since DMY is on the Y chromosome, which is always in males, and DMRT1 is on an autosome, which is in males half the time and in females half the time, my observation suggests a higher rate of mutation in males than in females. Note that this inference is not affected by the fact that some males do not have DMY (NANDA et al. 2003 Down) as long as DMY is always in males, which has never been violated. The ratio of the mutation rate in males to that in females may be estimated by {alpha}m = 1/(2A/Y – 1) (MIYATA et al. 1987 Down), where A/Y is the mutation rate ratio between autosome and Y chromosome and is 18.3/31.6 = 0.579 for this data set. Thus, {alpha}m is 6.32. This result supports the male-driven evolution hypothesis, which asserts that the mutation rate is higher in males than in females (MIYATA et al. 1987 Down; LI et al. 2002 Down). When the two sequences of O. curvinotus are excluded from the analysis (Fig 4), A/Y = 8.3/15.1 = 0.550 and {alpha}m is 10.1.



View larger version (41K):
In this window
In a new window
Download PPT slide
 
Figure 3. Nucleotide substitutions in the evolution of DMY and DMRT1 of medaka fish for (a) the entire protein, (b) the DM domain, and (c) the non-DM regions. The tree with a single origin of DMY, as depicted in Fig 2B, is used. The numbers of synonymous (s) and nonsynonymous (n) substitutions for each tree branch are shown on branches. The sums of n for all branches involving DMY (after gene duplication), DMRT1 (after duplication), and DMRT1 (before duplication) are indicated by respective {sum}n values. {sum}s values are similarly obtained. N and S are potential numbers of nonsynonymous and synonymous sites of the sequences, respectively. For the entire protein, N = 523.0 and S = 212.0. For the DM domain, N = 143.7 and S = 54.3. For non-DM regions, N = 379.3 and S = 157.7.



View larger version (37K):
In this window
In a new window
Download PPT slide
 
Figure 4. Nucleotide substitutions in the evolution of DMY and DMRT1 of medaka fish for (a) the entire protein, (b) the DM domain, and (c) the non-DM regions. The DMY and DMRT1 genes from O. curvinotus are excluded from the analysis because of their uncertain phylogenetic positions (Fig 2). The numbers of synonymous (s) and nonsynonymous (n) substitutions for each tree branch are shown on branches. The sums of n for all branches involving DMY (after gene duplication), DMRT1 (after duplication), and DMRT1 (before duplication) are indicated by respective {sum}n values. {sum}s values are similarly obtained. N and S are potential numbers of nonsynonymous and synonymous sites of the sequences, respectively. For the entire protein, N = 523.0 and S = 212.0. For the DM domain, N = 143.7 and S = 54.3. For non-DM regions, N = 379.3 and S = 157.7.

It is important to know when DMY originated. The divergence times for the species concerned are unknown, making it difficult to use DMRT1 or DMY data for time estimation directly. It has been estimated that the nucleotide mutation rate in mammals is on average 2.2 x 10–9/site/year (KUMAR and SUBRAMANIAN 2002 Down). If the mutation rate in medaka DMRT1 is similar to the above rate, it can be estimated that the duplication occurred (7.15/212.0)/(2.2 x 10–9) = 15.3 MYA. Here 7.15 is the average number of synonymous substitutions in DMRT1 from duplication to present, and 212.0 is the number of synonymous sites in the gene, estimated by the modified Nei-Gojobori method (ZHANG et al. 1998 Down). ELLEGREN and FRIDOLFSSON 2003 Down estimated the synonymous substitution rates for four autosomal genes from salmonid fish Oncorhynchus kisutch and O. tshawytscha, which were estimated to be separated ~5 MYA (OOHARA et al. 1997 Down). Weighted by the sequence length, these rates averaged 3.6 x 10–9/site/year. If the synonymous substitution rate in medaka DMRT1 is similar to this rate, I estimated that DMY originated (7.15/212.0)/3.6 x 10–9 = 9.4 MYA. When I exclude the two sequences from O. curvinotus and repeat the above estimation, the date of duplication becomes 5.7–9.2 MYA. Although these estimates may have large errors, it appears that the age of DMY is on the order of 10 MY.

A higher rate of nonsynonymous substitution in DMY1 than in DMRT1:
The evolutionary analysis also reveals a higher rate of nonsynonymous substitution in DMY than in DMRT1 (Fig 3A), as the total number of nonsynonymous substitutions since gene duplication is significantly greater in DMY (66.4) than in DMRT1 (21.7; P < 10–5, two-tailed z-test). This phenomenon is due in part to the elevation of mutation rate in DMY. It may also arise from an alteration in natural selection on DMY, compared to that in DMRT1. The n/s ratio is greater in DMY (66.4/31.6 = 2.10) than in DMRT1 (21.7/18.3 = 1.19) since gene duplication (Fig 3A). However, the difference is statistically insignificant (P = 0.12, Fisher's test, ZHANG et al. 1997 Down). I noted that the n/s ratio for DMRT1 after gene duplication (1.19) is similar to that before the duplication (31/31 = 1.00), and thus computed an average ratio [(21.7 + 31)/(18.3 + 31) = 1.07] for DMRT1. When this number is compared to the n/s ratio for DMY, I found that the former is significantly smaller than the latter (P = 0.02, Fisher's test). These results suggest that the acceleration in nonsynonymous substitutions of DMY is due to the elevation of mutation rate and a change in natural selection on the gene. To further confirm this result, I performed a likelihood analysis (YANG 1998 Down). The log likelihood value under the model of equal w (=dN/dS) among all branches of the tree in Fig 3A is –1929.68, significantly lower than that (–1926.38) under the model of one w for DMY and a different w for DMRT1 (P = 0.01, likelihood ratio test). This two-w model, however, is not significantly worse in fitting the data than a three-w model (log likelihood = –1926.27) that assigns one w for DMY, one for DMRT1 after gene duplication, and one for DMRT1 before duplication (P = 0.64, likelihood ratio test). These results are consistent with the above nonlikelihood-based analyses. Similar results are obtained when the two sequences from O. curvinotus are excluded, as shown in Fig 4A.

Altered rates of nonsynonymous substitution in the DM domain:
The DM domain (Fig 1) is the DNA-binding domain in DMRT1 and DMY. DNA-binding domains are usually the most conserved parts of transcriptional factors. For example, the DNA-binding homeodomain is conserved among 13 cognate Hox genes of vertebrates, some of which diverged over 600 MYA (ZHANG and NEI 1996 Down). DMRT1 appears to follow this rule as well. There are 0.049 nonsynonymous substitutions per nonsynonymous site in the DM domain on all the branches linking DMRT1, in comparison to 0.120 in the non-DM regions (Fig 3B and Fig C). The former is significantly smaller than the latter (P = 0.008, Fisher's test). Surprisingly, for DMY, the opposite is observed. There are 0.180 nonsynonymous substitutions per nonsynonymous site in the DM domain of DMY, in comparison to 0.107 in the non-DM regions (Fig 3B and Fig C), the difference between them being statistically significant (P = 0.02, Fisher's test). A direct comparison between DMY and DMRT1 can be made by a 2 x 2 table with the numbers of nonsynonymous substitutions in the DM domain of DMY (25.8), DM domain of DMRT1 (7), non-DM regions of DMY (40.6), and non-DM regions of DMRT1 (45.7), respectively. This comparison reveals a significant excess of nonsynonymous substitutions in the DM domain of DMY (P = 0.001, Fisher's test). However, the dN/dS ratio for the DM domain of DMY is 0.95, not significantly different from 1 (P > 0.5, Fisher's exact test). Thus, it is unclear whether the accelerated nonsynonymous substitution in the DM domain of DMY is due to a relaxation of functional constraints or positive Darwinian selection. To distinguish the two hypotheses, I used a likelihood method, which may be more powerful in detecting positive selection at individual sites (YANG et al. 2000 Down). In this analysis, I compared the likelihoods for the data set of three DMY sequences under models 1 and 2. Here model 1 assumes that the dN/dS ratio for a given site is either 0 or 1, while model 2 adds an extra class of sites to model 1. The result showed that model 2 fits the data significantly better than model 1 ({chi}2 = 14.3, 2 d.f., P = 0.0008), with an additional class of dN/dS = 9.8. Three codons were identified to be under positive selection with posterior probabilities >95%. To examine whether this result is robust against possible violations of assumptions made in models 1 and 2, I conducted a second test by comparing models 7 and 8. Model 7 assumes that the dN/dS ratio follows a ß-distribution between 0 and 1, while model 8 adds an extra class of sites to model 7. Model 8 was found to fit the data significantly better than model 7 ({chi}2 = 18.54, 2 d.f., P = 0.0001), with an additional class of dN/dS = 7.3. Here, the above three codons and two additional codons were found to be under positive selection with >95% posterior probability. Thus, the positive selection hypothesis is supported by likelihood. Although both tests favor the positive selection hypothesis, the results should be interpreted with caution as false detections of positive selection by likelihood have been reported recently (SUZUKI and NEI 2001 Down, SUZUKI and NEI 2002 Down).

When the two sequences from O. curvinotus are excluded from the analysis (Fig 4), qualitatively similar results are obtained. In particular, the dN/dS ratio (3.25) for the DM domain of DMY is significantly >1 (P = 0.03, Fisher's exact test). This finding, based on a more conservative test (ZHANG et al. 1997 Down), provides additional and strong evidence for the action of positive selection on the DM domain.

Characteristic amino acids of DMY:
There were no synonymous substitutions in DMY after its origin from duplication but before the split of O. latipes and O. curvinotus (Fig 3), suggesting that the duplication occurred shortly before the species separation (MATSUDA et al. 2003 Down). Interestingly, there were three amino acid substitutions during this short period of time, all located in the DM domain (Fig 1 and Fig 3). Because O. latipes and O. curvinotus have the same sex-determination mechanism, it may be inferred that one or more of the three substitutions are important to the establishment of DMY as a sex-determination gene in medaka. Among the three sites, only one (position 26 in O. latipes Carbio DMY) has remained invariant among orthologous DMY sequences. Thus, this site may be most critical to the function of DMY. This site is Thr in the three DMY sequences and Ser in the four DMRT1 sequences shown in Fig 1. I examined this site in all fish DMRT1 sequences available in GenBank and found that all of them have Ser. These sequences include green spotted pufferfish Tetraodon nigroviridis (AY152820), pufferfish Takifugu rubripes (AJ295039), platyfish Xiphophorus maculates (AF529187), tilapia Oreochromis niloticus (AF203489), trout Oncorhynchus mykiss (AF209095), halibut Hippoglossus hippoglossus (CAD44607), sturgeon Acipenser transmontanus (AY057061), cod Gadus morhua (CAD44608), zebrafish Danio rerio (AF080622), and wrasse Halichoeres tenuispinis (AAO18650). Furthermore, I examined this site in DMRT1 sequences of all nonfish vertebrates that are available in GenBank, including human Homo sapiens (NM_021951), mouse Mus musculus (NM_015826), rat Rattus rattus (AAK57706), pig Sus scrofa (AF216651), and chicken Gallus gallus (AAF19666), and found that all have Ser. Thus, the Ser at this position has been conserved among all vertebrate DMRT1 sequences so far examined, suggesting its functional importance. The substitution from Ser to Thr in DMY may therefore have significant functional consequences.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

In this article, I analyzed the rate and pattern of nucleotide substitutions in DMY, a newly emergent sex-determination gene in medaka fish, and made several observations. First, there is evidence for an elevation of the mutation rate in the Y-linked DMY, compared to its autosomal mother gene DMRT1. This result is in support of the male-driven evolution hypothesis, with the estimated male/female mutation rate ratio {alpha}m being about ~6–10. The male-driven evolution hypothesis has gained substantial support from evidence in mammals (MIYATA et al. 1987 Down; SHIMMIN et al. 1993 Down; CHANG et al. 1994 Down; NACHMAN and CROWELL 2000 Down; LI et al. 2002 Down; MAKOVA and LI 2002 Down; but see MCVEAN and HURST 1997 Down; BOHOSSIAN et al. 2000 Down) and some evidence in birds (ELLEGREN and FRIDOLFSSON 1997 Down; KAHN and QUINN 1999 Down; CARMICHAEL et al. 2000 Down). It has also gained support from a comparison of the sequence data of the Y-linked GH-2Y gene and autosomal GH-2 gene of salmonid fish (ELLEGREN and FRIDOLFSSON 2003 Down). Because the estimated {alpha}m values have large errors from either the salmon or the medaka data set, I combined the two data sets and estimated that {alpha}m = 7.7, with the 95% confidence interval of (1.36, {infty}). This confidence interval was determined using computer simulation as follows. The observed total number of synonymous substitutions on the two Y-linked genes (DMY and GH-2Y) is 52.6 and the corresponding number in the homologous autosomal genes (DMRT1 and GH-2) is 29.7. Because nucleotide substitutions may be considered a Poisson process, the number of synonymous substitutions on the two Y-linked genes is a Poisson variable with a mean of 52.6 and the corresponding number for the autosomal genes is an independent Poisson variable with a mean of 29.7. I generated 10,000 pairs of Poisson random variables and then estimated the 95% confidence interval for {alpha}m. As discussed by ELLEGREN and FRIDOLFSSON 2003 Down, it is currently unknown in fish how many cell divisions per generation occur in the male and female germ lines, but males produce much more sperm than females produce eggs and it is probable that there are more rounds of cell divisions in the male germ line. However, it should be stressed that this conclusion on male-driven evolution in fishes is tentative, as it is derived from only two genes. There appears to be substantial variation in mutation rate across genomic regions and more data are necessary to confirm this finding.

Second, a higher dN/dS value (0.85–1.17, depending on whether O. curvinotus is used) is found in DMY, in comparison to that (0.42–0.48) in DMRT1. This elevation is particularly prominent in the DM domain (0.95–3.25 vs. 0.24–0.19). It is interesting to note that SRY, the mammalian sex-determination gene on the Y chromosome, is also known to have a high dN/dS ratio in primates and rodents (TUCKER and LUNDRIGAN 1993 Down; WHITFIELD et al. 1993 Down; PAMILO and O'NEILL 1997 Down; WANG et al. 2002 Down). However, the rapid evolution is limited to regions that are outside the DNA-binding HMG domain. It is still debatable whether the high dN/dS ratio in SRY is due to positive selection or relaxation of functional constraints, and SRY genes of different species may be under different forms of selection (WANG et al. 2002 Down). Recent studies favor the view that Y-linked genes generally have elevated dN/dS in comparison to their homologous genes on X or autosomes (SANDSTEDT 2003 Down; TUCKER et al. 2003 Down). This may be due to several reasons, such as changes in gene function or expression and alleviated intensity of natural selection caused by lack of recombination or reduced effective population size for Y-linked genes. In this case of DMY, however, it is the DNA-binding DM domain that shows high dN/dS values. Both likelihood- and nonlikelihood-based methods suggest the action of positive selection. The selective agent, however, is not immediately clear. It is interesting to relate this finding to the observation that the DNA-binding homeodomains of several homeobox genes of mammals and Drosophila evolve rapidly by positive selection (SUTTON and WILKINSON 1997 Down; TING et al. 1998 Down; WANG and ZHANG 2004 Down) and all these homeobox genes are located in the X chromosome and expressed in the testis (WANG and ZHANG 2004 Down). The rapid evolution of these otherwise conserved DNA-binding (DM and homeobox) domains is intriguing. Their involvement in sexual differentiation and reproduction suggests the possibility that the Darwinian selection that acts on them is related to differential reproductive success and possibly speciation (TING et al. 1998 Down).

Third, I estimated that the duplication that generated the DMY gene took place ~10 MYA. The split of O. latipes and O. curvinotus followed shortly (assuming the tree of Fig 2B). There is only one amino acid position that is conserved among DMY sequences but differs between DMY and DMRT1. It is possible that this single amino acid substitution (Ser26Thr) played a major role in the establishment of DMY as the primary sex-determination gene in medaka fish. In this context, it is interesting to note the high plasticity of sex-determination mechanisms in animals (ZARKOWER 2001 Down). For example, single mutations in the tra-1 gene of the nematode Caenorhabditis elegans can change the sex-determination mechanism entirely (HODGKIN 1983 Down) and strains of C. elegans with different sex-determination mechanisms have been constructed in the laboratory (HODGKIN 2002 Down). Several placental mammals are known to lack the SRY gene (JUST et al. 1995 Down) or have multiple SRY genes (LUNDRIGAN and TUCKER 1997 Down). In the future, it will be interesting to examine the functional effect of Ser26Thr in the DMY gene of medaka fish and test whether this single amino acid substitution was the key to the origin of a new sex-determination gene.

Among vertebrates, SRY and DMY are the only sex-determination genes so far identified. A number of features are surprisingly similar between them, including the existence of close paralogs in the genome, higher dN/dS ratios than those of their paralogs, possible actions of positive selection, and occasional disappearances of the genes in males (JUST et al. 1995 Down; NANDA et al. 2003 Down). These features probably reflect the low functional and developmental constraints on the master controllers of sex-determination pathways and the easiness for natural selection to modify them. Studying additional species may reveal whether these observations are general to all master controllers of sex-determination pathways.

On a final note, the late geneticist Susumo Ohno pioneered evolutionary studies of sex chromosomes and gene duplication (OHNO 1967 Down, OHNO 1970 Down). He would be pleased to see the simple, yet extraordinary path to the emergence of the medaka sex chromosome and sex-determination mechanism, which vividly demonstrates the theory of evolution by gene duplication that he championed (reviewed in ZHANG 2003 Down).


*  ACKNOWLEDGMENTS

I thank David Webb for valuable comments on the manuscript. This work was supported by a startup fund from the University of Michigan and a research grant from the National Institutes of Health (GM-67030).

Manuscript received October 11, 2003; Accepted for publication January 7, 2004.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

BERTA, P., J. R. HAWKINS, A. H. SINCLAIR, A. TAYLOR, and B. L. GRIFFITHS et al., 1990  Genetic evidence equating SRY and the testis-determining factor. Nature 348:448-450.[CrossRef][Medline]

BOHOSSIAN, H. B., H. SKALETSKY, and D. C. PAGE, 2000  Unexpectedly similar rates of nucleotide substitution found in male and female hominids. Nature 406:622-625.[CrossRef][Medline]

CARMICHAEL, A. N., A. K. FRIDOLFSSON, J. HALVERSON, and H. ELLEGREN, 2000  Male-biased mutation rates revealed from Z and W chromosome-linked ATP synthase alpha-subunit (ATP5A1) sequences in birds. J. Mol. Evol. 50:443-447.[Medline]

CHANG, B. H., L. C. SHIMMIN, S. K. SHYUE, D. HEWETT-EMMETT, and W. H. LI, 1994  Weak male-driven molecular evolution in rodents. Proc. Natl. Acad. Sci. USA 91:827-831.[Free Full Text]

CHARLESWORTH, B. and D. CHARLESWORTH, 2000  The degeneration of Y chromosomes. Philos. Trans. R. Soc. Lond. B 355:1563-1572.[CrossRef][Medline]

ELLEGREN, H. and A. K. FRIDOLFSSON, 1997  Male-driven evolution of DNA sequences in birds. Nat. Genet. 17:182-184.[CrossRef][Medline]

ELLEGREN, H. and A. K. FRIDOLFSSON, 2003  Sex-specific mutation rates in salmonid fish. J. Mol. Evol. 56:458-463.[CrossRef][Medline]

FELSENSTEIN, J., 1981  Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17:368-376.[CrossRef][Medline]

FELSENSTEIN, J., 1985  Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39:783-791.[CrossRef]

FOSTER, J. W. and J. A. GRAVES, 1994  An SRY-related sequence on the marsupial X chromosome: implications for the evolution of the mammalian testis-determining gene. Proc. Natl. Acad. Sci. USA 91:1927-1931.[Abstract/Free Full Text]

FOSTER, J. W., F. E. BRENNAN, G. K. HAMPIKIAN, P. N. GOODFELLOW, and A. H. SINCLAIR et al., 1992  Evolution of sex determination and the Y chromosome: SRY-related sequences in marsupials. Nature 359:531-533.[CrossRef][Medline]

HODGKIN, J., 1983  Two types of sex determination in a nematode. Nature 304:267-268.[CrossRef][Medline]

HODGKIN, J., 2002  Exploring the envelope: systematic alteration in the sex-determination system of the nematode Caenorhabditis elegans.. Genetics 162:767-780.[Abstract/Free Full Text]

JUST, W., W. RAU, W. VOGEL, M. AKHVERDIAN, and K. FREDGA et al., 1995  Absence of Sry in species of the vole Ellobius. Nat. Genet. 11:117-118.[CrossRef][Medline]

KAHN, N. W. and T. W. QUINN, 1999  Male-driven evolution among Eoaves? A test of the replicative division hypothesis in a heterogametic female (ZW) system. J. Mol. Evol. 49:750-759.[CrossRef][Medline]

KIMURA, M., 1980  A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16:111-120.[CrossRef][Medline]

KONDO, M., I. NANDA, U. HORNUNG, S. ASAKAWA, and N. SHIMIZU et al., 2003  Absence of the candidate male sex-determining gene dmrt1b(Y) of medaka from other fish species. Curr. Biol. 13:416-420.[CrossRef][Medline]

KUMAR, S. and S. B. HEDGES, 1998  A molecular timescale for vertebrate evolution. Nature 392:917-920.

KUMAR, S. and S. SUBRAMANIAN, 2002  Mutation rates in mammalian genomes. Proc. Natl. Acad. Sci. USA 99:803-808.[Abstract/Free Full Text]

KUMAR, S., K. TAMURA, I. B. JAKOBSEN, and M. NEI, 2001  MEGA2: molecular evolutionary genetics analysis software. Bioinformatics 17:1244-1245.[Abstract/Free Full Text]

LI, W. H., S. YI, and K. MAKOVA, 2002  Male-driven evolution. Curr. Opin. Genet. Dev. 12:650-656.[CrossRef][Medline]

LUNDRIGAN, B. L. and P. K. TUCKER, 1997  Evidence for multiple functional copies of the male sex-determining locus, Sry, in African murine rodents. J. Mol. Evol. 45:60-65.[CrossRef][Medline]

MAKOVA, K. D. and W. H. LI, 2002  Strong male-driven evolution of DNA sequences in humans and apes. Nature 416:624-626.[CrossRef][Medline]

MATSUDA, M., Y. NAGAHAMA, A. SHINOMIYA, T. SATO, and C. MATSUDA et al., 2002  DMY is a Y-specific DM-domain gene required for male development in the medaka fish. Nature 417:559-563.[CrossRef][Medline]

MATSUDA, M., T. SATO, Y. TOYAZAKI, Y. NAGAHAMA, and S. HAMAGUCHI et al., 2003  Oryzias curvinotus has DMY, a gene that is required for male development in the medaka, O. latipes. Zool. Sci. 20:159-161.[CrossRef][Medline]

MCVEAN, G. T. and L. D. HURST, 1997  Evidence for a selectively favourable reduction in the mutation rate of the X chromosome. Nature 386:388-392.[CrossRef][Medline]

MIYATA, T., H. HAYASHIDA, K. KUMA, K. MITSUYASU, and T. YASUNAG, 1987  Male-driven molecular evolution: a model and a nucleotide sequence analysis. Cold Spring Harbor Symp. Quant. Biol. 52:863-867.[Abstract/Free Full Text]

NACHMAN, M. W. and S. L. CROWELL, 2000  Estimate of the mutation rate per nucleotide in humans. Genetics 156:297-304.[Abstract/Free Full Text]

NANDA, I., M. KONDO, U. HORNUNG, S. ASAKAWA, and C. WINKLER et al., 2002  A duplicated copy of DMRT1 in the sex-determining region of the Y chromosome of the medaka, Oryzias latipes. Proc. Natl. Acad. Sci. USA 99:11778-11783.[Abstract/Free Full Text]

NANDA, I., U. HORNUNG, M. KONDO, M. SCHMID, and M. SCHARTL, 2003  Common spontaneous sex-reversed XX males of the medaka Oryzias latipes. Genetics 163:245-251.[Abstract/Free Full Text]

NEI, M., 1970  Accumulation of nonfunctional genes on sheltered chromosomes. Am. Nat. 104:311-322.[CrossRef]

NEI, M., and S. KUMAR, 2000 Molecular Evolution and Phylogenetics. Oxford University Press, New York.

OHNO, S., 1967 Sex Chromosomes and Sex-Linked Genes. Springer-Verlag, Berlin.

OHNO, S., 1970 Evolution by Gene Duplication. Springer-Verlag, Berlin.

OOHARA, I., K. SAWANO, and T. OKAZAKI, 1997  Mitochondrial DNA sequence analysis of the Masu Salmon-phylogeny in the genus Oncorhynchus. Mol. Phylogenet. Evol. 7:71-78.[CrossRef][Medline]

PAMILO, P. and R. J. O'NEILL, 1997  Evolution of the Sry genes. Mol. Biol. Evol. 14:49-55.[Abstract]

RZHETSKY, A., S. KUMAR, and M. NEI, 1995  Four-cluster analysis: a simple method to test phylogenetic hypotheses. Mol. Biol. Evol. 12:163-167.[Abstract]

SAITOU, N. and M. NEI, 1987  The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406-425.[Abstract]

SANDSTEDT, S. A., 2003 Evolution of sex chromosomal genes, Smcy and Smcx, in the mouse genus Mus. Ph.D. Thesis, University of Michigan, Ann Arbor, MI.

SHIMMIN, L. C., B. H. CHANG, and W. H. LI, 1993  Male-driven evolution of DNA sequences. Nature 362:745-747.[CrossRef][Medline]

SINCLAIR, A. H., P. BERTA, M. S. PALMER, J. R. HAWKINS, and B. L. GRIFFITHS et al., 1990  A gene from the human sex-determining region encodes a protein with homology to a conserved DNA-binding motif. Nature 346:240-244.[CrossRef][Medline]

STEVANOVIC, M., R. LOVELL-BADGE, J. COLLIGNON, and P. N. GOODFELLOW, 1993  SOX3 is an X-linked gene related to SRY. Hum. Mol. Genet. 2:2013-2018.[Abstract/Free Full Text]

SUTTON, K. A. and M. F. WILKINSON, 1997  Rapid evolution of a homeodomain: evidence for positive selection. J. Mol. Evol. 45:579-588.[CrossRef][Medline]

SUZUKI, Y. and M. NEI, 2001  Reliabilities of parsimony-based and likelihood-based methods for detecting positive selection at single amino acid sites. Mol. Biol. Evol. 18:2179-2185.[Abstract/Free Full Text]

SUZUKI, Y. and M. NEI, 2002  Simulation study of the reliability and robustness of the statistical methods for detecting positive selection at single amino acid sites. Mol. Biol. Evol. 19:1865-1869.[Abstract/Free Full Text]

SWOFFORD, D. L., 1998 PAUP*: Phylogenetic Analysis Using Parsimony (*and Other Methods), Version 4.0. Sinauer, Sunderland, MA.

THOMPSON, J. D., T. J. GIBSON, F. PLEWNIAK, F. JEANMOUGIN, and D. G. HIGGINS, 1997  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25:4876-4882.[Abstract/Free Full Text]

TING, C., S. C. TSAUR, M. L. WU, and C.-I WU, 1998  A rapidly evolving homeobox at the site of a hybrid sterility gene. Science 282:1501-1504.[Abstract/Free Full Text]

TUCKER, P. K. and B. L. LUNDRIGAN, 1993  Rapid evolution of the sex determining locus in Old World mice and rats. Nature 364:715-717.[CrossRef][Medline]

TUCKER, P. K., R. M. ADKINS, and J. S. REST, 2003  Differential rates of evolution for the ZFY-related zinc finger genes, Zfy, Zfx, and Zfa in the mouse genus Mus. Mol. Biol. Evol. 20:999-1005.[Abstract/Free Full Text]

WANG, X. and J. ZHANG, 2004  Rapid evolution of mammalian X-linked testis-expressed homeobox genes. Genetics 167(in press).

WANG, X., J. ZHANG, and Y. P. ZHANG, 2002  Erratic evolution of SRY in higher primates. Mol. Biol. Evol. 19:582-584.[Free Full Text]

WHITFIELD, L. S., R. LOVELL-BADGE, and P. N. GOODFELLOW, 1993  Rapid sequence evolution of the mammalian sex-determining gene SRY. Nature 364:713-715.[CrossRef][Medline]

YANG, Z., 1998  Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol. Biol. Evol. 15:568-573.[Abstract]

YANG, Z., R. NIELSEN, N. GOLDMAN, and A. M. PEDERSEN, 2000  Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics 155:431-449.[Abstract/Free Full Text]

ZARKOWER, D., 2001  Establishing sexual dimorphism: conservation amidst diversity? Nat. Rev. Genet. 2:175-185.[CrossRef][Medline]

ZHANG, J., 2003  Evolution by gene duplication: an update. Trends Ecol. Evol. 18:292-298.[CrossRef]

ZHANG, J. and M. NEI, 1996  Evolution of Antennapedia-class homeobox genes. Genetics 142:295-303.[Abstract]

ZHANG, J. and M. NEI, 1997  Accuracies of ancestral amino acid sequences inferred by the parsimony, likelihood, and distance methods. J. Mol. Evol. 44(Suppl. 1):S139-S146.

ZHANG, J., S. KUMAR, and M. NEI, 1997  Small-sample tests of episodic adaptive evolution: a case study of primate lysozymes. Mol. Biol. Evol. 14:1335-1338.[Medline]

ZHANG, J., H. F. ROSENBERG, and M. NEI, 1998  Positive Darwinian selection after gene duplication in primate ribonuclease genes. Proc. Natl. Acad. Sci. USA 95:3708-3713.[Abstract/Free Full Text]




This article has been cited by other articles:


Home page
GeneticsHome page
H. Niculita-Hirzel, M. Stock, and N. Perrin
A Key Transcription Cofactor on the Nascent Sex Chromosomes of European Tree Frogs (Hyla arborea)
Genetics, July 1, 2008; 179(3): 1721 - 1723.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. Cho, Z. Y. Huang, and J. Zhang
Sex-Specific Splicing of the Honeybee doublesex Gene Reveals 300 Million Years of Evolution at the Bottom of the Insect Sex-Determination Pathway
Genetics, November 1, 2007; 177(3): 1733 - 1741.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. Shirak, E. Seroussi, A. Cnaani, A. E. Howe, R. Domokhovsky, N. Zilberman, T. D. Kocher, G. Hulata, and M. Ron
Amh and Dmrta2 Genes Map to Tilapia (Oreochromis spp.) Linkage Group 23 Within Quantitative Trait Locus Regions for Sex Determination
Genetics, November 1, 2006; 174(3): 1573 - 1581.
[Abstract] [Full Text] [PDF]


Home page
Biol. Reprod.Home page
X. Huang, Y. Guo, Y. Shui, S. Gao, H. Yu, H. Cheng, and R. Zhou
Multiple Alternative Splicing and Differential Expression of dmrt1 During Gonad Transformation of the Rice Field Eel
Biol Reprod, November 1, 2005; 73(5): 1017 - 1024.
[Abstract] [Full Text] [PDF]