- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Lynch, M.
- Articles by Force, A.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Lynch, M.
- Articles by Force, A.
The Probability of Duplicate Gene Preservation by Subfunctionalization
Michael Lyncha and Allan Forceaa Department of Biology, University of Oregon, Eugene, Oregon 97403
Corresponding author: Michael Lynch, Department of Biology, University of Oregon, Eugene, OR 97403., mlynch{at}oregon.uoregon.edu (E-mail)
Communicating editor: A. G. CLARK
| ABSTRACT |
|---|
It has often been argued that gene-duplication events are most commonly followed by a mutational event that silences one member of the pair, while on rare occasions both members of the pair are preserved as one acquires a mutation with a beneficial function and the other retains the original function. However, empirical evidence from genome duplication events suggests that gene duplicates are preserved in genomes far more commonly and for periods far in excess of the expectations under this model, and whereas some gene duplicates clearly evolve new functions, there is little evidence that this is the most common mechanism of duplicate-gene preservation. An alternative hypothesis is that gene duplicates are frequently preserved by subfunctionalization, whereby both members of a pair experience degenerative mutations that reduce their joint levels and patterns of activity to that of the single ancestral gene. We consider the ways in which the probability of duplicate-gene preservation by such complementary mutations is modified by aspects of gene structure, degree of linkage, mutation rates and effects, and population size. Even if most mutations cause complete loss-of-subfunction, the probability of duplicate-gene preservation can be appreciable if the long-term effective population size is on the order of 105 or smaller, especially if there are more than two independently mutable subfunctions per locus. Even a moderate incidence of partial loss-of-function mutations greatly elevates the probability of preservation. The model proposed herein leads to quantitative predictions that are consistent with observations on the frequency of long-term duplicate gene preservation and with observations that indicate that a common fate of the members of duplicate-gene pairs is the partitioning of tissue-specific patterns of expression of the ancestral gene.
DUPLICATE genes arise frequently in eukaryotic genomes, either via local events that generate tandem duplications, larger-scale events that duplicate chromosomal regions or entire chromosomes, or genome-wide events that result in complete genome duplication (polyploidization). Because gene duplicates are believed to be initially redundant in function, it is commonly thought that one member of the pair will usually become silenced by degenerative mutation. Such nonfunctionalization is expected to occur within a few million generations because the rate of mutation to null alleles is on the order of 10-6 per generation, while the incidence of mutations to novel and beneficial functions is much lower. Although this classical model has been subject to substantial mathematical analysis (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Under the classical model for the evolution of gene duplicates, the only mechanism by which members of a pair can permanently escape mutational decay is neofunctionalization, whereby one copy acquires a new beneficial function with the other retaining the original function (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Numerous examples now exist for the presence of independently mutable regulatory sequences associated with developmental genes. Consider, for example, the bmp5 gene in the mouse. In a study of 34 induced mutations at this locus, ![]()
![]()
![]()
![]()
![]()
![]()
![]()
The idea that the differences in expression patterns of gene duplicates are often a consequence of evolutionary partitioning of the expression domains of ancestral genes, rather than reflecting the origin of new gene functions, is motivated by observations of tissue-specific patterns of expression of duplicate allozymes in tetraploid lineages of fish (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Although the DDC process is based entirely on degenerative mutations, there are at least three ways in which it may play a significant role in creative evolutionary processes. First, by stabilizing duplicate genes in the genome, the DDC process extends the time period during which genes are exposed to natural selection, thereby enhancing the chance that rare beneficial mutations to novel functions may arise (as compared to the situation under the classical model, where a gene is removed from selection once it has become nonfunctionalized). Second, the partitioning of gene expression patterns by the DDC process may reduce the pleiotropic constraints operating on single-gene loci, thereby allowing natural selection to more closely tune the duplicate members of a pair to their specific subfunctions. Third, gene duplicates that have unresolved subfunctions at the time of a reproductive isolation event may provide a powerful mechanism for the development of reproductive incompatibility, i.e., speciation. The degeneration of orthologues in different ways in two sister taxa effectively causes a divergence in genetic maps (![]()
The purpose of this article is to evaluate the conditions under which duplicate-gene preservation by the DDC process is likely to be quantitatively significant. In our earlier work (![]()
| BACKGROUND |
|---|
We initially focus on mutations that cause complete loss of some or all aspects of gene expression, returning later to consider the consequences of mutations with smaller effects. Here we have in mind events such as large insertions or deletions in regulatory regions or frameshift mutations in coding regions. An idealized scenario is laid out in Fig 1, which considers a gene consisting of a coding region and two independently mutable regulatory regions (one for each of two subfunctions). Each regulatory region incurs a knockout mutation for a specific subfunction with rate µr per gene per generation, whereas the coding region mutates to a completely nonfunctional allele at rate µc. The actual model pursued below is more general than the spatial arrangement implied in the figure, because µr is really just the rate at which single subfunctions are eliminated by mutation and µc is the rate of origin of complete loss-of-function alleles (mutations that simultaneously eliminate all subfunctions). The relative values of these two mutation rates may be only weakly correlated with the amount of DNA associated with coding and regulatory sequences. For example, long stretches of a protein can sometimes be removed with little effect on gene function, and the modular molecular architecture of some proteins results in tissue-specific effects of coding-region mutations (![]()
|
Because we are primarily interested in the extent to which duplicate-gene preservation can be understood in terms of degenerative mutation, we ignore rare beneficial mutations to new functions. Under this assumption, a pair of duplicate genes will then ultimately succumbto one of two possible fates: (1) nonfunctionalization occurs when one of the two loci experiences fixation of a null (complete loss-of-function) allele, either by the loss of the coding region or by the sequential loss of all of the subfunctions from one gene copy and (2) subfunctionalization occurs when the two duplicate loci become fixed for complementary loss-of-subfunction mutations, thereby resulting in their reciprocal preservation. The probabilities of these alternative outcomes, which sum to one, are denoted Pn and Ps, where n stands for nonfunctionalization and s for subfunctionalization.
The development of a general analytical model to predict Ps presents formidable technical challenges. For example, even when there are only two subfunctions for a gene, there are four possible classes of alleles: fully functional alleles, mutant alleles retaining only the first subfunction or only the second subfunction, and null alleles (Fig 1). There are then 10 genotypes at each locus and 100 two-locus genotypes. With three subfunctions in the original gene, there are eight classes of alleles: fully functional alleles with all three subfunctions intact, three types with only two subfunctions intact, three types with a single subfunction intact, and null alleles. More generally, with n subfunctions, there are 2n classes of alleles and [2n-1(2n + 1)]2 two-locus genotypes.
Because of the two-locus, multiallelic nature of the DDC process, most of our analyses have relied on computer simulations. In our initial studies, all such work was performed with individual-based simulations, wherein each offspring genotype was produced by randomly drawing a pair of parents, obtaining the gametes by random segregation and recombination, and imposing mutations stochastically according to the Poisson distribution. Because these types of simulations can be very time-consuming for large population sizes, for the case of two subfunctions per gene, we ultimately settled on an alternative approach that simply kept track of genotype frequencies. With this approach, an effectively infinite gamete pool was assumed, so that conditional on the parental genotype frequencies, recombination and mutation could be treated as deterministic processes in the production of the gamete pool for the next generation. The gamete frequencies were then used to derive the expected genotype frequencies after random mating and selection. Using these expectations, the actual genotype frequencies were obtained by sequential binomial sampling. This second approach gave results that were indistinguishable from those obtained with the individual-based model.
All of the results reported below assume a constant population size (N) and a monoecious mating system, and in most cases (all cases in which N
104), 1000 simulations were performed for each set of parameters. Throughout, we assumed a loss-of-function mutation rate of µc = 10-5 per allele per generation.
| THE DOUBLE-NULL RECESSIVE MODEL |
|---|
The classical model:
The classical model can be viewed as a special case of the DDC model in which there is only a single function per gene, and its analysis provides a useful basis for comparison with genes with multiple subfunctions. Most of the work on the classical model of gene duplication has considered the situation in which mutant alleles are completely recessive. Under the double-null recessive model, all individuals with at least one functional gene are fully viable, whereas homozygotes for null alleles at both loci have zero fitness. In this case, there is nothing to prevent the ultimate loss of the active allele at one locus in a finite population, so the only issue is the length of time for this to occur. Although a number of studies have attempted to answer this question (![]()
![]()
![]()
![]()
![]()
![]() |
(1) |
where
= 4Nµc and µc is the rate of mutation to null alleles.
Because the behavior of Watterson's formula has never been thoroughly examined, we compared its performance with results generated by computer simulation. As can be seen in Fig 2, Equation 1 yields predictions that are in remarkably good agreement with simulated data for freely recombining loci. For µcN < 0.1, the mean time to nonfunctionalization is slightly greater than 1/(2µc) generations, but for µcN > 1,
n is prolonged to roughly 10N generations as selection becomes more efficient. The mean time to nonfunctionalization for completely linked gene duplicates is essentially the same as that for unlinked loci when µcN < 0.1, but is reduced when µcN is larger (Fig 2). Thus, under the classical double-null recessive model, for populations with effective sizes less than 1/µc, we expect silencing of one member of a duplicate pair to occur in less than a million generations or so, whereas extremely large populations may harbor active pairs of gene duplicates for tens of millions of generations.
|
Two subfunctions:
As noted above, when a gene duplicate has independently mutable subfunctions, the possibility exists that the two members of the pair may experience fixation of complementary loss-of-subfunction mutations, leading to gene preservation rather than gene loss. For sufficiently small population sizes, we may expect fixation events to occur at the two loci in a nonoverlapping manner and double-null homozygotes to be rare enough that selection is essentially inoperable. The evolutionary fate of the duplicate pair can then be approximated by a neutral model, which simply tallies the alternative series of mutational events that can occur at the two loci (![]()
Consider the situation in which each subfunction is subject to mutational loss at the rate µr, and let µc be the rate at which complete nonfunctionalizing mutations arise. The mutation rate for a completely functional gene is then µc + 2µr per gene copy. Under the assumption of effective neutrality, the rate of fixation of a mutation at a locus is equal to the genic mutation rate (![]()
![]() |
(2) |
The results from computer simulations, for both linked and unlinked loci, demonstrate that the probability of subfunctionalization (Ps) is essentially independent of population size and adequately approximated by Equation 2 provided N(µc + µr) < 0.1 (Fig 3). At larger N(µc + µr), Ps for linked duplicates can greatly exceed that for unlinked loci. However, the range of values of N(µc + µr) for which this is true is fairly restrictive, as the probability of subfunctionalization drops off fairly rapidly beyond the point at which N(µc + µr)
0.1. Even for completely linked genes, there is essentially no chance of preservation of gene duplicates under the double-null recessive model when N(µc + µr) > 10.
|
Why does the probability of preservation of duplicate genes by subfunctionalization decline at high N(µc + µr)? Even where it can be reasonably assumed that selection is negligible in determining the fates of gene duplicates, because the average time to fixation of a neutral gene is ~4N generations (![]()
Obtaining an approximation for Ps that incorporates secondary mutations is fairly straightforward for the case in which a gene has only two subfunctions, because in this instance all secondary mutations lead to gene silencing. Letting c be the probability that all descendants of a subfunctionalized allele destined to fixation are rendered nonfunctional by the time the lineage fixes, then from the arguments given above, the probability that the first fixation event involves the loss of one subfunction from one of the copies is reduced to 2µr(1 - c)/(µc + 2µr). Given that this occurs, there are two possible outcomes in the next stagethe coding region or the alternative subfunction of the partially degraded first copy may be knocked out (at rate µc+ µr), thereby leading to nonfunctionalization, or the second copy may lose the subfunction that remains intact in the first copy [at rate µr(1 - c)], thereby resulting in subfunctionalization. Thus, a more general expression for the probability of subfunctionalization for genes with two subfunctions is
![]() |
(3) |
which reduces to Equation 2 when c = 0.
In the Appendix, we derive an approximate expression for the probability of mutational conversion (c) for genes with two subfunctions using a gene genealogical approach and known properties of the coalescent for neutral genes. A useful property of the resulting theory is that c depends simply on the product N(µc + µr) (Fig 4). For N(µc + µr) < 0.01, the probability of mutational conversion is essentially zero, whereas for N(µc + µr) > 5, virtually all subfunctionalized alleles that are destined for fixation are expected to be converted to nonfunctional alleles in transit. The decline in Ps observed with increasing N is in rough accord with the predictions of Equation 3 for unlinked genes (Fig 3). Equation 3 tends to overestimate Ps in the region of 0.1 > N(µc + µr) > 1, perhaps because selection plays a small role in this region. However, the neutral theory does provide a fairly good indication of the population size beyond which the likelihood of gene preservation by subfunctionalization is vanishingly small.
|
As in the case of the classical model, the mean time to resolution (either by nonfunctionalization or subfunctionalization) under the DDC model is generally on the order of 1/(2µc) when Nµc < 0.1, and is more on the order of 10N generations for larger Nµc (Fig 5). Although the mean time to resolution depends somewhat on the ratio µr/µc when N is small, this dependence is not strong, and for unlinked loci Equation 1 provides a good qualitative approximation for the full range of N. Given the previous results with the classical model, the agreement with the theoretical expectation is expected to be very good when N is large, because in this case the outcome is identical to that under the classical modelall pairs of gene duplicates are resolved by nonfunctionalization of one member of the pair.
|
Additional subfunctions:
Under the DDC model, gene preservation is expected to increase with the number of independently mutable subfunctions, because this increases the number of pathways by which complementation can occur. For example, with three subfunctions, gene preservation can occur in two steps by three different pathwayswith any one of the three subfunctions first being eliminated from one copy, followed by loss of a different subfunction from the second copy; or in three steps by three different pathwayswith any two of the three subfunctions first being eliminated from one copy, followed by loss of the third subfunction from the second copy.
Under the assumption of effective neutrality, we have derived the generalization of Equation 2 for genes with an arbitrary number (z) of subfunctions (![]()
![]() |
(4) |
and the total probability of gene preservation by subfunctionalization is then obtained by summing this quantity over i = 2 to z,
![]() |
(5) |
As can be seen in Fig 6, this approximation works very well when N(µc + zµr) < 0.01 for all z. With even a moderate number of independently mutable subfunctions, Ps can become quite high when populations are small to moderate in size. For example, with
= 1, Ps at low N asymptotically approaches 0.22 with two subfunctions, 0.40 with three subfunctions, and 0.61 with five subfunctions (Fig 6). For population sizes in the range 0.1 > N(µc + µr) > 1, Ps can actually exceed the neutral expectation when z is large, presumably because the efficiency of selection against nonfunctional alleles is increased when large numbers of subfunctionalized alleles are segregating. Eventually, however, regardless of the number of subfunctions, Ps declines to zero as N becomes very large and mutational conversion prevents the fixation of subfunctionalized alleles.
|
Some insight into the upper limit to the probability of subfunctionalization can be obtained by treating the ratio of the total subfunctionalization mutation rate to the nonfunctionalization rate, r =
, as a constant, and evaluating the limit of Equation 5 as z
, i.e., as the number of targets for subfunctionalization becomes effectively infinite,
![]() |
(6) |
Thus, if r = 0.5, the probability of subfunctionalization under the double-null recessive model is no greater than 0.11, whereas the upper limit to Ps is 0.25 with r = 1 and 0.44 with r = 2. For r << 1, Ps
r2.
| PARTIAL DOMINANCE |
|---|
In the previous section, we focused on the situation in which null alleles are completely recessive with respect to fitness. We now examine the situation in which two (rather than the previous one) active alleles for each subfunction are required, with individuals having one or zero active alleles for any subfunction being inviable. Such a condition is often referred to as haplo-insufficiency, although we allow the two alleles to be present at either locus. Under this model, heterozygotes at a particular locus are selectively eliminated whenever they appear on a background of a null homozygote at the alternative locus, so one would expect the time to resolution of the fates of gene duplicates to be extended. This, in fact, is observed, although the effect is not large (data not shown). Of greater interest is the relative insensitivity of the probability of subfunctionalization to dosage requirements. Except for the narrow range of population sizes in which Ps rapidly declines to zero, Ps is essentially the same under both the double-null recessive and haplo-insufficiency models (Fig 7).
|
| PARTIAL LOSS-OF-FUNCTION OR PARTIAL LOSS-OF-SUBFUNCTION MUTATIONS |
|---|
In all of the preceding analyses, we assumed that mutations completely eliminate all activity of a single subfunction or of both subfunctions. We now show that this extreme assumption leads to minimum predicted levels of duplicate-gene preservation, confining our attention to population sizes that are small enough to fulfill the assumptions of effective neutrality. When mutations cause only a partial reduction in function, duplicate genes can be preserved by quantitative complementation (![]()
![]()
Assuming an additive model of gene action between loci and letting s be the number of degradational steps between full and no function, the probability of duplicate-gene preservation is equivalent to the probability that both copies will experience partial loss-of-function prior to the occurrence of complete loss-of-function from either copy,
![]() |
(7a) |
where µp is the rate of occurrence of partial loss-of-function mutations, and µc is the rate of occurrence of complete loss-of-function mutations. The upper limit to Ps approached as s
is
![]() |
(7b) |
and this is closely approximated when s > 5 (Fig 8). Provided µp/µc > 1, which seems likely, the probability of duplicate-gene preservation by partial loss-of-function is substantial, even when s is as small as 2 (Fig 8). For µp/µc > 10, Ps is closely approximated by 1 - (1/2)s-1.
|
We next consider the case in which mutations to two independently mutable subfunctions partially reduce activity, whereas those to the coding region completely eliminate function. Under the model considered above in Equation 2 and Equation 3, there was only one path to gene preservation by subfunctionalization (the complete loss of one subfunction from one copy, followed by the complete loss of the second subfunction from the second copy). However, even with mutations that reduce subfunction by 50%, there are six different paths to duplicate-gene preservation (Fig 9). The probability of each path is obtained by multiplying the chain of relevant transition probabilities, and the total probability of duplicate-gene preservation for this s = 2 case is simply the sum of the probabilities of the six paths,
![]() |
(8a) |
where r0 =
, r1 =
, and r2 =
. For this model with s = 3,
![]() |
(8b) |
and the asymptotic limit as s
is
![]() |
(8c) |
|
For this case of two subfunctions, the probability of gene preservation can be increased as much as twofold when mutations have partial degenerative effects, and almost all of the increase is realized when two, rather than one, mutations are required for the complete loss of a subfunction (Fig 10). The expectations for s = 3 are not very different from those for s =
.
|
A general approximation for an arbitrary number of subfunctions (z) and an arbitrary number of steps to complete silencing (s) is given by
![]() |
(9) |
where P's is given by Equation 2 and Equation 5. The denominator of the fraction on the left is the asymptotic value of Ps that is approached with high µr/µc when subfunction-specific mutations lead to complete loss (s = 1), whereas the quantity in the numerator is the asymptotic value for arbitrary s. Thus, contrary to the situation with z = 2, where Ps can be increased by as much as 50% with partial loss-of-subfunction, with z = 3 the maximum increase is 33%, and with z = 5 it is only 7% (Fig 10). An intuitive explanation for this behavior is that when the number of subfunctions is even moderately high, almost all cases of complete gene silencing are a consequence of coding-region nulls (rather than of multiple eliminations of subfunctions), and further increasing the number of paths to subfunctionalization by increasing s does not appreciably change the situation.
The examples presented above cover only some limiting situations, with more complex scenarios leading to even higher rates of duplicate-gene preservation. For example, as noted above (Fig 8), if an appreciable fraction of mutations arising in the coding region (or more generally, mutations that jointly influence all subfunctions) lead to partial, rather than full, loss of expression, then the probability of gene preservation at low µr/µc will not be zero. Instead, it will asymptote at values close to those illustrated in Fig 8 for various ratios of partial to complete loss-of-function mutation rates in the coding region. These results indicate that when the average effects of mutations influencing both complete function and individual subfunctions are high, the probability of duplicate-gene preservation by degenerative mutations can be very substantial.
| DISCUSSION |
|---|
Under the classical model of gene duplication, nonfunctionalization of one member of the pair by degenerative mutation has generally been viewed as inevitable unless the fixation of a silencing mutation is preceded by a mutation to a novel beneficial function. However, there now appear to be several plausible mechanisms for the preservation of duplicate genes (![]()
![]()
![]()
![]()
![]()
The results presented above help clarify the conditions under which duplicate-gene preservation by subfunctionalization is likely to be quantitatively significant. First, subfunctionalization is most likely to occur when the effective population size and the coding null mutation rate is low enough that a new loss-of-function mutation arises in the population every five generations or less, i.e., roughly speaking, when N µc < 0.1. Moreover, provided these conditions are met, the probability of gene preservation by subfunctionalization can be fairly accurately predicted by use of a model that largely ignores the details of selection. The apparent reason for this behavior is that when the number of mutations arising per generation is small, the very low frequency of individuals with degenerative mutations at both loci results in dynamics of gene frequency change that are largely a consequence of random genetic drift and the relative incidence of different mutational types, i.e., the evolutionary fates of mutations at each locus are essentially independent of the alleles segregating at the other locus. It is also for this reason that the results for the double-null recessive and haplo-insufficiency models are essentially identical for Nµc < 0.1. For vertebrates, the condition Nµc < 0.1 is probably not uncommon when one considers that µc is on the order of 10-610-5, that the long-term effective size of a population is on the order of the minimum annual effective size, and that the effective size of a population is often on the order of one-tenth to one-third of the actual number of breeding adults.
On the other hand, the subfunctionalization process appears to be an unlikely mechanism of duplicate-gene preservation when N(µr + µc) > 10. With very large population sizes, selection begins to play a more significant role, so the probability of fixation of any mutant allele is expected to be diminished and the time to fixation to be magnified. Based on the observations for the classical model, under which fixation of a null allele at one locus must ultimately occur, the average time until one locus becomes silenced is on the order of 10N generations when N is large (Fig 2), and even for neutral alleles the time to fixation is approximately 4N generations. Thus, when N >> 1/(µr + µc) , any mutant allele that is destined to fixation is likely to acquire secondary mutations in transit, and once the population size becomes very large, mutational conversion of subfunctionalized to nonfunctionalized alleles eliminates any possibility of subfunctionalization being the ultimate fate of duplicate genes. Roughly speaking, N(µr + µc) > 10 implies an effective population size on the order of 106 to 107 or greater, so population-size conditions that completely thwart the subfunctionalization process are not necessarily common. In contrast to the situation with subfunctionalization, neofunctionalization or positive selection for redundancy appears to be ineffective at preserving gene duplicates at small population sizes, only becoming plausible explanations at effective population sizes on the order of 106 or greater (![]()
![]()
![]()
![]()
Second, our results suggest that provided N(µr + µc) < 0.1, the degree of linkage between two gene duplicates plays a negligible role in their ultimate fate, i.e., the probability of preservation for tandem duplicates is essentially the same as it is for duplicates carried on different chromosomes. Such behavior is expected under these conditions because mutant alleles are rare enough that they have essentially no influence on the dynamics of gene frequency at the opposite locus. On the other hand, when 0.1 < N(µr + µc) < 10, a completely linked pair of duplicates has a higher probability of preservation by the DDC process than an unlinked pair. This elevated probability of subfunctionalization of linked vs. unlinked duplicate genes at intermediate population sizes is very likely a consequence of the Hill-Robertson effect, whereby linked deleterious genes interfere with each other's selective elimination (![]()
![]()
Third, genes with as few as three to five independently mutable subfunctions are expected to have greatly elevated probabilities of gene preservation by the DDC process relative to the case in which there are only two subfunctions. Part of the reason for this behavior is simply that a greater number of independent regulatory-region targets lead to a greater number of paths by which a gene pair can be subfunctionalized. As shown by the limits of subfunctionalization (Equation 6, Equation 7b, and Equation 8c), the relative rate of subfunctionalizing mutations is far more important than the actual number of subfunctions. However, with a larger number of independently mutable subfunctions, the probability of mutational conversion of a subfunctionalized allele to a complete null is also reduced. Thus, as we have pointed out earlier (![]()
Fourth, whereas most of our results were obtained under the assumption that all mutations completely eliminate a function or subfunction, it is clear that mutations with smaller effects (even those that still cause as much as 50% loss of activity) can lead to a substantial increase in the probability of duplicate-gene preservation. As also pointed out by ![]()
![]()
Fifth, we note that as the number of independently mutable subfunctions (z) and/or the number of degradational steps (s) becomes even moderately large (four or greater), the limit to the probability of subfunctionalization closely approaches the same form. As can be seen by comparing Equation 6, Equation 7b, and Equation 8c, the upper limit to Ps is in all cases equal to the square of the fraction of mutations that have partial effects. Thus, a key determinant of the role of degenerative mutations in the preservation of duplicate genes is the frequency of mutations with partial effects relative to that of complete nulls. Insight into this parameter should be achievable through mutation screens of alleles with known expression patterns and the analysis of interactions between them.
Finally, we note that we have couched all of the theory in this article in terms of loss-of-function/subfunction mutations. However, some evidence suggests that gain-of-function mutations may be quite common, perhaps as common as loss-of-function mutations (![]()
| ACKNOWLEDGMENTS |
|---|
We are very grateful to A. Clark and G. Gibson for helpful comments. This research has been supported by National Institutes of Health (NIH) grant RO1-GM36827 to M.L., and by graduate fellowships for A.F. funded by a National Science Foundation Training Grant in Genetic Mechanisms of Evolution and a NIH Training Grant in Developmental Biology.
Manuscript received June 19, 1999; Accepted for publication September 15, 1999.
| APPENDIX |
|---|
Consider a newly arisen mutant allele that is destined to become fixed in a population. In large populations, the time to fixation of an effectively neutral allele can be sufficiently long that all descendants of the single original mutant allele will have acquired secondary mutations by the time fixation of the entire lineage has occurred. Our concern here is with the probability that all descendants of a subfunctionalized allele destined to fixation have acquired nonfunctionalizing mutations by the time the gene pool consists entirely of descendants of the original subfunctionalized allele. To accomplish this, we take a gene genealogy approach, noting that the 2N genes present in a population at any point in time trace back to 2N - 1 ancestral copies at some previous point in time, then to 2N - 2 copies, etc., until the entire current population of genes coalesces to a single ancestral gene copy. Looking forward from the basal ancestral gene, the gene genealogy can be viewed as a series of bifurcations, with two branches emanating from the base of the genealogy, a third branch coming off one of these some time later, and so on.
Let n = 2 denote the first phase of the branching process, where there are only two independent branches, and n = 3 denote the next phase, in which a third branch joins the genealogy, etc. Conversion of all descendant genes in the lineage to a new mutational type occurs when independent mutations arise in the genealogy such that all descendants carry those mutations at the time of fixation of the lineage. For example, if both branches during period n = 2 independently acquire mutations, convergence has occurred because all subsequent descendants inherit those mutations. If only one of the two original branches acquires a mutation, conversion can still occur if, for example, the third branch in the genealogy coalesces with the branch initially containing the mutation (thereby gaining it by inheritance) and the nonmutant lineage acquires an independent mutation during n = 3. Mutational conversion can also occur if neither branch during period n = 2 acquires a mutation, but three or more mutations independently arise subsequently in appropriate positions in the gene genealogy.
To jointly account for the very large number of pathways by which mutational convergence can occur, we treat the branching process in the following way. Let
n(j) denote the probability that j of the n distinct terminal branches during interval n have acquired new mutations by the time phase n has elapsed, either by inheritance or by new mutation. The index j takes on values 0 to n, so
n(n) is equivalent to the probability that the lineage has been completely converted to the new mutational type by the end phase n. The quantity that we wish to estimate is c =
2N(2N), the ultimate probability that complete mutational conversion has occurred by the time the lineage has become fixed in the population. This quantity can be obtained recursively by defining a set of coefficients, un(x, y), which denote the probability that x of y distinct branches acquire new mutations during interval n.
It follows that the probability that no mutations have arisen in the gene genealogy by the end of phase n is
![]() |
(A1a) |
which is simply the product of the probabilities that the genealogy is mutation-free entering the interval and the probability that none of the n unique branches in that interval acquire new mutations. The probability that one branch contains a mutation at the end of phase n is
![]() |
(A1b) |
Here, the first term is the probability that the lineage was mutation-free in the previous interval and that a single branch acquired a new mutation in interval n, whereas the second term is the joint probability that one mutation was present in the previous interval, that the branch added in this interval coalesces with any branch from the previous interval other than that carrying the mutation, and that no new mutation occurred in the current interval. The probabilities that two or more branches contain mutations at the end of phase n can be expressed by the general formula
![]() |
(A1c) |
where 2
j
n. For each of the pairs of terms after the summation, the first member denotes the joint probability that the new branch added to the genealogy in interval n coalesces with a mutation-containing branch and that the necessary number of independent mutations occurs in mutation-free branches to yield j mutation-containing branches at the end of interval n. The second member of each pair is the alternative joint probability that the new branch coalesces with a mutation-free branch and that the necessary number of independent mutations occurs in it and/or other mutation-free branches to yield j mutation-containing branches at the end of interval n. The entire recursion initiates with
2(0) = u2(0, n),
2(1) = u2(1, n), and
2(2) = u2(2, n).
To complete this exercise, we require expressions for coefficients of the form un(x, y). Denoting tn as the length of interval n in units of time, then un(x, y) is simply the probability that during this interval x of y independent branches acquire a new mutation while y - x do not. Letting µ denote the mutation rate, then assuming mutations are Poisson distributed, the joint probability of interest is
![]() |
(A2) |
Further progress requires that we know something about the time interval tn, but since we are dealing with effectively neutral mutations, this is straightforward, because tn is simply the coalescence time during interval n. It is well known that under the Wright-Fisher model, coalescence times are distributed exponentially with expectation
![]() |
(A3) |
(![]()
![]()
![]()
![]() |
(A4) |
| LITERATURE CITED |
|---|
ALLENDORF, F. W., F. M. UTTER and B. P. MAY, 1975 Gene duplication within the family Salmonidae: II. Detection and determination of the genetic control of duplicate loci through inheritance studies and the examination of populations, pp. 415432 in Isozymes, Vol. IV: Genetics and Evolution, edited by C. L. MARKERT. Academic Press, New York.
AMORES, A., A. FORCE, Y.-L. YAN, L. JOLY, and C. AMEMIYA et al., 1998 Zebrafish hox clusters and vertebrate genome evolution. Science 282:1711-1714
ARNONE, M. I. and E. H. DAVIDSON, 1997 The hardwiring of development: organization and function of genomic regulatory sequences. Development 124:1851-1864[Abstract].
BAILEY, G. S., R. T. M. POULTER, and P. A. STOCKWELL, 1978 Gene duplication in tetraploid fish: model for gene silencing at unlinked duplicated loci. Proc. Natl. Acad. Sci. USA 75:5575-5579
BIRKY, C. W., JR. and J. B. WALSH, 1988 Effects of linkage on rates of molecular evolution. Proc. Natl. Acad. Sci. USA 85:6414-6418
BROOKFIELD, J. F. Y., 1997 Genetic redundancy. Adv. Genet. 36:137-155[Medline].
CHRISTIANSEN, F. B. and O. FRYDENBERG, 1977 Selection-mutation balance for two nonallelic recessives producing an inferior double homozygote. Am. J. Hum. Genet. 29:195-207[Medline].
CLARK, A. G., 1994 Invasion and maintenance of a gene duplication. Proc. Natl. Acad. Sci. USA 91:2950-2954
CLARK, A. G., L. WANG, and T. HULLEBERB, 1995 Spontaneous mutation rate of modifiers of metabolism in Drosophila.. Genetics 139:767-779[Abstract].
DILEONE, R. J., L. B. RUSSELL, and D. M. KINGSLEY, 1998 An extensive 3' regulatory region controls expression of Bmp5 in specific anatomical structures of the mouse embryo. Genetics 148:401-408
DONNELLY, P. and S. TAVARE, 1995 Coalescents and genealogical structure under neutrality. Annu. Rev. Genet. 29:401-421[Medline].
FERRIS, S. D. and G. S. WHITT, 1979 Evolution of the differential regulation of duplicate genes after polyploidization. J. Mol. Evol. 12:267-317[Medline].
FISHER, R. A., 1935 The sheltering of lethals. Am. Nat. 69:446-455.
FORCE, A., M. LYNCH, F. B. PICKETT, A. AMORES, and Y.-L. YAN et al., 1999 Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151:1531-1545
GERHART, J., and M. KIRSCHNER, 1997 Cells, Embryos, and Evolution. Blackwell Science, Malden, MA.
GRAF, J.-D., and H. R. KOBEL, 1991 Genetics, pp. 1934, in Methods in Cell Biology, Vol. 36, Xenopus laevis: Practical Uses in Cell and Molecular Biology, edited by B. K. KAY and H. B. PENG. Academic Press, New York.
HALDANE, J. B. S., 1933 The part played by recurrent mutation in evolution. Am. Nat. 67:5-9.
HENIKOFF, S., E. A. GREENE, S. PIETROKOVSKI, P. BORK, and T. K. ATTWOOD et al., 1997 Gene families: the taxonomy of protein paralogs and chimeras. Science 278:609-614
HILL, W. G. and A. ROBERTSON, 1966 The effect of linkage on limits to artificial selection. Genet. Res. 8:269-294[Medline].
HUANG, J. D., D. H. SCHWYTER, J. M. SHIROKAWA, and A. J. COUREY, 1993 The interplay between multiple enhancer and silencer elements defines the pattern of decapentaplegic expression. Genes Dev. 7:694-704
HUDSON, R. R., 1991 Gene genealogies and the coalescent process. Oxford Surv. Evol. Biol. 7:1-44.
HUGHES, M. K. and A. L. HUGHES, 1993 Evolution of duplicate genes in a tetraploid animal, Xenopus laevis.. Mol. Biol. Evol. 10:1360-1369[Abstract].
JACK, J. and Y. DELOTTO, 1995 Structure and regulation of a complex locus: the cut gene of Drosophila.. Genetics 139:1689-1700[Abstract].
KAMMANDEL, B., K. CHOWDHURY, A. STOYKOVA, S. APARICIO, and S. BRENNER et al., 1999 Distinct cis-essential modules direct the time-space pattern of the Pax6 gene activity. Dev. Biol. 205:79-97[Medline].
KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, United Kingdom.
KIMURA, M. and J. L. KING, 1979 Fixation of a deleterious allele at one of two "duplicate" loci by mutation pressure and random drift. Proc. Natl. Acad. Sci. USA 76:2858-2861
KIMURA, M. and T. OHTA, 1969 The average number of generations until fixation of a mutant gene in a finite population. Genetics 61:763-771
KINGMAN, J. F. C., 1982 The coalescent. Stoch. Proc. Appl. 13:235-248.
KIRCHHAMER, C. V., C.-H. YUH, and E. H. DAVIDSON, 1996 Modular cis-regulatory organization of developmentally expressed genes: two genes transcribed territorially in the sea urchin embryo, and additional examples. Proc. Natl. Acad. Sci. USA 93:9322-9328
LI, W.-H., 1980 Rate of gene silencing at duplicate loci: a theoretical study and interpretation of data from tetraploid fishes. Genetics 95:237-258
LI, W.-H., 1985 Accelerated evolution following gene duplication and its implication for the neutralist-selectionist controversy, pp. 333352, in Population Genetics and Molecular Evolution, edited by T. OHTA and K. AOKI. Springer-Verlag, Berlin.
LUNDIN, L., 1993 Evolution of the vertebrate genome as reflected in paralogous chromosomal regions in man and the house mouse. Genomics 16:1-9[Medline].
NADEAU, J. H. and D. SANKOFF, 1997 Comparable rates of gene loss and functional divergence after genome duplications early in vertebrate evolution. Genetics 147:1259-1266[Abstract].
NEI, M. and A. K. ROYCHOUDHURY, 1973 Probability of fixation of nonfunctional genes at duplicate loci. Am. Nat. 107:362-372.
NORMES, S., M. CLARKSON, I. MIKKOLA, M. PEDERSEN, and A. BARDSLEY et al., 1998 Zebrafish contains two Pax6 genes involved in eye development. Mech. Dev. 77:185-196[Medline].
NOWAK, M. A., M. C. BOERLIJST, J. COOKE, and J. M. SMITH, 1997 Evolution of genetic redundancy. Nature 388:167-170[Medline].
OHNO, S., 1970 Evolution by Gene Duplication. Springer-Verlag, Berlin.
OHTA, T., 1987 Simulating evolution by gene duplication. Genetics 115:207-213
OHTA, T., 1988 Time for acquiring a new gene by duplication. Proc. Natl. Acad. Sci. USA 85:3509-3512
POSTLETHWAIT, J. H., Y.-L. YAN, M. GATES, S. HORNE, and A. AMORES et al., 1998 Vertebrate genome evolution and the zebrafish gene map. Nat. Genet. 18:345-349[Medline].
RAMOS-ONSINS, S. and M. AGUADE, 1998 Molecular evolution of the Cecropin multigene family in Drosophila: functional genes vs. pseudogenes. Genetics 150:157-171
SIDOW, A., 1996 Gen(om)e duplications in the evolution of early vertebrates. Curr. Opin. Genet. Dev. 6:715-722[Medline].
SLUSARSKI, D. C., C. K. MOTSNY, and R. HOLMGREN, 1995 Mutations that alter the timing and pattern of cubitus interruptus gene expression in Drosophila melanogaster.. Genetics 139:229-240[Abstract].
STOLTZFUS, A., 1999 On the possibility of constructive neutral evolution. J. Mol. Biol. 49:169-181.
TAKAHATA, N. and T. MARUYAMA, 1979 Polymorphism and loss of duplicate gene expression: a theoretical study with application to tetraploid fish. Proc. Natl. Acad. Sci. USA 76:4521-4525
WAGNER, A., 1999 Redundant gene functions and natural selection. J. Evol. Biol. 12:1-16.
WALSH, J. B., 1995 How often do duplicated genes evolve new functions? Genetics 110:345-364
WATTERSON, G. A., 1983 On the time for gene silencing at duplicate loci. Genetics 105:745-766
WENDEL, J. F., 1999 Genome evolution in polyploids. Plant Mol. Biol. in press.
WESTIN, J. and M. LARDELLI, 1997 Three novel notch genes in zebrafish: implications for vertebrate Notch gene evolution and function. Dev. Genes Evol. 207:51-63.
XU, P.-X., X. ZHANG, S. HEANEY, A. YOON, and A. M. MICHELSON et al., 1999 Regulation of Pax6 expression is conserved between mice and flies. Development 126:383-395[Abstract].
This article has been cited by other articles:
![]() |
J. Zielonka, I. G. Bravo, D. Marino, E. Conrad, M. Perkovic, M. Battenberg, K. Cichutek, and C. Munk Restriction of Equine Infectious Anemia Virus by Equine APOBEC3 Cytidine Deaminases J. Virol., August 1, 2009; 83(15): 7547 - 7559. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Chaudhary, L. Flagel, R. M. Stupar, J. A. Udall, N. Verma, N. M. Springer, and J. F. Wendel Reciprocal Silencing, Transcriptional Bias and Functional Divergence of Homeologs in Polyploid Cotton (Gossypium) Genetics, June 1, 2009; 182(2): 503 - 517. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Bao and M. Friedrich Molecular Evolution of the Drosophila Retinome: Exceptional Gene Gain in the Higher Diptera Mol. Biol. Evol., June 1, 2009; 26(6): 1273 - 1287. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Cutter, A. Dey, and R. L. Murray Evolution of the Caenorhabditis elegans Genome Mol. Biol. Evol., June 1, 2009; 26(6): 1199 - 1234. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. G. Howarth and M. J. Donoghue Duplications and Expression of DIVARICATA-Like Genes in Dipsacales Mol. Biol. Evol., June 1, 2009; 26(6): 1245 - 1258. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Weadick and B. S.W. Chang Molecular Evolution of the {beta}{gamma} Lens Crystallin Superfamily: Evidence for a Retained Ancestral Function in {gamma}N Crystallins? Mol. Biol. Evol., May 1, 2009; 26(5): 1127 - 1142. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. V. Koonin Darwinian evolution in the light of genomics Nucleic Acids Res., March 1, 2009; 37(4): 1011 - 1034. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ha, E.-D. Kim, and Z. J. Chen Duplicate genes increase expression diversity in closely related species and allopolyploids PNAS, February 17, 2009; 106(7): 2295 - 2300. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Catania, F. Wurmser, A. A. Potekhin, E. Przybos, and M. Lynch Genetic Diversity in the Paramecium aurelia Species Complex Mol. Biol. Evol., February 1, 2009; 26(2): 421 - 431. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Siefers, K. K. Dang, R. W. Kumimoto, W. E. Bynum IV, G. Tayrose, and B. F. Holt III Tissue-Specific Expression Patterns of Arabidopsis NF-Y Transcription Factors Suggest Potential for Extensive Combinatorial Complexity Plant Physiology, February 1, 2009; 149(2): 625 - 641. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. E. Soltis, V. A. Albert, J. Leebens-Mack, C. D. Bell, A. H. Paterson, C. Zheng, D. Sankoff, C. W. dePamphilis, P. K. Wall, and P. S. Soltis Polyploidy and angiosperm diversification Am. J. Botany, January 1, 2009; 96(1): 336 - 348. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Rogers, T. Bedford, and D. L. Hartl Formation and Longevity of Chimeric and Duplicate Genes in Drosophila melanogaster Genetics, January 1, 2009; 181(1): 313 - 322. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Greenberg, S. R. Stockwell, and A. G. Clark Evolutionary Constraint and Adaptation in the Metabolic Network of Drosophila Mol. Biol. Evol., December 1, 2008; 25(12): 2537 - 2546. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Hughes and R. Friedman Genome Size Reduction in the Chicken Has Involved Massive Loss of Ancestral Protein-Coding Genes Mol. Biol. Evol., December 1, 2008; 25(12): 2681 - 2688. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. W. Innes, C. Ameline-Torregrosa, T. Ashfield, E. Cannon, S. B. Cannon, B. Chacko, N. W.G. Chen, A. Couloux, A. Dalwani, R. Denny, et al. Differential Accumulation of Retroelements and Diversification of NB-LRR Disease Resistance Genes in Duplicated Regions following Polyploidy in the Ancestor of Soybean Plant Physiology, December 1, 2008; 148(4): 1740 - 1759. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Lyons, B. Pedersen, J. Kane, M. Alam, R. Ming, H. Tang, X. Wang, J. Bowers, A. Paterson, D. Lisch, et al. Finding and Comparing Syntenic Regions among Arabidopsis and the Outgroups Papaya, Poplar, and Grape: CoGe with Rosids Plant Physiology, December 1, 2008; 148(4): 1772 - 1781. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Fein, M. A. Wright, E. A. Slat, A. B. Ribera, and L. L. Isom scn1bb, a Zebrafish Ortholog of SCN1B Expressed in Excitable and Nonexcitable Cells, Affects Motor Neuron Axon Morphology and Touch Sensitivity J. Neurosci., November 19, 2008; 28(47): 12510 - 12522. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Jarinova, G. Hatch, L. Poitras, C. Prudhomme, M. Grzyb, J. Aubin, F.-A. Berube-Simard, L. Jeannotte, and M. Ekker Functional resolution of duplicated hoxb5 genes in teleosts Development, November 1, 2008; 135(21): 3543 - 3553. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bershtein and D. S. Tawfik Ohno's Model Revisited: Measuring the Frequency of Potentially Adaptive Mutations under Various Mutational Drifts Mol. Biol. Evol., November 1, 2008; 25(11): 2311 - 2318. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. E. Janes, C. Organ, and N. Valenzuela New resources inform study of genome size, content, and organization in nonavian reptiles Integr. Comp. Biol., October 1, 2008; 48(4): 447 - 453. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Liu, A. Kanazawa, H. Matsumura, R. Takahashi, K. Harada, and J. Abe Genetic Redundancy in Soybean Photoresponses Associated With Duplication of the Phytochrome A Gene Genetics, October 1, 2008; 180(2): 995 - 1007. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Dorus, Z. N. Freeman, E. R. Parker, B. D. Heath, and T. L. Karr Recent Origins of Sperm Genes in Drosophila Mol. Biol. Evol., October 1, 2008; 25(10): 2157 - 2166. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Hanada, C. Zou, M. D. Lehti-Shiu, K. Shinozaki, and S.-H. Shiu Importance of Lineage-Specific Expansion of Plant Tandem Duplicates in the Adaptive Response to Environmental Stimuli Plant Physiology, October 1, 2008; 148(2): 993 - 1003. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J Hegarty, G. L Barker, A. C Brennan, K. J Edwards, R. J Abbott, and S. J Hiscock Changes to gene expression associated with hybrid speciation in plants: further insights from transcriptomic studies in Senecio Phil Trans R Soc B, September 27, 2008; 363(1506): 3055 - 3069. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu, W. K. Versaw, N. Pumplin, S. K. Gomez, L. A. Blaylock, and M. J. Harrison Closely Related Members of the Medicago truncatula PHT1 Phosphate Transporter Gene Family Encode Phosphate Transporters with Distinct Biochemical Activities J. Biol. Chem., September 5, 2008; 283(36): 24673 - 24681. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-H. Xu and J. Messing Diverged Copies of the Seed Regulatory Opaque-2 Gene by a Segmental Duplication in the Progenitor Genome of Rice, Sorghum, and Maize Mol Plant, September 1, 2008; 1(5): 760 - 769. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Zhou, G. Zhang, Y. Zhang, S. Xu, R. Zhao, Z. Zhan, X. Li, Y. Ding, S. Yang, and W. Wang On the origin of new genes in Drosophila Genome Res., September 1, 2008; 18(9): 1446 - 1455. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. D. Rodgers and D. K. Garikipati Clinical, Agricultural, and Evolutionary Biology of Myostatin: A Comparative Review Endocr. Rev., August 1, 2008; 29(5): 513 - 534. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Qian and J. Zhang Gene Dosage and Gene Duplicability Genetics, August 1, 2008; 179(4): 2319 - 2324. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. C. Conant and K. H. Wolfe Probabilistic Cross-Species Inference of Orthologous Genomic Regions Created by Whole-Genome Duplication in Yeast Genetics, July 1, 2008; 179(3): 1681 - 1692. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Semon and K. H. Wolfe Preferential subfunctionalization of slow-evolving genes after allopolyploidization in Xenopus laevis PNAS, June 17, 2008; 105(24): 8333 - 8338. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Quezada, C. Aranda, A. DeLuna, H. Hernandez, M. L. Calcagno, A. Marin-Hernandez, and A. Gonzalez Specialization of the paralogue LYS21 determines lysine biosynthesis under respiratory metabolism in Saccharomyces cerevisiae Microbiology, June 1, 2008; 154(6): 1656 - 1667. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Sitaraman, M. Bui, and Z. Liu LEUNIG_HOMOLOG and LEUNIG Perform Partially Redundant Functions during Arabidopsis Embryo and Floral Development Plant Physiology, June 1, 2008; 147(2): 672 - 681. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Howarth, S. H. W. Law, B. Barnes, J. M. Hall, D. E. Hinton, L. Moore, J. M. Maglich, J. T. Moore, and S. W. Kullman Paralogous Vitamin D Receptors in Teleosts: Transition of Nuclear Receptor Function Endocrinology, May 1, 2008; 149(5): 2411 - 2422. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Wentzell and D. J. Kliebenstein Genotype, Age, Tissue, and Environment Regulate the Structural Outcome of Glucosinolate Activation Plant Physiology, May 1, 2008; 147(1): 415 - 428. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Hovav, J. A. Udall, B. Chaudhary, R. Rapp, L. Flagel, and J. F. Wendel Partitioned expression of duplicated genes during development and evolution of a single cell in a polyploid plant PNAS, April 22, 2008; 105(16): 6191 - 6195. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Beisswanger and W. Stephan Evidence that strong positive selection drives neofunctionalization in the tandemly duplicated polyhomeotic genes in Drosophila PNAS, April 8, 2008; 105(14): 5447 - 5452. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. Storz, F. G. Hoffmann, J. C. Opazo, and H. Moriyama Adaptive Functional Divergence Among Triplicated {alpha}-Globin Genes in Rodents Genetics, March 1, 2008; 178(3): 1623 - 1638. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Yuri, R. T. Kimball, E. L. Braun, and M. J. Braun Duplication of Accelerated Evolution and Growth Hormone Gene in Passerine Birds Mol. Biol. Evol., February 1, 2008; 25(2): 352 - 361. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. C. Tzika, R. Helaers, Y. Van de Peer, and M. C. Milinkovitch MANTIS: a phylogenetic framework for multi-species genome comparisons Bioinformatics, January 15, 2008; 24(2): 151 - 157. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sato-Maeda, M. Obinata, and W. Shoji Position fine-tuning of caudal primary motoneurons in the zebrafish spinal cord Development, January 15, 2008; 135(2): 323 - 332. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. R. Scannell and K. H. Wolfe A burst of protein sequence evolution and a prolonged period of asymmetric evolution follow gene duplication in yeast Genome Res., January 1, 2008; 18(1): 137 - 147. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Bielen, S. Oberleitner, S. Marcellini, L. Gee, P. Lemaire, H. R. Bode, R. Rupp, and U. Technau Divergent functions of two ancient Hydra Brachyury paralogues suggest specific roles for their C-terminal domains in tissue fate induction Development, December 1, 2007; 134(23): 4187 - 4197. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Cornell, I. Alam, D. M. Soanes, H. M. Wong, C. Hedeler, N. W. Paton, M. Rattray, S. J. Hubbard, N. J. Talbot, and S. G. Oliver Comparative genome analysis across a kingdom of eukaryotic organisms: Specialization and diversification in the Fungi Genome Res., December 1, 2007; 17(12): 1809 - 1822. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Zhang and N. A. Rosenberg On the Genealogy of a Duplicated Microsatellite Genetics, December 1, 2007; 177(4): 2109 - 2122. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Blanco, A. Barrallo-Gimeno, H. Acloque, A. E. Reyes, M. Tada, M. L. Allende, R. Mayor, and M. A. Nieto Snail1a and Snail1b cooperate in the anterior migration of the axial mesendoderm in the zebrafish embryo Development, November 15, 2007; 134(22): 4073 - 4081. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Nardmann, R. Zimmermann, D. Durantini, E. Kranz, and W. Werr WOX Gene Phylogeny in Poaceae: A Comparative Approach Addressing Leaf and Embryo Development Mol. Biol. Evol., November 1, 2007; 24(11): 2474 - 2484. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Johnson and M. A. Thomas The Monosaccharide Transporter Gene Family in Arabidopsis and Rice: A History of Duplications, Adaptive Evolution, and Functional Divergence Mol. Biol. Evol., November 1, 2007; 24(11): 2412 - 2423. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Bergthorsson, D. I. Andersson, and J. R. Roth Ohno's dilemma: Evolution of new genes under continuous selection PNAS, October 23, 2007; 104(43): 17004 - 17009. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Galis and J. A. J. Metz Evolutionary novelties: the making and breaking of pleiotropic constraints Integr. Comp. Biol., September 1, 2007; 47(3): 409 - 419. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Gitelman Evolution of the vertebrate twist family and synfunctionalization: a mechanism for differential gene loss through merging of expression domains Mol. Biol. Evol., September 1, 2007; 24(9): 1912 - 1925. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Rong, F. A. Feltus, V. N. Waghmare, G. J. Pierce, P. W. Chee, X. Draye, Y. Saranga, R. J. Wright, T. A. Wilkins, O. L. May, et al. Meta-analysis of Polyploid Cotton QTL Shows Unequal Contributions of Subgenomes to a Complex Network of Genes and Gene Clusters Implicated in Lint Fiber Development Genetics, August 1, 2007; 176(4): 2577 - 2588. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Goodstadt, A. Heger, C. Webber, and C. P. Ponting An analysis of the gene complement of a marsupial, Monodelphis domestica: Evolution of lineage-specific genes and giant chromosomes Genome Res., July 1, 2007; 17(7): 969 - 981. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. H. Gebhardt, S. Witte, H. Steuber, U. Matern, and S. Martens Evolution of Flavone Synthase I from Parsley Flavanone 3beta-Hydroxylase by Site-Directed Mutagenesis Plant Physiology, July 1, 2007; 144(3): 1442 - 1454. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. J. Evans Ancestry Influences the Fate of Duplicated Genes Millions of Years After Polyploidization of Clawed Frogs (Xenopus) Genetics, June 1, 2007; 176(2): 1119 - 1130. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Labbe, A. Berthomieu, C. Berticat, H. Alout, M. Raymond, T. Lenormand, and M. Weill Independent Duplications of the Acetylcholinesterase Gene Conferring Insecticide Resistance in the Mosquito Culex pipiens Mol. Biol. Evol., April 1, 2007; 24(4): 1056 - 1067. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Naghibalhossaini, A. D. Yoder, M. Tobi, and C. P. Stanners Evolution of a Tumorigenic Property Conferred by Glycophosphatidyl-Inositol Membrane Anchors of Carcinoembryonic Antigen Gene Family Members during the Primate Radiation Mol. Biol. Cell, April 1, 2007; 18(4): 1366 - 1374. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. X. Cunningham Jr., H. Lee, and E. Gantt Carotenoid Biosynthesis in the Primitive Red Alga Cyanidioschyzon merolae Eukaryot. Cell, March 1, 2007; 6(3): 533 - 545. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. L. Adams Evolution of Duplicate Gene Expression in Polyploid and Hybrid Plants J. Hered., March 1, 2007; 98(2): 136 - 141. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. P. Byrne and K. H. Wolfe Consistent Patterns of Rate Asymmetry and Gene Loss Indicate Widespread Neofunctionalization of Yeast Genes After Whole-Genome Duplication Genetics, March 1, 2007; 175(3): 1341 - 1350. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. T. O. Lee, F. K. Y. Siu, J. K. V. Tam, I. T. Y. Lau, A. O. L. Wong, M. C. M. Lin, H. Vaudry, and B. K. C. Chow Discovery of growth hormone-releasing hormones and receptors in nonmammalian vertebrates PNAS, February 13, 2007; 104(7): 2133 - 2138. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Guan, M. J. Dunham, and O. G. Troyanskaya Functional Analysis of Gene Duplications in Saccharomyces cerevisiae Genetics, February 1, 2007; 175(2): 933 - 943. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bezhani, C. Winter, S. Hershman, J. D. Wagner, J. F. Kennedy, C. S. Kwon, J. Pfluger, Y. Su, and D. Wagner Unique, Shared, and Redundant Roles for the Arabidopsis SWI/SNF Chromatin Remodeling ATPases BRAHMA and SPLAYED PLANT CELL, February 1, 2007; 19(2): 403 - 416. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. D. Akhunov, A. R. Akhunova, and J. Dvorak Mechanisms and Rates of Birth and Death of Dispersed Duplicated Genes during the Evolution of a Multigene Family in Diploid and Tetraploid Wheats Mol. Biol. Evol., February 1, 2007; 24(2): 539 - 550. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. E. Shakhnovich and E. V. Koonin Origins and impact of constraints in evolution of gene families Genome Res., December 1, 2006; 16(12): 1529 - 1536. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Nardmann and W. Werr The Shoot Stem Cell Niche in Angiosperms: Expression Patterns of WUS Orthologues in Rice and Maize Imply Major Modifications in the Course of Mono- and Dicot Evolution Mol. Biol. Evol., December 1, 2006; 23(12): 2492 - 2504. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Kuznetsova, M. Proudfoot, C. F. Gonzalez, G. Brown, M. V. Omelchenko, I. Borozan, L. Carmel, Y. I. Wolf, H. Mori, A. V. Savchenko, et al. Genome-wide Analysis of Substrate Specificities of the Escherichia coli Haloacid Dehalogenase-like Phosphatase Family J. Biol. Chem., November 24, 2006; 281(47): 36149 - 36161. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Gray, C. Mulligan, B. Molini, E. Sun, L Giacani, C Godornes, A Kitchen, S. Lukehart, and A Centurion-Lara Molecular Evolution of the tprC, D, I, K, G, and J Genes in the Pathogenic Genus Treponema Mol. Biol. Evol., November 1, 2006; 23(11): 2220 - 2233. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Udall and J. F. Wendel Polyploidy and Crop Improvement Crop Sci., November 1, 2006; 46(Supplement_1): S-3 - S-14. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Federico, F. L. Iniguez-Luy, R. W. Skadsen, and H. F. Kaeppler Spatial and Temporal Divergence of Expression in Duplicated Barley Germin-Like Protein-Encoding Genes Genetics, September 1, 2006; 174(1): 179 - 190. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Preston and E. A. Kellogg Reconstructing the Evolutionary History of Paralogous APETALA1/FRUITFULL-Like Genes in Grasses (Poaceae) Genetics, September 1, 2006; 174(1): 421 - 437. [Abstract] [Full Text] [PDF] |
||||
![]() |
A M Tarrant, S R Greytak, G V Callard, and M E Hahn Estrogen receptor-related receptors in the killifish Fundulus heteroclitus: diversity, expression, and estrogen responsiveness. J. Mol. Endocrinol., August 1, 2006; 37(1): 105 - 120. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Wang, K. Chong, and T. Wang Divergence in spatial expression patterns and in response to stimuli of tandem-repeat paralogues encoding a novel class of proline-rich proteins in Oryza sativa J. Exp. Bot., August 1, 2006; 57(11): 2887 - 2897. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. de Martino, I. Pan, E. Emmanuel, A. Levy, and V. F. Irish Functional Analyses of Two Tomato APETALA3 Genes Demonstrate Diversification in Their Roles in Regulating Floral Development PLANT CELL, August 1, 2006; 18(8): 1833 - 1845. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Freeling and B. C. Thomas Gene-balanced duplications, like tetraploidy, provide predictable drive to increase morphological complexity Genome Res., July 1, 2006; 16(7): 805 - 814. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. C. Thomas, B. Pedersen, and M. Freeling Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes Genome Res., July 1, 2006; 16(7): 934 - 946. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wang, L. Tian, H.-S. Lee, and Z. J. Chen Nonadditive Regulation of FRI and FLC Loci Mediates Flowering-Time Variation in Arabidopsis Allopolyploids Genetics, June 1, 2006; 173(2): 965 - 974. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Huntley, D. M. Baggott, A. T. Hamilton, M. Tran-Gyamfi, S. Yang, J. Kim, L. Gordon, E. Branscomb, and L. Stubbs A comprehensive catalog of human KRAB-associated zinc finger genes: Insights into the evolutionary history of a large family of transcriptional repressors Genome Res., May 1, 2006; 16(5): 669 - 677. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ezawa, S. OOta, and N. Saitou Genome-Wide Search of Gene Conversions in Duplicated Genes of Mouse and Rat Mol. Biol. Evol., May 1, 2006; 23(5): 927 - 940. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Katju and M. Lynch On the Formation of Novel Genes by Duplication in the Caenorhabditis elegans Genome Mol. Biol. Evol., May 1, 2006; 23(5): 1056 - 1067. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Su, J. Wang, J. Yu, X. Huang, and X. Gu Evolution of alternative splicing after gene duplication Genome Res., February 1, 2006; 16(2): 182 - 189. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Lynch The Origins of Eukaryotic Gene Structure Mol. Biol. Evol., February 1, 2006; 23(2): 450 - 468. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Duarte, L. Cui, P. K. Wall, Q. Zhang, X. Zhang, J. Leebens-Mack, H. Ma, N. Altman, and C. W. dePamphilis Expression Pattern Shifts Following Duplication Indicative of Subfunctionalization and Neofunctionalization in Regulatory Genes of Arabidopsis Mol. Biol. Evol., February 1, 2006; 23(2): 469 - 478. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. B Bonsall Longevity and ageing: appraising the evolutionary consequences of growing old Phil Trans R Soc B, January 29, 2006; 361(1465): 119 - 135. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wang, L. Tian, H.-S. Lee, N. E. Wei, H. Jiang, B. Watson, A. Madlung, T. C. Osborn, R. W. Doerge, L. Comai, et al. Genomewide Nonadditive Gene Regulation in Arabidopsis Allotetraploids Genetics, January 1, 2006; 172(1): 507 - 517. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. van Hoof Conserved Functions of Yeast Genes Support the Duplication, Degeneration and Complementation Model for Gene Duplication Genetics, December 1, 2005; 171(4): 1455 - 1461. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Zhao, L. Fang, N. Chen, R. C. Johnsen, L. Stein, and D. L. Baillie Distinct Regulatory Elements Mediate Similar Expression Patterns in the Excretory Cell of Caenorhabditis elegans J. Biol. Chem., November 18, 2005; 280(46): 38787 - 38794. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Gabaldon, M. Lopez-Serrano, M. A. Pedreno, and A. R. Barcelo Cloning and Molecular Characterization of the Basic Peroxidase Isoenzyme from Zinnia elegans, an Enzyme Involved in Lignin Biosynthesis Plant Physiology, November 1, 2005; 139(3): 1138 - 1154. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Notebaart, M. A. Huynen, B. Teusink, R. J. Siezen, and B. Snel Correlation between sequence conservation and the genomic context after gene duplication Nucleic Acids Res., October 27, 2005; 33(19): 6164 - 6171. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. G. Woods, C. Wilson, B. Friedlander, P. Chang, D. K. Reyes, R. Nix, P. D. Kelly, F. Chu, J. H. Postlethwait, and W. S. Talbot The zebrafish gene map defines ancestral vertebrate chromosomes Genome Res., September 1, 2005; 15(9): 1307 - 1314. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. S. Makarova, Y. I. Wolf, S. L. Mekhedov, B. G. Mirkin, and E. V. Koonin Ancestral paralogs and pseudoparalogs and their role in the emergence of the eukaryotic cell Nucleic Acids Res., August 16, 2005; 33(14): 4626 - 4638. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Jones and D. J. Begun Parallel evolution of chimeric fusion genes PNAS, August 9, 2005; 102(32): 11373 - 11378. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Poon, B. H. Davis, and L. Chao The Coupon Collector and the Suppressor Mutation: Estimating the Number of Compensatory Mutations by Maximum Likelihood Genetics, July 1, 2005; 170(3): 1323 - 1332. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Hughes Gene duplication and the origin of novel proteins PNAS, June 21, 2005; 102(25): 8791 - 8792. [Full Text] [PDF] |
||||
![]() |
N. Eldredge, J. N. Thompson, P. M. Brakefield, S. Gavrilets, D. Jablonski, J. B. C. Jackson, R. E. Lenski, B. S. Lieberman, M. A. McPeek, and W. Miller III The dynamics of evolutionary stasis Paleobiology, June 1, 2005; 31(2_Suppl): 133 - 145. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Lynch, M.
- Articles by Force, A.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Lynch, M.
- Articles by Force, A.





















































