| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 178, 2201-2216, April 2008, Copyright © 2008
doi:10.1534/genetics.107.076687
| ||||||||||||||||||||||||||||||||||||||||||||||||||
Department of Statistics, University of Georgia, Athens, Georgia 30602
1 Author e-mail: pdschlie{at}stat.uga.edu
Manuscript received May 29, 2007. Accepted for publication January 11, 2008.| ABSTRACT |
|---|
|
|
|---|
The reason that single-locus traits are relatively easy to map is that there is a strong correlation between genotype at the causative locus and the trait. If a qualitative trait is present, then there is usually or always a corresponding genotype at the causative locus. By contrast, with a multilocus trait it is possible to exhibit the trait without there being a "trait genotype" at a particular locus or to not exhibit the trait when there is a trait genotype at that causative locus. Thus, there is not a consistent genotypic difference between those individuals exhibiting the trait and those who are not (or between different levels of the trait for a quantitative trait) and power to map complex trait loci is low.
In the past few years, the new field of "genetical genomics" has emerged—that is, the combination of molecular marker data and genomewide expression data to elucidate the genetic basis of gene expression. Evidence has emerged of substantial genetic variation in gene expression (e.g., BREM et al. 2002; OLEKSIAK et al. 2002; YAN et al. 2002; CHEUNG et al. 2003; SCHADT et al. 2003; MORLEY et al. 2004). For example, BREM et al. (2002) conducted linkage analysis of expression levels in a cross between two yeast strains. They found 1528 genes with differential expression. Among those genes they found a median heritability of expression of 84%. Expression levels for 308 of these genes showed linkage to at least one marker. MONKS et al. (2004) measured expression in 23,499 genes in lymphoblastoid cell lines from members of 15 CEPH families. A total of 2340 were found to be differentially expressed and 762 (31%) of these had heritability significantly different than zero. Median heritability was 0.34 for these genes and 25% had heritability >0.44. A genome scan was performed and 55 expression-level QTL (eQTL) for 33 genes were found. Twenty of these QTL explained >90% of the variance and >45 explained >70% of the variance.
These and other studies show that there may be many expression-level traits with a relatively simple and tractable genetic basis. This indicates that microarrays might be a powerful tool for mapping loci underlying complex traits. The premise is that certain gene expression-level traits might have a stronger correlation with the underlying causative loci than the trait does itself. That is, (1) many expression-level traits are highly heritable relative to complex clinical traits and appear to have a relatively simple genetic basis, (2) there is evidence of multiple expression-level traits mapping to the same chromosomal region (and perhaps the same locus), and (3) if genetic polymorphisms that affect disease risk also affect expression levels of other genes, these expression-level traits will give high power for mapping the polymorphisms. By causative locus, I mean a locus at which there is genotypic variation responsible for variation in a clinical trait of interest. By clinical trait, I mean an organismal trait of interest, such as disease status, body weight, etc. Of course, the word "clinical" is used just for convenience—the trait need not solely be of clinical interest.
The potential of this type of approach was demonstrated by SCHADT et al. (2003). They performed an F2 cross between two inbred mouse strains and performed a genomewide scan for linkage for expression levels of 23,574 genes. The mice were on a high-fat diet for 4 months and thus developed a spectrum of obesity. They were classified according to the subcutaneous fat-pad-mass (FPM) trait and expression levels were compared between the upper 25% and the lower 25% for this trait. They examined the 280 genes most differentially expressed between these two groups and found that clustering on this set of genes divided the mice into two high-FPM groups and one low-FPM group, revealing genetic heterogeneity in the high-FPM mice. They then performed a genome scan for the FPM trait and showed that LOD scores of QTL for this trait were substantially increased when only one of the high-FPM groups was included. They also showed that a number of genes had expression-level traits mapping to the same region as a QTL for the FPM trait.
To apply genetical genomics approaches to a clinical trait (e.g., a disease), it is necessary to establish a connection between expression-level traits and the clinical trait. To date, most of the focus in genetical genomics has been in exploring the quantitative genetics of gene expression and in candidate gene studies. Substantial progress has been made, on both experimental (e.g., the articles cited above) and statistical (see KENDZIORSKI and WANG 2006 for review) fronts. However, little is still known about how transcriptional variation among genotypes relates to variation in clinical traits (GIBSON and WEIR 2005). Until this is better understood, it will be difficult to apply genetical genomics approaches to understanding complex traits. The goal of this article is to explore how transcriptional variation among genotypes should be expected to relate to variation in a qualitative trait.
The ideal expression-level trait for mapping would be one whose variation is entirely determined by the genotype at a single one of the L causative loci. If such expression-level traits existed for each of the causative loci, then they could all be mapped with relative ease. However, such expression-level traits would not correlate strongly with the disease and might be difficult to detect as differentially expressed between disease affected and unaffected individuals. Expression-level traits that correlate strongly with the disease would be easiest to detect as differentially expressed, but would not be any better than the disease itself as a response variable for mapping. SCHADT et al. (2003) and others have found a very large number of eQTL mapping to locations throughout the genome, irrespective of trait status. Determining which of these eQTL are relevant requires establishing a correlation between the expression-level trait and the disease. Thus, the use of eQTL for mapping complex traits may just shift the obstacle of low power for establishing a correlation between trait and genotype to a new obstacle of low power for establishing a correlation between disease and expression level. Thus, it is unclear how powerful we should expect eQTL-based strategies to be.
A number of studies have used microarrays for identifying candidate genes (e.g., LAWN et al. 1999; BERGE et al. 2000; KARP et al. 2000; WAYNE and MCINTYRE 2002; KIRST et al. 2004; BYSTRYKH et al. 2005; CHESLER et al. 2005; HUBNER et al. 2005; LI et al. 2005; SCHADT et al. 2005). For example, BYSTRYKH et al. (2005) mapped eQTL from mouse stem cells. Those that mapped to the region of a known QTL for stem cell turnover were then identified as candidate genes. Such approaches are promising when there is prior knowledge of the location of a causative locus. The primary focus of this article is genome scans—that is, whole-genome linkage or association studies when there is no candidate region previously identified.
There are two basic tasks that must be accomplished to use genomewide expression information for a genome scan: (1) an association must be established between the transcript and the clinical trait and (2) association must be established between the transcript and a marker. These tasks might be done in a two-stage analysis or as some combined analysis, but they must be accomplished somehow. First, I examine the statistical power to detect expression-level changes as being associated with the clinical trait. Then, I examine the power to detect eQTL for expression-level traits that have been identified as differentially expressed.
I consider both transcripts that are disease causative (that is, variation in transcript level contributes directly to disease risk) and those that are controlled by a disease locus but are not causative. These latter transcripts are termed "reactive." Note that this term refers to transcripts reactive to a disease locus and not to the disease itself (thus the usage is somewhat different from that in some previous eQTL articles). Power is shown to be quite good for causative transcripts, with sample sizes on the order of 100 being sufficient for reliable detection of association with the clinical trait. The story is more complicated for reactive transcripts. There is a trade-off in power between the two tasks laid out in the previous paragraph. The sample sizes needed for reasonable power to detect expression-level differences are often very high unless the expression-level trait and disease share multiple causative loci (thus creating a higher correlation between disease and expression level). Unfortunately, the power to map these causative loci via the expression-level trait plummets as the number of influencing loci increases. The result of this trade-off is that, for natural populations, there may be no expression-level traits that will give good power for detecting association both with the clinical trait and with a marker (although the exact power varies greatly depending on model assumptions and parameter values). It should be stressed that I am considering only associations involving single expression-level traits and there is potentially a great deal of information contained in the joint behavior of expression-level traits. Furthermore, improved experimental designs are possible. Thus, it is not the goal of this article to make absolute statements about statistical power. Rather, the goal is to determine the fundamental issues and to provide a baseline for understanding power in such studies.
| POWER TO DETECT EXPRESSION-LEVEL DIFFERENCES DUE TO POLYMORPHISMS BETWEEN AFFECTED AND UNAFFECTED INDIVIDUALS: METHODS |
|---|
|
|
|---|
Consider an ELT X1 that is either causative for the disease or reactive to a disease locus. We wish to find the distribution for X1 as a function of an individual's disease-affected status. If T is an indicator variable for disease status (T = 1 indicating disease and T = 0 indicating no disease), then we have
![]() | (1) |
is a vector of genotypes at the causative loci and the disease prevalence is K = P(T = 1).
![]() | (2) |
Multiplicative trait model:
To proceed, we need to further specify the trait penetrance model. The first example model is a multiplicative one. If the ELT is reactive only, we have
![]() | (3) |
is the contribution to disease risk from the one-locus genotype ij on locus j. Thus, G is a vector [g1, g2, ..., gL]. The mechanistic basis for a multiplicative model is that each locus affects one of a chain of L disease events that must all occur for the disease to occur. By "disease event" I mean some physiological event associated with the locus that contributes to the occurrence of the disease.
Substituting (3) for P(T = 1 | G) in (1),
![]() | (4) |
denotes the multiple summations over the individual-locus genotypes gi and I have suppressed the X1 = x and G = g notation to simplify the appearance of the equations. I assume no gametic disequilibrium so that
. In this case K = K1K2 ... KL, where
![]() | (5) |
![]() | (6) |
If the ELT is causative, the disease risk depends both on genotype and on the value of the ELT X1,
![]() | (7) |
![]() | (8) |
The functions uc+1(gc+1) ... uL(gL) are of the same type as those for the reactive model. Note that it is not specified whether there are causative ELTs associated with the loci c + 1–L. The effects of these loci are contained in the functions uj(gj). The effect could be via a causative ELT or via some other mechanism.
Additive trait model:
Under an additive model with a reactive ELT, the disease probability is given by
![]() | (9) |
Under this model the disease event associated with any causative locus is sufficient to cause the disease, but the more that such disease events occur the higher the chance of disease. Equation 1 becomes
![]() | (10) |
![]() | (11) |
Expression-level distribution:
I use the above relationships to calculate the power to detect expression-level differences due to genotypic differences between affected and unaffected individuals. The premise is that there should be a large number of changes in expression-level traits associated with the occurrence of a disease or other trait.
Suppose that some individual has mutations at some set of disease causative loci and that each such mutation initiates some sequence of events. These different sequences of events will then eventually interact in some way to cause the disease (or, more generally, to increase disease probability). ELTs that are "upstream" in the sequence of events originating from a mutation will have a strong association with that mutation, but a weak association with the disease. That is, if the mutation is weakly associated with the disease (as is the case for complex diseases), then ELTs strongly associated with that mutation must also be weakly associated with the disease. On the other hand, ELTs that occur "downstream" of the interactions between mutations will have stronger association with the disease but weaker association with the disease loci (because these ELTs are themselves complex traits). I assume that no reactive ELTs are simultaneously upstream and downstream—that is, for example, that no reactive ELT depends directly both on genotype at a causative locus and on disease status (although there would be indirect dependencies in both cases). The precise meaning of this becomes clear in the derivations that follow.
My focus is on the statistical power to map a specific one of the L causative loci. I call this locus the target causative locus (TCL). I assume that the genotype at this causative locus affects the expression level of M genes (possibly including itself). If there is a causative ELT, then M is always taken as one. The genotype at loci besides the L causative loci may also affect an expression-level trait, but such loci are uncorrelated with the disease and their effect is assumed to be included in the normal distributions. When I refer to a causative locus as influencing or controlling an expression level, I mean specifically that genetic variation at the causative locus affects the expression-level trait.
Under the assumptions stated earlier, the expression-level distribution is a mixture of normal distributions, where the mean and the variance of those normals are genotype dependent. For c = 1, and a multiplicative model with a reactive ELT, we have
![]() | (12) |
![]() | (13) |
denotes a normal probability density distribution with mean
and variance
.
Standard statistical theory states that if the probability density function is
![]() |
![]() |
![]() | (14) |
and
. The resulting distribution is not normal. However, I consider sample sizes large enough that the central limit theorem applies and the distribution of means is normal. Power is then calculated using standard theory for the comparison of two groups. If there are multiple expression-level traits dependent on the TCL (i.e., M > 1), then X1 becomes a vector of expression levels and the normal distribution is replaced by a multivariate normal. In this case a simulation approach is used for power calculations (described later).
We next need to specify how the expression mean and variance depend on genotype. I take the variance as a constant for all genotypes. I use either a multiplicative or an additive model for the dependence of the mean on genotype. If expression is multiplicative and controlled by loci 1–c, then the expression mean is given by
![]() | (15) |
is the contribution to the mean from locus i.
In the additive case, the expression mean is given by
![]() | (16) |
Except where noted otherwise, the form of genetic control (multiplicative or additive) is the same for expression level and disease.
Population genetic parameters:
I assume that all alleles at the TCL can be classified either as normal or as disease causing. All disease-causing alleles at the TCL are assumed to have equal effect. The frequency of disease alleles (denoted by D) is p and that of normal alleles (denoted by d) is 1 – p. The contribution to disease risk u(gi) is
for a normal homozygote (genotype dd),
for a disease homozygote (genotype DD), and
for heterozygotes (genotype Dd). These are the probabilities that the disease event associated with that locus occurs, given genotype. The amount of disease risk associated with the TCL is given by a parameter L. The single-locus contribution to disease risk KL is taken as equal to K1/L for a multiplicative model and K/L for an additive one. Thus, this is the contribution that locus would have if there were L loci all of equal effect. Note that this does not assume that there are actually L loci of equal effect, just that the effect attributable to the TCL is the same is if there were. Note that the role of L in the previous derivations is no different when we interpret L this way. For the target causative locus I take
and
for a multiplicative penetrance model and
and
for an additive model.
Very little is known about allele frequencies or other parameters for complex traits. Because the prevalence K is typically known, I take it as given and solve for the allele frequency p using (5) after specifying the parameters h,
,
, and L.
The parameters
and
are the mean expression levels for individuals with two disease alleles at all c loci controlling the expression level and the genotype with 0 disease alleles at those loci, respectively. If the expression genetics are multiplicative, then the single-locus expression mean
is given by
,
, and
for individuals with zero, one, and two disease alleles at locus i, respectively. If expression is controlled additively, then these values become
,
, and
.
The parameter
is the standard deviation in expression (assumed the same for all genotypes) and I take
. Values for
were chosen on the basis of calculations given as supplemental material using the data of HEDENFALK et al. (2001), who measured expression-level differences between patients carrying different breast cancer mutations. Briefly, I calculated t-statistics for differences in expression in 3326 genes between women carrying and not carrying BRCA1 mutations. I used an approach similar to that used by BREM and KRUGLYAK (2005) for correcting for multiple testing. That is, I randomly divided the data set in two and used half of the data to calculate t-statistics for all 3226 genes. Then, I used the other half of the data to recalculate the t-statistics for those genes with the 50 highest values. The 10th highest t-statistic from this set was
3 (the maximum was 6 and the median was 2). On the basis of this, I take a baseline of
= 3. This was chosen as an extreme but not the most extreme value: that is, we are specifically interested in the expression-level traits that are most correlated with the disease mutation, but want to avoid possible outliers. See the supplemental material for more details. Table 1 gives a description of parameters used in this article.
|
In the case of reactive ELTs, I assume that the M expression-level traits are independent given the genotype at their causative loci. This is most likely not true in many cases. This assumption will increase the calculated power because correlations between ELTs would decrease the effective value of M.
Note that I consider detection of an association between a reactive ELT and the disease as a "success," because that reactive ELT can be used to map the controlling disease locus even if the ELT itself is not disease causative. However, in another context a researcher might be interested in detecting only ELTs that are disease causative. In this case, the statistical power calculated here for reactive ELTs would actually be the probability of type I error.
In this article I strictly focus on expression-level traits directly controlled by disease-causative loci. There may also be expression levels whose controlling locus is linked to a disease locus. These expression levels will also be correlated with the disease, but this correlation will be weakened by recombination. To avoid complications, I do not consider such genes.
I use a simulation approach to calculate approximate power for M > 1. For each repetition, expression levels for treatment and control groups are randomly generated using distributions (1) and (2). A t-test is conducted for each of the M expression-level traits. If at least one expression-level trait is significantly different between treatment and controls, then that repetition is declared a success. The power is then approximately equal to the fraction of success in a large number of trials.
| POWER TO DETECT EXPRESSION LEVEL DIFFERENCES DUE TO POLYMORPHISMS BETWEEN AFFECTED AND UNAFFECTED INDIVIDUALS: RESULTS |
|---|
|
|
|---|
, sample size is 200 (100 for each treatment), the penetrance parameters
and
are 0.0 and 1, respectively (indicating perfect penetrance with respect to the full multilocus genotype), and the expression-level trait depends only on a single TCL. Thus, there are three expression-level trait means. In both plots
and
, where
is the mean expression level for individuals with i disease alleles at the TCL. These values correspond to
= 3. The dominance in expression is hE = 0 (completely recessive) in Figure 1a and hE = 0.5 (no dominance) in Figure 1b. I assume that there are 20,000 genes tested on the microarray. These are default parameter values (with hE = 0.5) throughout this article unless otherwise specified.
|
When L > 1, then some nondisease individuals will also have DD genotypes at the TCL (see the description of the multiplicative trait model). As L is increased, the disease-allele frequency at the TCL also increases. As the number of causative loci is increased, then the probability of the disease events associated with those loci must also increase to maintain the constraint that the disease prevalence is 0.01. For example, the disease-allele frequency is 0.68 for L = 6 and 0.75 for L = 10 at h = 0. See SCHLIEKELMAN and SLATKIN (2002) for further details and plots of allele frequencies. As L increases, the proportion of unaffected individuals with a DD genotype at the TCL also increases. Thus, the contrast between the two treatments also decreases and power drops quickly. At h = 0.1, power is near 97% for L = 6, but drops to
46% for L = 11. The power also drops quickly as h is increased from 0. For example, at h = 0.3, the power is <50% for all values of L and for h
0.5 the maximum power is
0.05. One interesting aspect of Figure 1a is that power actually increases when L is increased from 1 when h is low but >0. This occurs because allele frequency increases as L is increased. As discussed above, the disease-allele frequency is low for L = 1 and when h > 0 most affected individuals are genotype Dd. The allele frequency increases as L does and thus more affected individuals are of genotype DD. Only genotype DD produces any contrast in expression level, so power is increased.
Figure 1b has identical parameter values except that hE = 0.5 (no dominance in expression). In this case, the power to detect differential expression is less sensitive to h and does not drop to zero as h is increased. Power is lowest for intermediate h and as above is highest for h = 0. However, the power in the best case is substantially worse for larger L than in the previous example. The maximum power is 0.81 for L = 6 and 0.61 for L = 5. As discussed above, disease-allele frequency is quite high for larger L when h = 0. Thus, the difference between treatment groups is primarily due to differences between Dd and DD individuals. Because the expression-level means are closer together, the difference is less. The closer h is to 0.5, the more affected individuals are of Dd genotype. Again, the contrast between treatments is lower and power decreases.
Supplemental material available at my website shows a similar plot for the case with hE = 1. In this case the power is highest at h = 1 and drops off quickly as h is decreased from 1 (although less quickly than in Figure 1a).
Dependence on c:
Next, I explore how the number of loci (c) controlling the expression-level trait affects the power to detect an expression-level difference. The more similar the genotypic dependence of the expression-level trait is to that of the disease, the more contrast in expression-level trait we expect there to be between affecteds and unaffecteds. Thus, increasing the number of loci controlling the expression-level trait should increase the power (since all loci controlling the expression-level trait are assumed to be controlling disease status also). There is no question that increasing the range of variation in an expression-level trait will increase power. The unknown is how changing the control of expression level affects power. Therefore, the maximum and minimum expression levels are kept constant as c is varied. That is, the expression level for the genotype with all c loci as DD is constant and the expression level for the genotype with all c loci as dd is constant. Figure 2 shows plots for several parameter values. In Figure 2, a and b, the expression-level trait has an additive dependence on the TCLs while it has a multiplicative dependence in Figure 2, c and d. In all cases the disease has a multiplicative dependence. We see that the power increases only slowly with c in the additive case. The advantage of increasing c is that it increases the concordance of genotypic dependence between expression-level trait and disease status. However, if disease and expression-level traits depend in fundamentally different ways on the multilocus genotype, this concordance does not increase substantially with c.
|
We see in Figure 2, c and d, that there is good power to detect expression-level differences for L = 3 and c = 1–2 and for L = 6 cases and c = 2–3. For higher values of L and h = 0.5 (Figure 2c) the power is maximized in the 50–60% range for c = 4–5. For the h = 0.9 case (Figure 2d) power is substantially lower.
Dependence on M (number of expression-level traits controlled by the TCL):
If the difference in genotype distribution between affecteds and unaffecteds means is not large at the TCLs, then the probability of the sample means being different enough to achieve significance may be small. In this case, the probability of achieving significance is increased if there are multiple expression-level traits dependent on the TCLs. Figure 3 shows a surface plot of power vs. c and M for L = 9, hE = h = 0.5, and a sample size of 200. The power here is defined as the probability that at least one of the M ELTs is detected as differentially expressed. When c = 1, we see the power increasing from 5% for M = 1 to 50% for M = 40. In this case, there is little distinction between affecteds and unaffecteds at the TCL. The disease-allele frequency is 0.6 for these parameter values. Approximately 84% (that is, 1 – 0.42) of unaffecteds carry a disease allele at the TCL and there is little difference in TCL genotype between affecteds and unaffecteds. With so little difference in genotype distribution, each additional ELT dependent on the TCL makes only a small difference. However, a large number of such ELTs do make a substantial difference. For the case c = 2, small changes in M have a substantial affect on power. When we jointly consider two loci, the genotypic difference between treatments is greater. Power is still poor (36%) for M = 1, but improves to >80% for M = 8. For c = 3, power increases from
60 to
90% when M is increased from 1 to 4. Thus, increased M can help power substantially in cases where the genotypic distinction between treatments is intermediate.
|
|
and
= 1). Figure 5 shows that varying
(the probability of an individual with no disease alleles being affected) from zero seriously decreases power, while decreasing
from 1 has negligible effect. For
0.00005, the power is near zero even for a sample size of 1500. When
= 0 it is certain that an affected individual has at least one disease allele at each locus. When
> 0, then some affected individuals do not have any disease alleles at some loci. For the parameters in Figure 5 (L = 9 and prevalence = 0.01) and
= 0.0005, it can be shown that most affected individuals have at least one locus with no disease alleles. Thus, the contrast in expression levels between affecteds and unaffecteds is diminished. On the other hand, many loci in unaffected individuals have a disease alleles regardless of the value of
and changing it therefore has less effect. The sharp drop in power as
is increased is largely due to the particular form of model used. The parameter
is the probability of an individual with no disease alleles being affected. The parameter
, the contribution to disease risk of the TCL, is
. When L is large,
increases very quickly as
does. For example, when L = 9 and
= 0.0005, then
. Thus, the probability that a locus with no disease alleles contributes its disease event is quite high. The effect of varying
is small for the same reason. For
= 0.5 and L = 9, the value of
is 0.51/9 = 0.93. It is, of course, unknown whether this form for
is appropriate. However, if we assume a multiplicative model and that individuals with no disease alleles can have the disease, then it is necessary.
|
, where
and
are the expression-level means for individuals with all 2c disease alleles and 0 disease alleles, respectively, and
is the standard deviation in gene expression. This corresponds to 45% of variance explained by genotype. Referring to supplemental Figure 1, we see that a value of three is among the highest values calculated for the Hedenfalk data. However, there are genes with values ranging as high as six, and the power to detect expression level differences for these genes will be greater. Figure 6 shows a plot of power vs. the parameter
for various values of L and c. In this figure
and
are held constant at 1 and 0, respectively, while the expression-level variance
is varied. The sample size is 200, h = 5, hE = 0.5, and all other parameters are as in Figure 2. As expected, the power increases steadily with
. For example, the power in the L = 6, c = 1 case increases from
7% at
= 2 to 86% for
= 8. On the other hand, the power in the L = 9, c = 1 case reached only 37% at
= 8 because it started from only 1%.
|
|
Power under a multiplicative penetrance is much higher than that under additive penetrance even for c = 1. This is due to the very different genotype distributions under the two models. With multiplicative penetrance, affected individuals have at least one disease allele at all loci (or at most loci if the penetrance parameter
). In contrast, only one locus need have a disease allele to cause disease with additive penetrance. With L = 6 and other parameters as in Figure 7a,
90% of affected individuals carry only a single disease allele and most of the remaining 10% carry single disease alleles at two loci. Thus, only (90%/6 + 10%/3) = 18% of affected individuals carry a disease allele on a given disease locus. Assuming no dominance as in Figure 7, the average expression level for a gene controlled by this locus is 0.5 and thus the average expression level among affecteds is 0.5 x 0.18 = 0.09. Most unaffecteds carry no disease alleles, so the mean difference in expression level between affected and unaffected is
0.09. With the same parameters and a multiplicative model, affecteds on average have three loci that are homozygous for the disease allele and three loci that are heterozygous. The average expression level in affecteds for a gene controlled by the disease locus would be
0.5 x 1 + 0.5 x 0.5 = 0.75. Now, as discussed earlier, disease-allele frequencies are high under a multiplicative model (in strong contrast to the case with an additive model). The disease-allele frequency would be 0.46 for the parameters in Figure 7 and genotype frequencies in unaffecteds would be
0.21 disease homozygote, 0.50 heterozygote, and 0.29 normal homozygote. Thus, the mean expression level in unaffecteds would be 0.21 x 1 + 0.5 x 0.5 + 0.29 x 0 = 0.46 and the mean difference between affecteds and unaffecteds would be
0.75 – 0.46 = 0.29. The mean difference in expression level is 3.5 times higher for the multiplicative model than for the additive one and power to detect expression-level differences is correspondingly higher under the multiplicative model.
Figure 7b shows a plot of power vs. M for sample size 200, L = 4, and c = 1, 3, and 5. We see increasing M has minor impact. Even for this low value of L the power is still low for M up to 20.
Causative expression-level trait:
There are many possibilities for how disease risk depends on the expression level of a causative transcript. I use a simple threshold model with u(X1) = 1 if X1
Q and u(X1) = 0 if X1 < Q. The expression means and variances for affected and unaffected individuals are calculated using Equation 15 in a similar manner to that done for the reactive ELT. For example, the expected value among affecteds for the multiplicative model is
![]() | (17) |
Figure 8a shows a plot of power to detect a difference in X1 between affecteds and unaffecteds as a function of sample size. The parameter Q = 0.5, c = 1, and the expression means are
,
, and
. The different curves correspond to the value of L as shown. Other parameters are as in Figure 2. The power is greatly improved compared to a reactive ELT. For example, we see that 80% power is achieved at a sample size of
100 when L = 9. This compares to a sample size of 600 for 80% power for similar parameter values with a reactive ELT. This increased power results from a combination of higher expression means and lower variances for affected individuals when the ELT is causative relative to the case when it is reactive. Both of these effects result primarily from the heterozygote expression distribution being truncated in affected individuals. The value of the ELT must exceed Q in affected individuals. With the parameter values in Figure 8 (Q = 0.5 and
), half of the expression distribution is truncated in affected heterozygotes.
|
Q = 0.6, but that Q has a small impact on power. The effects of varying Q are complicated. As Q increases, fewer individuals will have an ELT value that exceeds it. Thus, the disease-allele frequency must increase to keep disease prevalence fixed. Increasing Q has four major effects. First, it increases the expression mean in affected individuals, which tends to increase power. Second, it increases the expression mean in unaffected individuals (because the disease-allele frequency increases), which tends to decrease power. Third, increasing Q reduces the portion of each genotype-specific expression distribution that exceeds the threshold, tending to decrease variance. Fourth, when Q reaches extreme values, then the disease allele frequency does also. This tends to reduce expression variance by reducing genetic variance in the population. The interaction between these effects is complex, but the net effect on power is small because the various effects are often in opposition to each other. Figures available as supplemental material show the effects of various other parameters on the power for the causative ELT. Power increases very substantially with c as the proportion of the risk explained by the ELT increases. The degree of dominance in the ELT (that is, the mean expression level for heterozygotes) has a roughly inverse effect to that of Q. That is, power is at a peak for intermediate dominance and the various effects discussed above for Q work in roughly the opposite direction as h is increased.
Allowing M to vary for causative ELTs would require specifying a model for how these ELTs interact to determine disease risk. Thus, I keep M fixed at one. Other parameters have effects similar to those observed for reactive ELTs.
This model for a causative ELT is not compatible with additive disease risk for the standard parameter values used in this article. In the additive case, the disease risk associated with the causative locus is K/L = 0.01/9 = 0.0011. Under the standard parameters, the ELT has a probability of 0.07 of exceeding the threshold Q = 0.5 when its controlling locus has no disease alleles. This means that an individual with no disease alleles still has a probably 0.07 of being affected, which far exceeds the "allowed" disease risk for the locus. This can be corrected by lowering the genotype-specific ELT standard deviation
to a level that makes the probability of exceeding Q < 0.011. This requires halving the standard deviation to
0.15 for the standard parameter values. Not surprisingly, power is quite good with this low ELT variance (comparable to that seen in Figure 8). However, the analysis of
values described in the METHODS section indicates that ELTs with values this low may be very rare.
| POWER TO MAP CAUSATIVE LOCI USING DIFFERENTIALLY EXPRESSED TRANSCRIPTS AS QUANTITATIVE TRAITS |
|---|
|
|
|---|
100. With c = 1 (meaning that the ELT has a single controlling locus), the power to map the controlling locus will be good. Thus, in this case, eQTL-based strategies for genome scans should be powerful. The situation is more complicated for reactive ELTs. Power was shown to be very poor if an additive penetrance model holds. Power for multiplicative penetrance depends strongly on the parameter c. If there is only a single TCL controlling an expression-level trait (c = 1), then power to detect an association between that gene and disease status is low unless L is small. If multiple TCL control the gene (i.e., c > 1), then the power to detect associations with the disease is higher. However, this expression-level trait is then a multilocus trait and it will have lower power in use as a quantitative trait for detecting the TCL(s).
A number of methods for mapping QTL in natural populations have been proposed (see, e.g., LYNCH and WALSH 1998). I will focus on humans and assume that mapping is conducted using Haseman–Elston (HE) regression (HASEMAN and ELSTON 1972). HE regression uses sib pairs and detects linkage to a QTL by regression of the squared difference in trait values on the number of alleles identical-by-descent (IBD) between the sibs at each marker. If a marker is linked to a QTL, then there should be a negative correlation between the squared trait difference and the IBD status at the marker.
I used a Monte Carlo simulation to estimate power for HE regression with an eQTL. I assumed a multiplicative model for gene expression with equally contributing loci. Figure 9a shows a plot of power vs. the parameter c for sample sizes of 100, 300, and 500 sib pairs. M = 1 in Figure 9a. That is, there is one expression-level trait controlled by the eQTL. This expression-level trait being tested is assumed to be completely linked to one of the c controlling loci. A Bonferroni correction for 50 tests was applied (i.e.,
= 0.05/50 = 0.001). This is very a modest correction considering that there typically might be tens or hundreds of differentially expressed genes tested against each of hundreds of markers (note that a correction for 20,000 tests was used in previous sections of this article). We see that power drops quickly as c increases, with <10% power for c = 2 and 500 sib pairs.
|
The power curves in Figure 9b correspond to M = 10, 20, 50, and 100 as shown. The sample size is 500 sib pairs in each case. The power is defined as the probability that the target causative locus is detected as an eQTL for at least one of the expression-level traits. Each of the M expression-level traits is assumed to be controlled by the same set of c loci. We see that the power increases strongly with M. However, M = 100 is required to achieve 80% power when c = 2 and the power is only 20% when c = 3.
The basic HE regression method assumed here is not the most powerful method available for detecting QTL in natural populations. However, other methods are not dramatically better and it is not expected that the qualitative results will be much different. Thus, we see that the power for detecting an eQTL is expected to be poor when the expression-level traits have more than one controlling locus, unless that locus is an eQTL for many expression-level traits.
| DISCUSSION |
|---|
|
|
|---|
Results that will appear in a future publication (P. SCHLIEKELMAN, unpublished results) show that the situation is better for crosses with inbred lines. The power to detect expression-level differences is not substantially higher, but the power to detect linkage drops off more slowly as the number of loci (c) controlling the expression-level trait increases.
These results do not imply that no reactive expression-level changes should be detected with natural populations. On the contrary, the power to detect such differences is quite reasonable for expression-level traits influenced by multiple causative loci (i.e., expression-level traits with c > 1). The point is that the reactive expression-level traits that are found to be most differentially expressed are likely to be poor candidates for use as quantitative traits for linkage detection.
Model assumptions:
The results of this article depend on a large number of assumptions about how disease risk and expression levels depend on genotype. This is unavoidable given our lack of knowledge of the genetics of complex traits. I have used two very different disease-penetrance models and a wide range of parameter values to give a sense of what the range of possibilities is. However, in the end the results of a model are only as good as the assumptions that go into it and it is possible that these assumptions are badly off.
In a sense, the additive and multiplicative models are at extreme ends of a continuum. A multiplicative model requires that each of the causative loci have a disease event for the disease to occur. The additive model requires only a single locus to have a disease event for disease to occur. As shown in this article, this difference results in very different genotype distributions. I have focused on the multiplicative model because the power to detect reactive expression-level differences is very poor under an additive model and the additive model does not appear consistent with a causative ELT. If an additive model is correct, then there appears to be little hope for mapping complex disease genes using eQTL. Fortunately, additive disease-penetrance models do not appear to be consistent with family history data in humans. RISCH (1990) showed that inheritance patterns do not change as the number of causative loci increases under an additive model. Thus, if the inheritance patterns for a disease indicate a multilocus trait, then it cannot be additive. Multiplicative models typically do fit family history data well (e.g., SCHLIEKELMAN and SLATKIN 2002). However, it is likely that a great many other models would fit equally well and there is no way to tell if the "true" model behaves more like an additive model, like a multiplicative model, or like neither model.
The values of M (the number of conditionally independent expression-level traits influenced by the TCL) assumed here are also highly speculative. Surveys of eQTL have found great variability in the number of gene expression-level traits mapping to individual loci. SCHADT et al. (2003) found several hotspots with hundreds of eQTL mapping to 4-cM regions. CHESLER et al. (2005) found a single locus modulating the expression of 1650 transcripts in addition to numerous loci modulating hundreds of transcripts. On the other hand, MONKS et al. (2004) did not find evidence of such hotspots. Their study was conducted with randomly chosen human families whereas studies that have found hotspots were conducted with inbred lines. One possible explanation for the difference is that there is little variation at hotspot loci in natural populations, but selection for divergent phenotypes (or perhaps chance) in inbred lines leads to genotypic differences between strains. Another possibility is that expression-level traits with multilocus inheritance in natural populations become monogenic between inbred lines and thus are easier to detect. In any event, an increase in M increases the power to detect at least one expression-level trait associated with the TCL and a causative locus with very high M would be detectable as differentially expressed with reasonable sample sizes even for c = 1 (see Figure 3). However, M is the effective number of independent (given TCL genotype) expression-level traits influenced by the TCL. The amount of correlation between expression-level traits in the above studies is unknown. Most likely they are not all independent and the effective number of independent expression-level traits may be substantially lower than the observed number.
Comparisons to standard methods for genome scans:
The required sample sizes shown here are large by the standards of typical microarray experiments, but not outrageous on the scale of human linkage studies. Power studies (e.g., RISCH and MERIKANGAS 1996; SLAGER et al. 2000) have shown that sample sizes of many thousands might be required to detect complex trait loci in genome scans. There are too many unknowns to say whether eQTL will provide a more effective method for mapping complex trait loci than do conventional mapping strategies. If there are causative ELTs for many complex traits, then the results here indicate that eQTL-based strategies