Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Rapid evolution of male-biased gene expression in Drosophila Colin D. Meiklejohn*†, John Parsch‡, José M. Ranz*, and Daniel L. Hartl* *Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138; and ‡Department of Biology II, Section of Evolutionary Biology, University of Munich, Luisenstrasse 14, 80333 Munich, Germany Edited by Eviatar Nevo, University of Haifa, Haifa, Israel, and approved June 17, 2003 (received for review February 5, 2003) A number of genes associated with sexual traits and reproduction evolve at the sequence level faster than the majority of genes coding for non-sex-related traits. Whole genome analyses allow this observation to be extended beyond the limited set of genes that have been studied thus far. We use cDNA microarrays to demonstrate that this pattern holds in Drosophila for the phenotype of gene expression as well, but in one sex only. Genes that are male-biased in their expression show more variation in relative expression levels between conspecific populations and two closely related species than do female-biased genes or genes with sexually monomorphic expression patterns. Additionally, elevated ratios of interspecific expression divergence to intraspecific expression variation among male-biased genes suggest that differences in rates of evolution may be due in part to natural selection. This finding has implications for our understanding of the importance of sexual dimorphism for speciation and rates of phenotypic evolution. microarray 兩 intraspecific variation 兩 interspecific variation 兩 cDNA A nisogamous reproduction is common in many animal and plant species and can produce a number of conflicts with important evolutionary consequences. For example, differential selection coefficients between the two sexes can lead to stable genetic polymorphisms or a decline in population mean fitness (1). It can also drive accelerated rates of phenotypic evolution, as many morphologies associated with sex and reproduction diverge more rapidly than other phenotypes (2). Molecular techniques that provide rapid and quantitative measures of genotypic and phenotypic variation have extended this pattern to include accelerated rates of evolution among proteins with sexual or reproductive functions (3, 4). Since then, most data supporting this observation have come from homologous nucleotide sequences of genes that are associated with sex or reproduction. In ciliates, green algae, diatoms, angiosperms, fungi, and at least four animal phyla, unusually high ratios of nonsynonymous to synonymous substitutions (dN兾dS) between species have been documented in sex-related genes (reviewed in ref. 5). Some of these genes also show high levels of intraspecific differentiation (5). In Drosophila, much of this work has focused on genes that are expressed in testes or accessory glands (e.g., refs. 6 and 7), although a high dN兾dS has also been observed for genes expressed in females and components of the sex determination pathway (8). Protein coding sequences provide a natural context for studying rates of evolution, as the effect of a given nucleotide substitution on the polypeptide is predictable, and comparison between neighboring synonymous and nonsynonymous sites controls for mutation rate. Because of the lack of an analogous context for regulatory sequences, the rates and patterns of evolution in regions of the genome controlling gene expression are less well understood. Thus, it is not known whether the rapid rates of evolution among genes associated with sex and reproduction holds for gene expression as well. Because a large proportion of important phenotypic evolution may be the result of changes in gene expression (9, 10), understanding rates and patterns of regulatory change within and between species is 9894 –9899 兩 PNAS 兩 August 19, 2003 兩 vol. 100 兩 no. 17 critical for a comprehensive picture of biological evolution. Given the pattern seen for amino acid sequences and morphologies, we would predict that genes associated with sex should be evolving faster at the level of gene regulation as well. Indeed, much of the divergence among proteins in the male reproductive tract of Drosophila may be attributable to large changes in protein levels, which is likely due in part to changes in gene expression (3). To test this prediction, we obtained gene expression data for ⬇1兾3 of the genome from adult males of eight strains of Drosophila melanogaster, and from adult males and females of one strain of D. melanogaster and one strain of Drosophila simulans. By analyzing intra- and interspecific expression differentiation within males and the sex-specificity of expression in both species, we show that gene expression in males evolves more rapidly than in females. Genes that are male-biased in their expression have on average more intra- and interspecific divergence in expression than genes with female-biased expression. Furthermore, comparison of intra- and interspecific differentiation suggests that at least some of the excess in divergence among male-biased genes (MBGs) is due to differential selective pressures acting on the expression of different sexbiased classes of genes. Materials and Methods Fly Strains and cDNA Preparation. Eight strains of D. melanogaster (three laboratory strains: Canton S, Oregon R, and Hikone R; an isofemale strain derived from St. Louis; and four lines derived from Zimbabwe: Zim53, Zim30, Zim29, and Zim2) were raised on standard medium at 25°C. Adult males were collected up to 24 h after eclosing, separated from females, and allowed to age an additional 3–4 days. Total RNA was extracted by using TRIzol reagent (Invitrogen) followed by chloroform extraction and isopropanol precipitation. Poly(A) RNA was purified by using the Oligotex Direct mRNA kit (Qiagen, Valencia, CA) and confirmed to be of high quality with a 2100 Bioanalyzer (Agilent Technologics, Palo Alto, CA). Two micrograms of poly(A) RNA was used as a template for SuperScript II reverse transcriptase (Invitrogen) in the presence of amino-allyl dUTP (Sigma). Cyanine-3 or cyanine-5 fluorochromes (Amersham Pharmacia) were incorporated after reverse transcription. Purification of cDNA and hybridizations were done following a published protocol (11). Labeled cDNAs were competitively hybridized to arrays by using the comparison scheme illustrated in Fig. 3, which is published as supporting information on the PNAS web site, www.pnas.org, with the following number of replicates per strain: Canton S, 13; Oregon R, 5; Hikone R, 3; St. Louis, 3; Zim53, 11; Zim30, 5; Zim29, 3; Zim2, 3. cDNA Microarrays. A total of 5,928 clones from the Drosophila Gene Collection version 1.0 (12) were amplified by PCR with universal primers, and the products were confirmed by gel This paper was submitted directly (Track II) to the PNAS office. Abbreviations: MBGs, male-biased genes; FBGs, female-biased genes; UBGs, unbiased genes; OBGs, ovary-biased genes. ‡To whom correspondence should be addressed. E-mail: cmeiklejohn@oeb.harvard.edu. www.pnas.org兾cgi兾doi兾10.1073兾pnas.1630690100 Table 1. Overrepresentation of MBGs among genes with polymorphic expression within D. melanogaster Significance level of polymorphism All genes P ⬍ 0.05 P ⬍ 0.01 MBGs FBGs UBGs G (2 df) 910 840 (92%) 655 (72%) 1,499 1,058 (71%) 513 (34%) 1,397 1,030 (74%) 575 (41%) 192 (P ⬍ 0.001) 351 (P ⬍ 0.001) Subsets of genes include those that exhibit at least one pairwise difference between any two strains at the significance level indicated. G, G test of independence. Statistical Analysis. Relative gene expression levels were deter- mined with a Bayesian method (Bayesian analysis of gene expression levels, BAGEL) (13) from the normalized ratio data (Supporting Materials and Methods, which is published as supporting information on the PNAS web site). This method estimates a normalized relative expression level for each strain and a single variance parameter across all strains from the Cy5兾Cy3 ratios, on a gene-by-gene basis. It also calculates credible intervals from the stationary distribution of the Markov chain used to obtain the posterior distribution of the parameters (eight mean expression levels and one variance). For all pairwise intraspecific comparisons, unless otherwise stated, the threshold chosen for statistical significance was P ⬍ 0.01. This threshold signifies that the relative expression value for a given strain was greater than (or less than) that of another strain in ⬎99% of the samples taken from the posterior distribution. Sex-biased expression was defined by analysis of a parallel set of experiments comparing gene expression in adult males and females from a lab strain of D. simulans and the D. melanogaster strain Canton S (14). The significance threshold for sex-biased expression was defined by nonoverlapping 95% credible intervals, which was determined to be equivalent to P ⬍ 0.00025 by using a randomization approach (Supporting Materials and Methods). We define MBGs and female-biased genes (FBGs) as those with significantly different expression between the sexes (in the same direction) in both D. melanogaster and D. simulans. Unbiased genes (UBGs) are defined as those clones that show no significant expression between the two sexes in either species. For a given gene, we describe intraspecific expression polymorphism (Se) by the coefficient of variation of the relative expression levels among all eight strains. Similarly, the kurtosis for a given gene’s expression (Ke) was calculated from the eight D. melanogaster expression values. Interspecific expression divergence (De) is described by the coefficient of variation between the mean of all eight D. melanogaster expression levels and the single D. simulans expression level (14) within a given sex. Differences in the distributions of these statistics are reported as the arithmetic mean of the statistic across all genes within a given sex-bias class (e.g., S៮ eM, S៮ eF, and S៮ eU for the mean expression polymorphism of MBGs, FBGs, and UBGs, respectively). Relative expression levels and the associated credible intervals for each gene were recoded into discrete expression states by assigning to different states all strains for which the 95% credible intervals were nonoverlapping. Strains whose 95% credible intervals overlapped with multiple strains in different states were assigned to all of those states. The number of different transcriptional states found among the eight strains for a given gene was then tabulated. Meiklejohn et al. Details regarding fluorescence ratio acquisition from microarray hybridizations, signal normalization and data quality control, assessment of false positive rates, and statistical analysis of previously published microarray data (15) are given in Supporting Materials and Methods. Results The comparison scheme used here to obtain transcription profiles from adult males of eight strains of D. melanogaster is shown in Fig. 3. The strains were chosen to represent the range of genotypic and phenotypic variation known to exist in this species. Populations from Africa show significant differentiation from non-African populations in nucleotide variation (17) and mating behavior (18). Of the 4,905 clones selected for analysis, 2,289 showed differences that were significant between at least one pair of strains, whereas 297 are expected by chance (see Supporting Materials and Methods), indicating that at least 40% of the genome is detectably differentially regulated between males from interfertile populations of D. melanogaster. Pairs of strains showed from 218 to 928 genes with significantly different expression, where on average only 26 are expected by chance (Table 6, which is published as supporting information on the PNAS web site). This degree of differentiation in expression profile between strains is much higher than has been previously reported for Drosophila (16), and may reflect differing experimental designs and statistical methods as well as the inclusion in this study of the Zimbabwe strains. However, this level of variation is similar to the proportion of differentially expressed genes detected between two strains of Saccharomyces cerevisiae (19). Although the Zimbabwe strains do show evidence of differentiation from the Cosmopolitan strains in global gene expression (unpublished results), there are a surprisingly small number of genes that show fixed differences between the two groups (Table 7, which is published as supporting information on the PNAS web site). The intraspecific comparisons were combined with information on interspecific divergence in gene expression between D. melanogaster and its sibling species, D. simulans (14). The combined data set consisted of 4,759 clones common to both experiments. As expected, the degree of differentiation among strains is smaller than the range of variation in transcription profiles seen between species. The variance in the distribution of log2 ratios across all genes between males of D. melanogaster and D. simulans is 0.208. Within D. melanogaster, the most divergent pair of strains has a variance in log2 ratios of 0.159, which is significantly lower than the interspecific comparison (F4758,4758 ⫽ 1.30, P ⬍ 0.001). Based on the distributions of log2 ratios, intraspecific differentiation ranges from 23% to 77% of interspecific divergence across the elements on these arrays. There is a strong effect of sex-biased expression on intraspecific variation in gene expression, and this effect is reversed between MBGs and FBGs. Among genes that show significantly different expression between at least one pair of strains, there is a significant overrepresentation of MBGs and an underrepresentation of FBGs (Table 1). The strength of this effect increases as the stringency of the threshold chosen for statistical signifiPNAS 兩 August 19, 2003 兩 vol. 100 兩 no. 17 兩 9895 EVOLUTION electrophoresis. Added to these was a set of 177 separately amplified controls, each of which was replicated from 1 to 16 times on the array. The PCR products were purified and mechanically spotted onto polylysine-coated glass slides (11). The results from these hybridizations have been deposited to the Gene Expression Omnibus (40) under accession nos. GPL356 and GSM7863–GSM7885. Table 2. Amounts and distribution of expression polymorphism is influenced by sex-biased expression S៮ e K៮ e MBGs UBGs FBGs P(M vs U) P(U vs F) 0.158 ⫺0.086 0.140 0.327 0.108 0.229 ⬍0.001 ⬍0.001 ⬍0.001 0.121 P values were calculated from Wilcoxon rank-sum test. Table 3. Interspecific divergence is accelerated among MBGs when expressed in males Expression in males Expression in females P(expressing sex) ៮ eM D ៮ eU D ៮ eF D P(M vs U) P(U vs F) 0.164 0.132 ⬍0.001 0.129 0.125 0.035 0.120 0.121 0.979 ⬍0.001 0.150 0.055 0.727 P values were calculated from a Wilcoxon rank-sum test. cance is increased. By comparison with genes whose expression is not sex biased, the effect of sex bias on intraspecific transcriptional variation can be shown to be the result of both a reduction in variation among FBGs and an increase among MBGs; S៮ eM and S៮ eF are both significantly different from S៮ eU (Table 2). Not only do MBGs on average have higher levels of expression polymorphism than FBGs or UBGs, but this variation is distributed differently among the eight strains than it is for the other ៮ e), for the expression levels of two classes. The mean kurtosis (K ៮e MBGs among the eight strains is significantly different from K ៮ eM is more platykurtic, and the two other for FBGs or UBGs; K classes tend toward leptokurtosis (Table 2). A similar result is observed by assigning expression levels to discrete states, analogous to transcriptional alleles. The distributions of number of states for sex-biased and unbiased genes are shown in Fig. 1. MBGs have a much greater proportion of genes with two or more expression alleles than do the FBG or UBG classes, in which the majority of genes have a single expression state (male-biased vs. female-biased: G ⫽ 388, df ⫽ 3, P ⬍ 0.001). The distribution of expression states among FBGs is also significantly different from that of UBGs (G ⫽ 52.8, df ⫽ 3, P ⬍ 0.001). This result is not due to differences in ability to discriminate between states among the sex-bias classes. The average 95% credible interval across all eight strains is very similar for MBGs and FBGs (0.432 and 0.452, respectively), but is significantly higher in the UBGs (0.536; female-biased vs. unbiased: ts ⫽ ⫺7.68, df ⫽ 2481, P ⬍ 0.001). If this were responsible for the differences in transcription state distributions, it would result in an excess of monomorphic genes in the UBG class relative to the FBG class, but in fact the opposite result is observed (Fig. 1). The interspecific hybridizations (14) allow a parallel set of observations to be made regarding the influence of sex-biased expression on interspecific divergence in gene expression. The ៮ e is signifiresults are consistent with the intraspecific data; D cantly greater among MBGs than UBGs or FBGs (Table 3). Fig. 1. Frequency distributions of gene expression states for male-enriched, female-enriched, and non-sex-biased genes. Relative gene expression levels were coded into discrete expression states as described in Materials and Methods. 9896 兩 www.pnas.org兾cgi兾doi兾10.1073兾pnas.1630690100 These experiments also provide information on expression divergence for gene expression in females, as well as in males. Although the same pattern of increased divergence among MBGs relative to UBGs and FBGs exists for these genes when they are expressed in females, it is not nearly as strong, nor is it statistically significant (Table 3). Interestingly, MBGs are significantly more divergent when they are expressed in males than when they are expressed in females. Although not significant, this pattern holds for UBGs, suggesting that gene expression in males in general, and not just expression of MBGs, may be rapidly evolving. Given DNA sequence data, deviations from the neutral expectation in the ratio of divergence to polymorphism have been used to infer the past activity of natural selection at a locus (20, 21). Although gene expression changes do not have a simple relationship with nucleotide sequence changes, a positive relationship between intra- and interspecific variation is predicted for neutrally evolving polygenic characters (22, 23), and has been empirically demonstrated for morphological traits that are presumably under selection as well (24). Elevated ratios of inter- to intraspecific variation in phenotypes associated with male reproduction have been used to infer the importance of directional selection on these characters relative to other types of morphologies (25). Thus, although we may not know a priori the neutral ratio of divergence to polymorphism for a given gene’s expression, differences in this ratio between groups of genes may indicate disparate selective pressures acting on these groups. Fig. 2 shows De plotted against Se for the three classes of sex bias. All three classes show a weak positive correlation between Se and De, and the correlation coefficient is significantly different between all three classes, indicating a different relationship of covariation between Se and De for the three classes of sex bias. The majority of sex-biased regulation in Drosophila has previously been shown to be the result of expression in germ-line tissues (15, 26); thus, transcription in testes and ovaries is most likely responsible for much of the sex-biased expression observed here, and in large part, it is genes expressed in the testes that are evolving rapidly and genes expressed in the ovaries that are evolving slowly. To confirm this conjecture, published data directly comparing D. melanogaster expression profiles of males and females, dissected testes and ovaries, and gonadectomized males and females (15) were analyzed and integrated with the results presented here. In addition to providing an independent identification of MBGs and FBGs, these data allow the description of analogous classes of genes defined by significantly different expression between testes and ovaries and between the somatic tissues of males and females. The patterns of rapid expression evolution seen among MBGs relative to FBGs described above are also seen in the whole fly experiments of Parisi et al. (15). MBGs are overrepresented relative to FBGs among genes with polymorphic expression when the experiments of Parisi et al. (15) are used to determine sex-biased expression (data not shown). This same pattern is also found among genes with testis or ovary-biased expression (Table 4). Interestingly, this discrepancy is not observed for sex-biased genes when assayed in gonadectomized adults, because somatically MBGs and somatically FBGs show virtually identical Meiklejohn et al. Fig. 2. Se and De for male-biased, female-biased, and unbiased genes. The product moment correlation coefficients for each class are all significantly different from zero (MBG, r ⫽ 0.273, P ⬍ 0.001; FBG, r ⫽ 0.064, P ⫽ 0.013; UBG, r ⫽ 0.162, P ⬍ 0.001) and from each other (MBG vs. UBG, z ⫽ 2.717, P ⫽ 0.007; UBG vs. FBG, z ⫽ 2.674, P ⫽ 0.008). Both Se and De were log-transformed before graphing. representation among genes with polymorphic expression (Table 4). Furthermore, both of these classes appear to be overrepresented relative to genes with no somatic sex-biased expression, suggesting that sex-biased expression may be evolving relatively rapidly in somatic tissues as well as the gonads, but in both sexes. Table 4. Overrepresentation of testis-biased genes and somatically sex-biased genes among genes with polymorphic expression within D. melanogaster Significance level of polymorphism All genes P ⬍ 0.05 (%) P ⬍ 0.01 (%) TBGs OBGs sMBGs sFBGs 579 543 (94)* 429 (74)* 787 579 (74)* 321 (41)* 55 49 (89) 38 (69) 73 61 (84) 49 (67) Significant departures from independence were determined by a G test with 1 df. Both sMBGs and sFBGs are overrepresented among polymorphic genes relative to genes with no somatic sex bias, of which 80% are polymorphic at P ⬍ 0.05 and 47% are polymorphic at P ⬍ 0.01. These results do not change if a different cutoff is chosen to assign significance to gonad or somatic sex-biased expression. TBGs, testis-biased genes; sMBGs, somatically MBGs; sFBGs, somatically FBGs. *, P ⬍ 0.001. Meiklejohn et al. Discussion The data presented here indicate that rates of both intraspecific and interspecific differentiation of gene expression in Drosophila are correlated with sex-biased expression, and that this difference is largely a function of gene expression in testes and ovaries. Furthermore, among somatically expressed genes, sex-biased expression in both sexes appears to evolve more rapidly than sexually monomorphic expression. Analogous results have come from morphological studies in Drosophila that documented a higher rate of intra- and interspecific divergence among morphologies associated with male reproduction than nonreproductive morphologies (25). These conclusions are not the result of nucleotide sequence divergence within or between species causing spurious inferences of changes in gene expression. Data from competitive hybridizations using genomic DNA extracted from D. melanogaster and D. simulans indicate that sequence divergence between these two species has a small effect on hybridization signal intensity that is within the range of experimental error associated with cDNA hybridizations (14). Furthermore, across the clones on these arrays, a greater number of MBGs are found in D. simulans than in D. melanogaster (14), which cannot be the result of sequence divergence. Consistent with our results on rates of expression evolution, a subset of male germ-line genes in Drosophila are known to be enriched for sequences with no detectable homologs in other eukaryotic genomes (26), suggesting that MBGs may be on the whole younger than other classes of genes. The accelerated rate of evolution among MBGs may therefore extend further back in time than the comparison between D. melanogaster and D. simulans, and is consistent with the hypothesis that evolution of male-specific phenotypes may be often driven by the creation of new genes (e.g., refs. 27 and 28). One interpretation of these results is that mutations affecting the expression of MBGs on average experience greater (i.e., PNAS 兩 August 19, 2003 兩 vol. 100 兩 no. 17 兩 9897 EVOLUTION ៮ e show similar patterns for ៮ e, and D The summary statistics S៮ e, K gene expression variation in the gonads and soma as was described above for whole fly extractions (Table 5). Although greater intra- and interspecific variation is seen among genes with no sex-biased expression in the gonads compared with ovary-based genes (OBGs), this difference is not as significant as the difference between UBGs and FBGs seen from whole body extractions (compare Tables 2, 3, and 5). Among genes with ៮ e among somatic sex-biased expression, there is a greater S៮ e and D both male-biased and female-biased genes than among genes with no somatic sex bias (Table 5). These patterns are also observed in the distributions of transcription states for testisbased genes, OBGs, somatically MBGs, and somatically FBGs (not shown). A large proportion of the transcriptional differences observed between D. melanogaster and D. simulans involves the loss, gain, or reversal of sex-biased expression (14). Examination of intraand interspecific expression variation in these genes does not reveal as clear a pattern as that observed for genes retaining an ancestral sex bias in both D. melanogaster and D. simulans. This is most likely due to a diversity of selective forces acting on genes with rapidly evolving sex bias. However, such genes do appear to be more variable in their expression within D. melanogaster than those that have retained the ancestral sex bias, as shown by a greater S៮ e among genes that are sex biased in one species only than among those that are sex biased in both species (malebiased in one species only, S៮ e ⫽ 0.180, P ⬎ 0.05, Wilcoxon rank-sum test; female-biased in one species only, S៮ e ⫽ 0.122, P ⬍ 0.001, Wilcoxon rank-sum test). However, among genes with a novel sex bias, there is still a correlation between Se and sex bias, as genes that are male-biased in D. melanogaster or D. simulans only have a S៮ e that is significantly greater than genes that are female-biased in one species only (P ⬍ 0.001, Wilcoxon rank-sum test). Table 5. Expression polymorphism and divergence as a function of sex-biased expression in the gonads and soma S៮ e K៮ e ៮e D TBGs UBGs OBGs P(T vs U) P(U vs O) sMBGs sUBGs sFBGs P(M vs U) P(M vs F) 0.162 ⫺0.229 0.180 0.124 0.248 0.134 0.119 0.191 0.125 ⬍0.001 ⬍0.001 ⬍0.001 0.175 0.486 0.225 0.202 ⫺0.090 0.207 0.131 0.182 0.135 0.221 0.174 0.224 ⬍0.001 0.193 ⬍0.001 0.280 0.115 0.536 P values were calculated from a Wilcoxon rank-sum test. either more positive or less negative) selection coefficients than UBGs or FBGs. A larger positive average selection coefficient would result in the fixation of a greater number of beneficial mutations affecting gene expression of MBGs, whereas less negative selection coefficients would result in a smaller fraction of deleterious mutations contributing to intraspecific variation affecting the expression of MBGs. Both of these hypotheses are suggested in the greater correlation between Se and De seen among MBGs (Fig. 2). Differences in this relationship between the sex-biased classes are largely due to an elevated De兾Se ratio among the most extremely male-biased genes. The 324 genes with the most significantly testis-biased expression (P ⬍ 0.001) have a higher average De兾Se than the 290 corresponding OBGs (Wilcoxon rank sum test, P ⫽ 0.04). Circumstantial evidence for the role of positive selection in the differentiation among MBGs is seen in the patterns of divergence between D. melanogaster and D. simulans (Table 3). If relaxed selection is driving the divergence of MBGs, the fact that gene expression among MBGs is less divergent in females than it is in males requires that the neutral mutations that fix and cause changes in expression have their effects in males only, because their regulation is more conserved in females. An alternate interpretation of these data are that there is a fundamentally different relationship between fold-change in expression and effect on fitness between the sex-biased classes of genes. Because gene regulation in male gametogenesis appears to be highly specialized in both mammals and insects (29), we might expect the evolution of gene expression in testes to be unusual. However, this explanation does not address the differences observed here between FBGs and UBGs (i.e., Tables 1 and 2). It is important to remember that the intraspecific comparisons presented here include gene expression data from males only. However, preliminary results indicate that the relationship between sex bias and rates of intraspecific expression evolution seen here for gene expression in males holds for expression in females as well. Assessment of expression profiles in virgin females of Canton S and Zim2 using the same methods described above reveals a significant overrepresentation of MBGs among those genes with significant differences between these two strains. Furthermore, in these females S៮ eM ⬎ S៮ eU ⬎ S៮ eF, and these differences are highly significant (J.M.R., unpublished data). This finding is in agreement with the results from the interspecific comparisons, which show that FBGs show similar or reduced amounts of interspecific differentiation than MBGs when these genes are expressed in females (Table 3). The literature documenting elevated rates of evolution among genes associated with reproduction has included numerous examples of genes with functions in both males and females (4, 5). This is in contrast to the results presented here, which show that the expression of FBGs evolves more slowly than that of both MBGs and UBGs, although this pattern is not as extreme as the accelerated evolution observed for malebiased expression. Some of the studies that have found rapidly evolving genes associated with female reproduction have focused on a small sample for which there was an a priori 9898 兩 www.pnas.org兾cgi兾doi兾10.1073兾pnas.1630690100 expectation of positive selection (e.g., ref. 30, but see ref. 4). The low correlation between Se and De for all three sex-bias classes shows that the rate of expression evolution for any given gene is likely to be idiosyncratic, and we observe a number of FBGs with high levels of intra- and interspecific expression variation. Nonetheless, assaying ⬇1兾3 of the Drosophila genome indicates that rapid evolution of expression is far more prevalent among MBGs than FBGs, and indicates the value of data sets of this size. One potential source of error in the above analyses is the assumption of independence across genes in their intra- and interspecific variation in expression. This assumption will be violated when a single genetic locus inf luences variation in the expression of multiple genes simultaneously, as will result from coordinate regulation. In the most extreme scenario, all genes expressed in the testes might appear to be up-regulated in a given strain of D. melanogaster because of an increase in the relative size of the testes in that strain. Such an extreme bias can be ruled out by the large number of genes that show all possible patterns of covariation among these strains, indicating that at a broad scale there are many groups of independently regulated genes. Further argument to this point can be made by reference to the few experiments to date that have examined the genetics of gene expression. Studies in S. cerevisiae (19), mice, and maize (31) indicate that 35– 80% of QTLs that inf luence expression of a gene map to the gene itself, suggesting cis-regulation. The fraction of cis-acting genetic factors increases with more stringent statistical cutoffs (31), suggesting that large changes in expression may more often be in cis, whereas trans-acting mutations are more often of small effect. Although these numbers are inf luenced by the power of each experimental design, these data from three different biological kingdoms suggest that a large fraction (⬎30%) of large effect variants affecting gene expression are in cis, and that this may be a phenomenon intrinsic to eukaryotic gene expression. This does not address how the remaining fraction of variation in gene expression is distributed across many unlinked factors of small effect. Nonetheless, it is unlikely that these caveats could affect the nature of our conclusions, or render them statistically insignificant. For example, all of the comparisons made in Tables 1 and 4 remain significant at P ⬍ 0.01, when the numbers of genes are multiplied by a factor of 0.1 (as might be appropriate if, on average, a single genetic variant were responsible for changes in the expression of 10 downstream genes). However, this issue will require population genetic studies of the inheritance of global gene expression to be conclusively addressed. It is tempting to speculate that MBG expression may be related to the evolution of hybrid male sterility (32). The genetic factors that inf luence male fertility appear to evolve much faster at both intra- and interspecific levels than those inf luencing female fertility or viability in either sex in Drosophila. This rapidity is evident in the disproportionately high amounts of genetic variation affecting male fertility observed in mutation–accumulation lines (33) and the excess of hybrid male sterility factors relative to hybrid female sterility factors that have accumulated between closely related species (34 – Meiklejohn et al. 36). None of these patterns is caused by an excess of loci affecting male fertility, because mutagenesis screens indicate that approximately seven times more genes inf luence viability than male fertility, whereas similar numbers affect fertility in the two sexes (37). A causal relationship between these observations would require that the rapid evolution of gene expression in MBGs leads to the misexpression of these genes in sterile hybrid males. A recent study of gene expression in hybrids between Drosophila mauritiana and D. simulans found that MBGs were preferentially misexpressed in the sterile F1 males (38), lending support to this hypothesis. Together, these patterns of gene expression and misexpression are consistent We thank the members of the Harvard Drosophila Microarray Consortium and the Bauer Center for Genomics Research for help in creating the arrays; the Hartl and Wakeley laboratories, A. M. Wilczek, and especially J. P. Townsend for helpful discussions; and Rama Singh, Mohamed Noor, and an anonymous reviewer for valuable comments on the manuscript. This work was supported by National Institutes of Health Grant GM60035 (to D.L.H.) and a postdoctoral fellowship from the Ministerio de Ciencia y Tecnología (to J.M.R.). 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32. 33. 34. 35. 36. 37. 38. 39. 40. Hudson, R. R., Kreitman, M. & Aguadé, M. (1987) Genetics 116, 153–159. McDonald, J. H. & Kreitman, M. (1991) Nature 351, 652–654. Lande, R. (1976) Evolution (Lawrence, Kans.) 30, 314–334. Lynch, M. & Hill, W. G. (1986) Evolution (Lawrence, Kans.) 40, 915–935. Kluge, A. G. & Kerfoote, W. C. (1973) Am. Nat. 107, 426–442. Civetta, A. & Singh, R. S. (1998) Evolution (Lawrence, Kans.) 52, 1080–1092. Arbeitman, M. N., Furlong, E. E. M., Imam, F., Johnson, E., Null, B. H., Baker, B. S., Krasnow, M. A., Scott, M. P., Davis, R. W. & White, K. P. (2002) Science 297, 2270–2275. Nurminsky, D. I., Nurminskaya, M. V., De Aguiar, D. & Hartl, D. L. (1998) Nature 396, 572–575. Long, M. & Langley, C. H. (1993) Science 260, 91–95. Kleene, K. C. (2001) Mech. Dev. 106, 3–23. Swanson, W. J., Yang, Z., Wolfner, M. F. & Aquadro, C. F. (2001) Proc. Natl. Acad. Sci. USA 98, 2509–2514. Schadt, E. E., Monks, S. A., Drake, T. A., Lusis, A. J., Che, N., Colinayo, V., Ruff, T. G., Milligan, S. B., Lamb, J. R., Cavet, G., et al. (2003) Nature 422, 297–302. Laurie, C. C. (1997) Genetics 147, 937–951. Fry, J. D., Heinsohn, S. L. & Mackay, T. F. C. (1998) Genetics 148, 1171–1188. True, J. R., Weir, B. S. & Laurie, C. C. (1996) Genetics 142, 819–837. Wu, C.-I. & Davis, A. W. (1993) Am. Nat. 142, 187–212. Tao, Y. & Hartl, D. L. (2003) Evolution (Lawerence, Kans.), in press. Lindsley, D. L. & Tokuyasu, K. T. (1980) in The Genetics and Biology of Drosophila, eds. Ashburner, M. & Wright, T. R. F. (Academic, New York), Vol. 2d, pp. 226–294. Michalak, P. & Noor, M. A. F. (2003) Mol. Biol. Evol. 20, 1070–1076. Haldane, J. B. S. (1922) J. Genet. 12, 101–109. Edgar, R., Domrachev, M. & Lash, A. E. (2002) Nucleic Acids Res. 30, 207–210. EVOLUTION 1. Ewens, W. J. (1979) Mathematical Population Genetics (Springer, Berlin). 2. Darwin, C. (1871) The Descent of Man, and Selection in Relation to Sex (John Murray, London). 3. Coulthart, M. B. & Singh, R. S. (1988) Mol. Biol. Evol. 5, 182–191. 4. Civetta, A. & Singh, R. S. (1995) J. Mol. Evol. 41, 1085–1095. 5. Swanson, W. J. & Vacquier, V. D. (2002) Annu. Rev. Ecol. Syst. 33, 161–179. 6. Swanson, W. J., Clark, A. G., Waldrip-Dail, H. M., Wolfner, M. F. & Aquadro, C. F. (2001) Proc. Natl. Acad. Sci. USA 98, 7375–7379. 7. Parsch, J., Meiklejohn, C. D., Hauschteck-Jungen, E., Hunziker, P. & Hartl, D. L. (2001) Mol. Biol. Evol.18, 801–811. 8. Civetta, A. & Singh, R. S. (1998) Mol. Biol. Evol. 15, 901–909. 9. Britten, R. J. & Davidson, E. H. (1969) Science 165, 349–357. 10. Carroll, S. B., Grenier, J. K. & Weatherbee, S. D. (2001) From DNA to Diversity: Molecular Genetics and the Evolution of Animal Design (Blackwell Science, Malden, MA). 11. Eisen, M. B. & Brown, P. O. (1999) Methods Enzymol. 303, 179–205. 12. Rubin, G. M., Hong, L., Brokstein, P., Evans-Holm, M., Frise, E., Stapleton, M. & Harvey, D. A. (2000) Science 287, 2222–2224. 13. Townsend, J. P. & Hartl, D. L. (2002) Genome Biol. 3, RESEARCH0071.1– 0071.16. 14. Ranz, J. M., Castillo-Davis, C. I., Meiklejohn, C. D. & Hartl, D. L. (2003) Science 300, 1742–1745. 15. Parisi, M., Nuttall, R., Naiman, D., Bouffard, G. G., Malley, J., Andrews, J., Eastman, S. & Oliver, B. (2003) Science 299, 697–700. 16. Jin, W., Riley, R. M., Wolfinger, R. D., White, K. P., Passador-Gurgel, G. & Gibson, G. (2001) Nat. Genet. 29, 389–395. 17. Begun, D. J. & Aquadro, C. F. (1993) Nature 365, 548–550. 18. Hollocher, H., Ting, C.-T., Pollack, F. & Wu, C.-I. (1997) Evolution (Lawrence, Kans.) 51, 1175–1181. 19. Brem, R. B., Yvert, G., Clinton, R. & Kruglyak, L. (2002) Science 296, 752–755. with the idea that rapid evolution of male reproductive characters contributes to Haldane’s rule (39) for hybrid male sterility in Drosophila. Meiklejohn et al. PNAS 兩 August 19, 2003 兩 vol. 100 兩 no. 17 兩 9899