Discrepancies Between Recombination Frequencies and Physical Distances in Aspergillus nidulans: Implications for Gene Identification
Eduardo A. Espeso, Laura Cobeño, Herbert N. Arst, Jr.

Abstract

A rapid route to gene molecular identification involves using recombination frequencies in locating mutational sequence changes. We describe a case where the recombination frequency is deceptively low, probably reflecting centromere proximity. Recombination frequencies are greatly reduced near the centromeres on the right arms of chromosomes III and IV of Aspergillus nidulans.

WITH the advent of whole-genome sequencing, an attractive route for identifying the sequence of a gene for which a mutant allele exists involves mapping the mutation and sequencing plausible candidate open reading frames (ORFs) to locate the mutational sequence change. Sometimes, however, recombination frequencies can be deceptive, as in the case described here.

As part of a project to investigate nuclear localization of the PacC transcription factor of the ascomycete Aspergillus nidulans, we sought spontaneous mutations that suppress the molybdate hypersensitivity phenotype of a pacC allele whose translation product has a K159M mutational change inactivating the PacC nuclear localization signal (described by Fernández-Martinez et al. 2003). We selected for tolerance of 20 mm Na2MoO4 on supplemented minimal medium (Cove 1966). Among the mutations obtained, one, designated halA24, had no effect on PacC nuclear localization or on other aspects of the highly pleiotropic (see reviews by Peñalva and Arst 2002, 2004) mutant pacC allele phenotype apart from molybdate hypersensitivity. However, it has an otherwise interesting phenotype (to be described elsewhere). Using conventional parasexual and sexual mapping techniques (Clutterbuck 1974) facilitated by the molybdate resistance of halA24 pacC+ strains, halA24 was located between phenA and cbxA/carC on the right arm of chromosome III. Two preliminary crosses, with 99 and 100 progeny analyzed, respectively, gave recombination frequencies of 1 cM to phenA2 and 13 and 17 cM, respectively, to cbxA17. Using the A. nidulans genome sequence (http://www.broad.mit.edu/annotation/fungi/aspergillus) to identify likely ORFs, mutational sequence changes were determined for identifying definitively the phenA and cbxA/carC genes (Table 1). Following failure to find the halA24 sequence change in plausible candidate ORFs near phenA on the cbxA side, it was eventually located at >35% of the physical distance between phenA and cbxA (Figure 1). The gene identity has been firmly established and its characterization will be described in detail elsewhere. Its predicted product is very likely to be a Ser/Thr protein kinase (http://www.sanger.ac.uk/Software/Pfam; E-value 5.7e-78) and halA24 tandemly duplicates four residues within the predicted kinase domain (Table 1).

Figure 1.—

Physical maps of the centromere-proximal regions of the right arms of chromosomes III and IV. Contig numbering follows that of the Broad Institute (http://www.broad.mit.edu/annotation/fungi/aspergillus/). Distances between genes or to the end of the available genomic sequence are given. Coordinates following the gene name indicate the coding region with that of the first base of the initiation codon appearing first. Contigs 1.161 and 1.162 were joined by alignment with contig 1.198. To join contigs 1.162 and 1.163, genomic DNA was PCR amplified using oligonucleotides annealing at a distance of 170 nt from the end of 1.162 and the beginning of 1.163. Sequencing of the PCR fragment revealed database errors at the ends of both contigs (a consequence of shotgun cloning) and, after deletion of a total of 36 incorrect nt, a 34-nt addition was necessary to join the two contigs. The chromosome III contigs thus joined in total 429,190 bp, but only 429,190 bp of the 627,835 bp of contig 1.129 from chromosome IV are indicated.

View this table:
TABLE 1

Mutant sequence changes determined in this work

To better document the highly skewed relationship between recombination frequency and physical distance in the phenA-cbxA interval, 392 random progeny were analyzed from the cross biA1; phenA2 cbxA17 × halA24; inoB2; fwA1 (see Clutterbuck 1993 for gene symbols), giving 1.8 ± 0.7 cM between phenA2 and halA24 and 13.5 ± 1.7 cM between halA24 and cbxA17, a 7.5-fold difference. The physical distances are 67,765 and 124,414 bp, respectively (Table 1 and Figure 1), a 1.84-fold difference. Between phenA2 and halA24 1 cM corresponds to 37.6 kb and between halA24 and cbxA17 1 cM corresponds to 9.2 kb. To ensure that this difference is a genuine feature of relative recombination frequencies in these two intervals and not simply the consequence of a previously unrecognized chromosomal rearrangement(s), the two parental strains from this cross were first shown to be translocation-free using the parasexual cycle. Second, a combination of Southern blotting and PCR fragment size determination demonstrated the absence of any significant rearrangement in either parental strain in a region of 96,747 bp extending from 17,847 bp centromere-proximal to phenA2 to 11,134 bp centromere-distal to halA24 (data not shown). It is possible that the recombination frequency continues to rise beyond cbxA. Sealy-Lewis (1987) determined a map distance of 5.1 ± 2.2 cM between cbxA17 and suaB111. Using the physical data in Figure 1 and Table 1 to calculate 38,038 bp between these mutations, this would equate to 7.5 kb/cM. Beyond suaB, the recombination frequency might be even higher in view of the 12.5-kb distance between suaB and amdS coding regions (Figure 1) and the suaB111amdS1350 8.2 cM map distance (Sealy-Lewis 1987), which would result in a recombination frequency <2 kb/cM.

The most obvious explanation for this markedly nonlinear relationship between recombination frequency and physical distance is the proximity of the centromere. Reduced recombination in the vicinity of the centromere was reported first in Drosophila melanogaster (Dobzhansky 1930) and subsequently in other organisms, a particularly elegant demonstration being that of Lambie and Roeder (1986) in Saccharomyces cerevisiae. In A. nidulans a translocation shifting a region near the chromosome II centromere to a much more centromere-distal position on chromosome VIII increases the recombination frequency between two of the translocation-shifted genes by an order of magnitude (Arst et al. 1979). Arguably the most dramatic effect involves the silent mating-type loci of Schizosaccharomyces pombe where no recombination has been detected at a resolution of 0.001 cM in a 15-kb interval sharing homology with centromeric repeats (Egel et al. 1989; Grewal and Klar 1997 and references therein). All eight A. nidulans centromeres lie within gaps in the genomic sequence, but from the contiguous right arm sequence of chromosome III the minimum distance between phenA2 and the centromere is 53,065 bp.

A striking centromere effect on recombination frequency can also be seen on the right arm of chromosome IV, as previously noted by Aleksenko et al. (2001). With a gene order of centromere, hisA/hisHF, gdhB, and meaA and using mapping data from Arst et al. (1989) in conjunction with the genome sequence (see Figure 1) and mutational sequence changes (Table 1), the distances between hisA10 and gdhB1 are 1.2 ± 0.3 cM and 62,103 bp and those between gdhB1 and meaA8 are 5.9 ± 1.3 cM and 33,641 bp. Between hisA10 and gdhB1 there are 51.8 kb/cM, in good agreement with 49.3 kb/cM between hisA134 and gdhB1 reported by Aleksenko et al. (2001), and between gdhB1 and meaA8 1 cM corresponds to 5.7 kb. The higher rate of recombination between phenA2 and halA24 as compared to that between hisA10 and gdhB1 might be explained by the minimum distance of phenA2 to the centromere being approximately twice that of hisA10 to the centromere (53,065 bp vs. 26,205 bp). Caution is needed, however, because the actual distances to the centromere have not been determined. Figure 2 is a summary showing relationships between recombination frequencies and physical distances on the right arms of chromosomes III and IV.

Figure 2.—

Summary indicating meiotic map distances and relationships between recombination frequencies and physical distances.

In the above cases it is very likely that centromere proximity is responsible for greatly reduced recombination frequencies. However, when recombination frequencies are used in conjunction with genome sequences for gene identification, it should be remembered that other factors can also affect the relationship between recombination frequency and physical distance. For brevity we give a few examples from fungi but the factors involved affect recombination frequencies across a wide range of organisms. Such factors include environmental influences such as nutritional state (e.g., Abdullah and Borts 2001), genetic factors including both trans-acting genes (e.g., Catcheside 1981; Yeadon et al. 2004) and cis-acting recombination hotspots (e.g., Pryce et al. 2005; Steiner and Smith 2005), and the long-known effects of chromosomal rearrangements (reviewed by Käfer 1977 and Perkins 1997). These effects can be considerable. For example, effects of more than an order of magnitude have been observed for both trans- and cis-acting genetic factors (Catcheside 1981; Steiner and Smith 2005), and there is evidence that, in the wild, selection operates on recombination frequencies (e.g., Saleem et al. 2001). Judicious choice of strains and crossing conditions and, in extremis, molecular screening for chromosomal rearrangements can facilitate the use of recombination data for gene identification.

Acknowledgments

We are grateful to Heather Sealy-Lewis for suaB111 and to John Clutterbuck for a number of other markers; to Miguel Peñalva, Claudio Scazzocchio, Joan Tilburn, and Ken Haynes for much helpful advice; and to Elena Reoyo and Lily Stanton for technical assistance. We thank the Ministerio de Ciencia y Tecnologia, the Fondo de Investigaciones Sanitarias, and the Wellcome Trust for support through grants BMC2003-00874 to E.A.E., Redemeth G054 to Miguel Peñalva, and 067878 to H.N.A.

Footnotes

  • Sequence data from this article have been deposited with the EMBL/GenBank libraries under accession nos. DQ010319 (phenA), DQ010320 (halA), DQ010321 (cbxA), and DQ010322 (suaB).

  • Communicating editor: J. J. Loros

  • Received April 19, 2005.
  • Accepted July 7, 2005.

References

View Abstract