- Split View
-
Views
-
Cite
Cite
Sarah Thibault-Sennett, Qi Yu, Fatima Smagulova, Jeff Cloutier, Kevin Brick, R Daniel Camerini-Otero, Galina V Petukhova, Interrogating the Functions of PRDM9 Domains in Meiosis, Genetics, Volume 209, Issue 2, 1 June 2018, Pages 475–487, https://doi.org/10.1534/genetics.118.300565
- Share Icon Share
Abstract
Homologous recombination is required for proper segregation of homologous chromosomes during meiosis. It occurs predominantly at recombination hotspots that are defined by the DNA binding specificity of the PRDM9 protein. PRDM9 contains three conserved domains typically involved in regulation of transcription; yet, the role of PRDM9 in gene expression control is not clear. Here, we analyze the germline transcriptome of Prdm9−/− male mice in comparison to Prdm9+/+ males and find no apparent differences in the mRNA and miRNA profiles. We further explore the role of PRDM9 in meiosis by analyzing the effect of the KRAB, SSXRD, and post-SET zinc finger deletions in a cell culture expression system and the KRAB domain deletion in mice. We found that although the post-SET zinc finger and the KRAB domains are not essential for the methyltransferase activity of PRDM9 in cell culture, the KRAB domain mutant mice show only residual PRDM9 methyltransferase activity and undergo meiotic arrest. In aggregate, our data indicate that domains typically involved in regulation of gene expression do not serve that role in PRDM9, but are likely involved in setting the proper chromatin environment for initiation and completion of homologous recombination.
HOMOLOGOUS recombination is required for the faithful segregation of homologous chromosomes during meiosis. It assures that homologous chromosomes find each other and stay connected until the first meiotic division, when they segregate to different daughter cells. Failure to segregate properly results in aneuploidy in gametes and offspring, which accounts for 35% of miscarriages and developmental disabilities in 0.3% of live births (Hassold et al. 2007). The most severe recombination defects lead to meiotic arrest and consequent infertility (reviewed in Handel and Schimenti 2010).
During meiotic recombination, the chromosomes undergo programmed DNA double-stranded breaks (DSBs) (reviewed in Keeney and Neale 2006). This triggers a genome-wide search for the intact homologous chromosome to be used for repair. RAD51 and DMC1 recombinases bind to single-stranded tails of the resected DSB ends and initiate recombinational repair of DSBs that results in either crossover or noncrossover products (Keeney and Neale 2006). Meiotic DSBs preferentially occur in discrete sites of the genome called recombination hotspots (reviewed in Arnheim et al. 2007; Buard and de Massy 2007; Lichten 2008; Clark et al. 2010; Paigen and Petkov 2010). In mice and humans, hotspot locations are defined by the DNA binding specificity of the PR/SET domain containing nine (PRDM9) protein that trimethylates Lysine 4 of the histone H3 (H3K4me3) at the sites where recombination may subsequently occur (Baudat et al. 2010; Myers et al. 2010; Parvanov et al. 2010; Grey et al. 2011; Brick et al. 2012). The DNA binding domain of PRDM9 consists of an array of zinc finger (ZF) domains that vary primarily at three amino acids that define the sequence specificity of DNA binding (reviewed in Wolfe et al. 2000; Persikov et al. 2009). Remarkably, the ZF array is extremely polymorphic and may consist of different ZFs in different individuals, leading to a completely different distribution of recombination hotspots. When PRDM9 is absent, DSBs are formed at so-called default hotspots, the other available H3K4me3 sites within the genome, mostly at gene promoters and enhancer elements (Brick et al. 2012). These default sites are almost never used in wild-type mice, suggesting a dominant direct interaction between PRDM9 and the DSB formation machinery. Indeed, recent studies implicated the KRAB domain of PRDM9 in tethering the hotspot DNA to the chromosome cores, where the DSB machinery resides, through an interaction with CXXC1 (Imai et al. 2017; Parvanov et al. 2017). Prdm9−/− mice undergo meiotic arrest due to recombination failure, leading to sterility in both sexes (Hayashi et al. 2005). Interestingly, the loss of Prdm9 does not universally result in sterility as Prdm9 knockouts in some species remain fertile (Narasimhan et al. 2016), and the functional Prdm9 gene has been lost in several lineages (Ponting 2011; Baker et al. 2017).
PRDM9 belongs to the PRDM protein family, a group that is defined by the presence of a ZF array and the PR/SET domain (reviewed in Hohenauer and Moore 2012). The PR/SET domain of PRDM9 is responsible for histone methyltransferase activity, including mono-, di- and trimethylation of the H3K4 (Hayashi et al. 2005; Wu et al. 2013), as well as H3K36 trimethylation (Eram et al. 2014; Powers et al. 2016). This domain is distantly related to the classic SET domain found in the histone lysine methyltransferases, a protein family involved in regulation of gene expression. The crystal structure of the PRDM9 PR/SET domain suggests that the methyltransferase activity of PRDM9 is autoregulated and the lone ZF located in the post-SET region is involved in this regulation (Wu et al. 2013).
PRDM family members either methylate their histone targets directly or recruit other histone modifiers to chromatin (Hohenauer and Moore 2012). Owing to these activities, the PRDM family is involved in a diverse array of developmental processes through the regulation of gene expression (Hohenauer and Moore 2012). PRDM9, however, is unique in that it also has the Kruppel Associated Box (KRAB) domain (Liu et al. 2014). This leads to inclusion of PRDM9 in a second family of transcription factors, the KRAB-ZFPs (KRAB Zinc Finger Proteins). Members of this family also contain a ZF DNA-binding array, but have a KRAB domain instead of PR/SET (Lupo et al. 2013).
The KRAB domain is typically involved in gene silencing through interaction with the TRIM28 protein, also known as KAP-1 (reviewed in Iyengar and Farnham 2011). TRIM28 interacts with Heterochromatin Protein 1 (HP-1) and serves as a scaffold for recruiting chromatin modifying enzymes and remodeling complexes to promoters of target genes bound by the KRAB-ZFPs (Iyengar and Farnham 2011). Interestingly, the KRAB domain of PRDM9 is missing the amino acids required for interaction with TRIM28 (Liu et al. 2014), and recent studies found no direct interaction between these two proteins (Patel et al. 2016; Imai et al. 2017). In fact, the KRAB domain of PRDM9 has higher similarity to the atypical KRAB domain found in the Synovial Sarcoma X chromosome breakpoint (SSX) protein family (Lim et al. 1998). PRDM9 and the SSX family members have another similarity: they both contain the SSX Repressive Domain (SSXRD) following the atypical KRAB domain. Although the SSXRD was shown to be a potent transcription repressor (Lim et al. 1998) the molecular mechanism behind this function has not been described.
Considering that PRDM9 has three domains implicated in regulation of transcription, it was originally thought of as a transcription factor (Hayashi et al. 2005; Matsui and Hayashi 2007; Mihola et al. 2009; reviewed in Nowick et al. 2013; Capilla et al. 2016). This role was attractive, because Prdm9 is the only known speciation gene in mammals, and the conflict between regulatory mechanisms is perceived as the most likely cause of hybrid incompatibilities (Nowick et al. 2013; Mack and Nachman 2017). Although later studies suggested a different role of Prdm9 in speciation (Davies et al. 2016; Smagulova et al. 2016) they did not rule out the possibility of additional mechanisms that may involve misregulation of transcription. Furthermore, it was recently reported that, during meiosis, PRDM9 binds to some active promoters (Grey et al. 2017), bringing back the question of the possible regulatory function of PRDM9 in gene expression. In this study we evaluate the potential role of PRDM9 in transcription and the role of transcription-related domains within PRDM9 in PRDM9 function.
We first used mRNA sequencing to demonstrate that inactivation of Prdm9 does not lead to changes in gene expression. We then showed that PRDM9 does not appear to regulate miRNA expression as well. We next assessed the effect of the KRAB domain, SSXRD, and post-SET ZF deletions on methyltransferase activity of PRDM9 expressed in cell culture. Although deletion of SSXRD completely abolished histone methylation by PRDM9, the KRAB and post-SET ZF mutants retained methyltransferase activity. To assess whether the KRAB domain is essential for maintaining the meiotic program, we generated the KRAB domain deletion mouse strain and found that mutant mice undergo meiotic arrest resulting in sterility. We found only a trace amount of H3K4me3 at the sites of expected hotspots. However, DSB formation was evident at ∼30% of PRDM9-dependent H3K4me3 sites in the KRAB mutant, raising the possibility that deletion of the KRAB domain does not completely abolish communication of PRDM9 with the DSB formation machinery.
Materials and Methods
Mice and cell lines
Cell-culture-based PRDM9 mutant analyses were performed using the GC-1 spg immortalized BALB/c cell line derived from spermatogonial cells (ATCC CRL-2053; Hofmann et al. 1992). All animal procedures have been approved by the Uniformed Services University of Health Sciences (USUHS) Animal Care and Use Committee. Hop2 knockout mice have been previously described (Petukhova et al. 2003). Prdm9 knockout mice have been previously described (Hayashi et al. 2005) and were obtained from the Jackson Laboratory (stock number 010719). Spo11 knockout mice (strain deltaSpo.BC/B6) have been described in Smagulova et al. (2013). C57Bl/6 was used as a wild-type strain and all mutant strains were on the C57Bl/6 background. Adult (2–6 month old) mice were used for the analyses unless specifically mentioned in the text. Prdm9K mice were generated on a C57Bl/6 background as outlined in Supplemental Material, Figure S2 and Table S4. Targeting design and pronuclear injections were done by University of California Irvine transgenic mouse facility. Genomic DNA was sequenced to confirm the correct deletion.
RNA sequencing
Total RNA was isolated from whole testes using Qiagen RNeasy mini kit (74104). Polyadenylated RNA was purified using Oligotex beads (70022; Qiagen). Quality of the RNAs was verified on Bioanalyzer and RNA was fragmented to ∼200 bp. Sequencing libraries were prepared using mRNA-Seq Sample Preparation Kit (RS-930-10; Illumina) and sequencing was performed using Illumina X2500 platform with 50 bp paired-end reads. Reads for each sample were quantified against mouse GRCm38 cDNA using kallisto0.43.0 (Bray et al. 2016), bootstrap = 10 was used to measure the accuracy of the quantification (which does not influence quantification). Sleuth (Pimentel et al. 2017) was used to perform the differential expression tests of genes and transcripts from kallisto quantifications. Genes were identified as expressed if transcripts per million reads (TPM) > 0.1 in two replicates (Everaert et al. 2017).
miRNA analysis
Small RNAs were isolated from whole testes using the mirVana miRNA Isolation Kit (Thermo Fisher Scientific) according to the manufacturer’s instructions. The resulting RNA (∼50 µg) was heat-denatured, loaded and resolved on the 15% denaturing acrylamide gel. Two radiolabeled RNA oligos of 18 and 28 bases were used as the ladder. The gel was run for 3 hr at 45 W. The gel slice containing RNA from 19 to 27 bases was excised and RNA was eluted in 800 µl of 0.4 M NaCl overnight. RNA was precipitated with ethanol and resuspended in 6 µl of H2O (∼200 ng of RNA). Library was prepared with the NEBNext Small RNA Library Prep Set for Illumina (E7330S; New England BioLabs) following the manufacturer’s instructions. The microRNA samples were sequenced using Illumina HiSequation 2500 using single-end reads. After trimming the adapter, reads were filtered using the length threshold of 17–29 bp. Reads were mapped to the mouse (mm10) genome using bowtie 0.12.9 (Langmead et al. 2009) with parameters of “bowtie -f -n 0 -e 80 -l 18 -a -m 5 –best –strata” in MiRDeep2 v2.0.0.7 (Friedländer et al. 2012), bowtie “best strata” has been shown to be one of the most sensitive and robust alignment methods for miRNA analysis (Tam et al. 2015). The quantification and novel miRNAs identification (used miRNAs from other species as templates) was performed by MiRDeep2 v2.0.0.7 against annotated mouse miRNAs of miRBase (Version 21) (Kozomara and Griffiths-Jones 2014). Significance testing for the differential expression analysis was performed using DESeq2 (Love et al. 2014). Common novel miRNAs between the four samples were defined as novel miRNAs (with TPM > 10) that have 100% sequence overlap between each other.
GFP-PRDM9 constructs
Full length B6 mouse PRDM9 following GFP was cloned into pcDNA4-Myc-His-B (ThermoFisher). A linker region of the following sequence separated GFP from PRDM9 “GGAGGCTCTGGCGGATCTGGAGGGTCTGGAGGT.” The PRDM9 start codon was removed but the stop codon was maintained. Mutant constructs were generated using the Q5 Site-Directed Mutagenesis Kit (New England Biolabs). The specific mutations for constructs in Figure 2 are as follows:
SET: Y345F; KRAB1: del(D30-P87), KRAB2: del(K27-V198V), SSXRD: del(L175-Y206); ZincF: del(H392-W413). GFP-STOP was generated by the following mutation in linker region above: G1T, G2A.
Cell culture and transfection
GC-1 cells were grown in DMEM supplemented with 10% FBS and 5% Penn-Strep at 37° in 5% CO2. Cells were plated and allowed to grow for 15 hr before transfection using Lipofectamine 3000 (Thermo Fisher Scientific). Cells were allowed to grow in transfection solution for 15 hr before collection, and typically showed 50–60% transfection efficiency.
Chromatin immunoprecipitation followed by high-throughput sequencing
GC-1 cells were fixed directly on the plate as described in Baker et al. (2015). Chromatin immunoprecipitation (ChIP) for testis and GC-1 cell samples was carried out as previously described (Brick et al. 2012).
For single nucleosome resolution, H3K4me3 ChIP followed by high-throughput sequencing (ChIP-seq) instead of sonication the fixed nuclei were resuspended in MNase buffer (50 mmol Tris-HCl, 1 mmol CaCl2, 4 mmol MgCl2, 4% NP-40) at 26 μl/106 cells, supplemented with 1 mmol PMSF and 1× cOmplete, EDTA-free Protease Inhibitor Cocktail (Sigma-Aldrich). The Micrococcal nuclease digestion was then carried out as described in Billings et al. (2013). Following digestion, the sample was diluted 3:1 in ChIP buffer (0.1% SDS, 1% NP-40, 2 mM EDTA, 20 mM Tris-HCl pH 8, 150 mM NaCl) and H3K4me3 ChIP was performed as in Brick et al. (2012). The following antibodies were used: anti-DMC1, Santa Cruz sc8973; anti-H3K4me3, Millipore #07-473. H3K4me3 sequencing was performed with 75 or 150 bp SE reads on an Illumina NextSeq500 and SSDS sequencing was performed with 50 bp PE reads on an Illumina HiSequation 2500. H3K4me3 ChIP-sequencing reads were aligned to the mouse genome mm10 using bwa aln (Li and Durbin 2009). Overlap of reads with either promoters or known hotspots was performed using bedtools 2.26.0 Intersect. Bigwig files were generated with deepTools 1.5.11 bam-Coverage with the following parameters: -binSize 10 -centerReads -normalizeUsingRPKM. Bigwig files were then used to generate heatmaps across promoters or hotspots using deepTools computeMatrix reference-point followed by plotHeatmap. SSDS reads were aligned to the genome using bwa aln (0.7.12) and ssDNA derived reads were identified using the single stranded DNA sequencing (SSDS) processing pipeline (Khil et al. 2012; https://github.com/kevbrick/SSDSpipeline.git).
Identification of DSB hotspots
Uniquely mapped fragments unambiguously derived from ssDNA (ssDNA type 1) and having both reads with a mapping quality score ≥30 were used for identifying hotspot locations (peak calling). Normalization of ChIP-seq (NCIS) was used to estimate the background fraction for each library. Peak calling was performed using MACS (v.2.1.0.20150420) (Zhang et al. 2008) with the following parameters: -ratio [out-put from NCIS] -g mm -bw 1000 -keep-dup all -slocal 5000. DSB hotspots within regions previously blacklisted (Smagulova et al. 2016) were removed and hotspot strength was calculated as described previously (Smagulova et al. 2016). The pipeline for calling DSB hotspots from SSDS can be obtained from (https://github.com/kevbrick/callHotspotsSSDS.git). The fraction of reads in peaks is calculated as the number of ssDNA fragments that overlap a feature (hotspots or TSSs) divided by the total number of ssDNA fragments.
Assessment of PRDM9 expression in GC-1 cells
Live cell images were taken 12 hr post transfection to confirm nuclear localization and estimate expression level of GFP-PRDM9 constructs. During the ChIP procedure, a 50 μl aliquot of sheared chromatin lysate was removed and frozen at −80°. The sample was incubated at 95° for 10 min in SDS loading buffer with DTT and resolved in a Bolt Bis-Tris Plus acrylamide gel followed by transfer onto polyvinylidene difluoride (PVDF) membrane. The following antibodies were used: rabbit anti-GFP (AB290; Abcam) and rabbit anti-CTCF (AB70303; Abcam). The blots were developed using SuperSignal West Pico PLUS Chemiluminescent Substrate.
Assessment of PRDM9 expression in mouse testis
Testes were removed and stored at −80° in RNALater (Thermo Fisher Scientific). Total RNA was extracted using RNeasy plus mini kit (Qiagen) and the cDNA was prepared with the Transcriptor First Strand cDNA Synthesis Kit (Roche). Quantitative PCR was performed using the SYBR Green PCR Master Mix (Thermo Fisher Scientific) using a 7500 AB Real-Time PCR system with ABI SDS Software. The primers used are listed in Table S5; Gapdh primers are from Spandidos et al. (2010).
Histology, chromosome spreads, and immunostaining
Testes or ovaries were dissected and stored in buffered formalin (1:10 dilution). Tissue sections with hematoxylin/eosin staining was done by American HistoLabs (Gaithersburg, MD). Meiotic chromosomes spreads were made and immunostained as described in Smagulova et al. (2013). The following primary antibodies were used: anti-SCP3 (Petukhova et al. 2003); anti-γH2AX (4411-PC-100; Trevigen). Samples were imaged using Leica fluorescent DM5500 microscope with Retiga camera and velocity acquisition system.
Data availability
Mouse strains are available upon request. Gene expression and ChIP-seq data are available at GEO with the accession number: GSE104850. Supplemental material available at Figshare: https://doi.org/10.25386/genetics.6157487.
Results
Inactivation of Prdm9 does not lead to changes in gene expression
Since the SET domain, the KRAB domain and the SSXRD are usually involved in regulation of gene expression, we assessed whether PRDM9 indeed is involved in transcription control. We carried out high throughput sequencing of mRNA (mRNA-seq) from testes of Prdm9−/− mice. As Prdm9−/− spermatocytes undergo meiotic arrest at pachytene-like stage (Hayashi et al. 2005), testes of the Prdm9−/− mice lack germ cells of later stages leading to a vastly different cell type composition compared to that in wild-type testes. Consequently, comparing the transcriptome between wild type and Prdm9 knockout mice would not be meaningful. To get around this problem we used Hop2−/− mice as the Prdm9+/+ control. In Hop2−/− mice, meiotic DSBs are formed, but not repaired, which leads to meiotic arrest at the stage similar to Prdm9−/−, with a very similar cell type composition (Petukhova et al. 2003; Hayashi et al. 2005).
We examined differences in gene expression using two biological replicates of mRNA-seq for Prdm9−/− and Hop2−/− mice (Table S1). The correlation between Prdm9−/− and Hop2−/− expression is very strong across annotated genes that are expressed in at least one sample (TPM > 0.1, Figure 1A, Spearman’s R = 0.98–0.99). This indicates that the absence of Prdm9 does not perturb the global gene expression pattern. As expected, the transcriptome of wild-type mice showed low similarity with the mutants (Figure 1A, Spearman’s R = 0.69–0.70), reflecting the presence of the late meiotic and postmeiotic cells in wild type testes.
PRDM9 has been implicated in the regulation of the Morc2b gene expression because of altered expression level in Prdm9−/− testes (Hayashi et al. 2005) and in sterile hybrids between Mus musculus musculus and Mus musculus domesticus subspecies (Mihola et al. 2009). We found that in wild-type mice there is a 300-fold change in the Morc2b expression over the course of spermatogenesis, with a sharp increase starting at the onset of pachytene stage (Figure 1B). Therefore, even very slight changes in the testes cell population due to meiotic arrest or delay would lead to detectable changes in the abundance of the Morc2b transcript, explaining prior observations. Indeed, we found no significant difference (q-value = 1) in the Morc2b expression level between Prdm9−/− (mean TPM = 1.44, Prdm9−/− #1 TPM = 1.04, Prdm9−/− #2 TPM = 1.84) and Prdm9+/+Hop2−/− mice (mean TPM = 0.38, Hop2−/− #1 TPM = 0.29, Hop2−/− #2 TPM = 0.48), indicating that Morc2b transcription is unlikely to be regulated by PRDM9. Overall, we did not detect any differentially expressed genes or transcripts between the Prdm9−/− and Prdm9+/+Hop2−/− samples (q-value < 0.05).
We next explored whether PRDM9 could play a role in regulating miRNAs, as they have been shown to play crucial roles in meiosis (Yu et al. 2005; Song et al. 2009; Royo et al. 2015). To compare miRNA levels we considered miRNAs that were detected at ≥10 TPM in at least one miRNA sequencing library prepared from two Hop2−/− and two Prdm9−/− samples (Table S2). We found that there is a strong overall correlation of miRNA profiles between Prdm9−/− and Hop2−/− samples (R = 0.94–0.97, Figure 1C), and that there are no differentially expressed miRNAs between the Prdm9−/− and Hop2−/− samples (adjusted P-value < 0.05). In addition to annotated miRNAs we identified mature miRNAs that were not described in the miRBase (release 21), 12 of which were present in all four samples (Table S3). The majority of these novel miRNAs had very low expression level (1000 times lower than annotated miRNAs) and none of them showed differential expression between the Prdm9−/− and Hop2−/− samples (adjusted P-value < 0.05). Although it remains possible that transcripts below our detection level are being influenced by PRDM9, our data indicate that inactivation of the Prdm9 gene does not lead to significant changes in transcription, either of protein coding genes or of miRNAs.
Neither the KRAB domain nor the post-SET zinc finger are essential for H3K4me3 activity of PRDM9 in cell culture
Since we found no evidence of PRDM9 being involved in regulation of transcription, we decided to assess whether the KRAB domain and the SSXRD were required for other PRDM9 activities. We first expressed the wild type GFP-tagged PRDM9 (Dom2 allele (Parvanov et al. 2010), WT in Figure 2) in the GC-1 mouse spermatogonial cell line (Hofmann et al. 1992) and evaluated the methyltransferase activity of PRDM9 by H3K4me3 ChIP-seq. As expected, we found a marked enrichment of H3K4me3 reads at the known Dom2 hotspots, but not at hotspots defined by a different Prdm9 allele, 13R (Brick et al. 2012) (Figure 2B). This enrichment was not due to expression of the endogenous Prdm9, because no H3K4me3 signal was detected at Dom2 hotspots in GC-1 cells before transfection (Figure 2B, GC-1 cells carry the Dom2 Prdm9 allele).
Following confirmation that the expressed GFP-PRDM9 was active, we used targeted mutagenesis to generate two alternative GFP-PRDM9 constructs that lacked the KRAB domain (KRAB-1 and KRAB-2, Figure 2A and Figure S1) and assessed their activity by H3K4me3 ChIP-seq. Additionally, we generated a point mutant with a single Y341F amino acid change in the active site of the PR/SET domain (SET, Figure 2A and Figure S1). This mutation was previously shown to abolish methyltransferase activity of PRDM9 (Wu et al. 2013) and serves in this study as a negative control. H3K4me3 is a relatively common epigenetic modification that is usually found at gene promoters and enhancer elements. H3K4me3 in these regions is not dependent on PRDM9 and therefore, we used the H3K4me3 ChIP-seq signal at promoters as an internal control for the ChIP efficiency. This allowed us to compare H3K4 trimethylation activity at hotspots across multiple wild type and mutant samples (Figure 2B).
We found that both KRAB mutants retained methyltransferase activity (Figure 2B). Relative to the H3K4me3 signal at transcription start sites, the magnitude of this signal was similar to that in cells expressing wild type PRDM9 (HS:TSS ratio, Figure 2B). Since the KRAB-2 mutant also lacks 24 out of 32 amino acids of the SSXRD we assessed whether the SSXRD is required for PRDM9 function. Unexpectedly, removing the entire SSXRD (SSXRD, Figure 2A and Figure S1) resulted in complete inactivation of PRDM9 with no trace of H3K4me3 enrichment at the Dom2 hotspots (Figure 2B). We confirmed that the SSXRD mutant protein product was expressed at comparable to wild type level and properly localized to the nucleus (Figure S1); therefore, it is unlikely that the lack of the H3K4me3 signal was due to low level of the mutant PRDM9.
We next tested whether the post-SET zinc finger had an effect on the H3K4 methyltransferase activity of PRDM9. This lone zinc finger has been proposed to regulate the PR/SET catalytic activity by blocking access to the active site in the absence of histones (Wu et al. 2013). We found that deletion of this zinc finger (ZincF, Figure 2A and Figure S1) did not reduce PRDM9 activity and H3K4 trimethylation marks were still detected at expected hotspots (Figure 2B).
The KRAB domain is necessary for PRDM9 activity in vivo
After we found that deletion of the KRAB domain does not abolish methyltransferase activity of PRDM9 we wanted to investigate the effect of the KRAB domain deletion in meiosis. It was recently proposed that the KRAB domain of PRDM9 mediates tethering of the hotspot DNA to chromosome cores, where the proteins required for the DSB formation are located (Imai et al. 2017; Parvanov et al. 2017). To test this, we used CRISPR-based gene targeting to generate a knock-in mouse strain that lacks the KRAB domain of PRDM9, Prdm9K/K (Figure S2 and Table S4). The mutant Prdm9K allele matches exactly the KRAB-1 deletion mutant that was tested in cell culture and lacks the majority of the KRAB domain while retaining three N- and three C-terminal residues of the domain.
We found that both male and female Prdm9K/K mice were infertile. Prdm9K/K ovaries were greatly reduced in size and had no follicles (Figure S3). Testes size was also reduced and seminiferous tubules lacked spermatids, although primary spermatocytes were easily detectable (Figure S3). To evaluate the time of arrest more precisely, we analyzed meiotic progression by immunostaining chromosome spreads of the Prdm9K/K spermatocytes. We found that spermatocytes arrested at the stage resembling pachynema, but DSBs were only partially repaired and synapsis of the homologous chromosomes was incomplete (Figure 3). Overall, the phenotype of the Prdm9K/K mice was indistinguishable from that of the Prdm9−/− mutant (Figure 3). While we could not assess the protein level of PRDM9 in testes, we confirmed that the expression of the Prdm9 KRAB deletion mutant transcript was not different from that of wild type Prdm9 (Figure S2). As this same allele yields a protein product in GC-1 cells at the level similar to wild type Prdm9 (Figure S1) and retains the methyltransferase activity (Figure 2B), we do not expect that PRDM9 lacking the KRAB domain is unstable in vivo. This makes it unlikely that the null phenotype observed in the Prdm9K/K mouse is due to lack of the Prdm9K/K protein product. Together these data indicate that the KRAB domain of PRDM9 is indispensable for meiosis.
To understand the reason for meiotic failure in the Prdm9K/K mice we evaluated whether the methyltransferase activity retained by the PRDM9 KRAB mutants in GC-1 cells (Figure 2B) was also detectable in the Prdm9K/K mice. We found only trace levels of H3K4 trimethylation at Dom2 hotspots (Figure 4A). This suggests that the methyltransferase activity observed in cell culture was likely the result of Prdm9 over-expression (Figure S1), although we cannot exclude the possibility that some complex meiosis-specific regulatory mechanisms gave rise to this difference.
Intriguingly, the purified PRDM9 PR/SET domain alone is capable of carrying out methyltransferase activity in vitro using synthetic peptides as targets (Wu et al. 2013). As PRDM9 lacking the KRAB domain is largely incapable of trimethylating H3K4 in testes it is possible that the KRAB domain or KRAB-mediated interactions with other proteins (Imai et al. 2017; Parvanov et al. 2017) are required for the PR/SET domain activity within the chromatin, where the access of PRDM9 to DNA or histone targets may be restricted. Dynamic interactions between PRDM9 and chromatin was previously proposed, when H3K4me3 profiling at recombination hotspots revealed nucleosome depleted regions at PRDM9 binding sites (Baker et al. 2014). To determine whether this was mediated by PRDM9 rather than downstream DSB formation and processing we carried out H3K4me3 profiling in the Spo11−/− mice that are deficient in DSB formation (Baudat et al. 2000; Romanienko and Camerini-Otero 2000). We found a clear nucleosome depleted region at hotspot locations, confirming that nucleosome reorganization at these sites occurs before the formation of DSBs, as a result of PRDM9 activity (Figure S4).
We next mapped DSB hotspots in the Prdm9K/K mice by ChIP with DMC1 antibodies followed by single-stranded DNA sequencing (SSDS; Khil et al. 2012). Not surprisingly, this analysis revealed that the vast majority of DSBs in the Prdm9K/K spermatocytes formed at promoters and enhancers, the default hotspots where DSBs form in the absence of PRDM9-dependent H3K4me3 marks (Brick et al. 2012) (Figure 4, B and C). Nevertheless, a small number of DSB hotspots formed at locations defined by PRDM9. With few exceptions, these were much weaker, but approximately half of them had a clear H3K4me3 signal, confirming that these were bona fide PRDM9-dependent hotspots. This indicates that the Prdm9 mutant without the KRAB domain is largely incapable of trimethylating H3K4 at its binding sites in vivo, but DSB formation at the rare sites where such methylation occurs is not impaired.
Discussion
PRDM9 contains three different domains that have been implicated in regulation of transcription in other proteins: the KRAB domain, the SSXRD, and the PR/SET domain. Based on this, PRDM9 was assumed to be a transcription factor and early reports supported this assumption (Hayashi et al. 2005; Matsui and Hayashi 2007; Mihola et al. 2009; reviewed in Nowick et al. 2013; Capilla et al. 2016). Once the role of PRDM9 in defining recombination hotspots was identified (Baudat et al. 2010; Myers et al. 2010; Parvanov et al. 2010) it became clear that, rather than regulation of transcription, the methyltransferase activity of the PRDM9 PR/SET domain is needed to introduce H3K4me3 marks at the sites where recombination will be subsequently initiated. However, the role of the KRAB domain and the SSXRD remained uncertain. In this study, we found no change in global gene expression profile in Prdm9−/− mice, indicating that PRDM9 is not involved in transcription regulation; thus, the function of the KRAB and SSXRD domains of PRDM9 must be distinct from a role in transcriptional regulation.
The KRAB domain of PRDM9 has a noncanonical structure and shows ∼25-fold weaker repression activity than the classic KRAB domains of the KRAB-ZFP family in in vitro assays (Lim et al. 1998). Therefore, involvement of the KRAB domain of PRDM9 in gene expression was somewhat unlikely. Indeed, two papers recently suggested that the role of the KRAB domain may be to anchor the PRDM9-bound DNA regions to the cores of meiotic chromosomes where the factors required for DSB formation are located. PRDM9 may exert this function through interaction with the CXXC1 protein (Imai et al. 2017; Parvanov et al. 2017), the mammalian homolog of yeast Spp1. Spp1 has a PHD domain that binds H3K4me3, but it also interacts with Mer2, the axis-associated protein required for DSB formation (Acquaviva et al. 2013; Sommermeyer et al. 2013). Therefore, Spp1 provides a link allowing H3K4 trimethylated chromatin to be physically anchored to the chromosome axis where much of the DSB formation machinery resides. Yeast lacks the PRDM9 protein, and recombination hotspots occur at H3K4me3 sites, usually promoters, regardless of the underlying DNA sequence (Wu and Lichten 1994; Borde et al. 2009; Pan et al. 2011). A similar situation occurs in Prdm9 knockout mice, where DSBs are formed at promoter regions (Brick et al. 2012). In the presence of PRDM9, CXXC1 would interact with both PRDM9 (Imai et al. 2017; Parvanov et al. 2017) and H3K4me3 (Eberl et al. 2013) and then be tethered to the chromosome axis through interaction with IHO1 (Imai et al. 2017), the mammalian homolog of Mer2.
Although the role of PRDM9 in bringing hotspot regions to the axis-associated DSB promoting factors is plausible, it does not explain why removal of the KRAB domain leads to the inhibition of the methyltransferase activity of PRDM9 during meiosis. The other six mammalian methyltransferases that methylate H3K4 (SET1A, SET1B and MLL1 through MLL4 in human) are catalytic cores in six nonredundant COMPASS complexes that have common, as well as complex-specific, subunits (Shilatifard 2012; Vedadi et al. 2017). Interestingly, CXXC1 that interacts with the KRAB domain of PRDM9 (Imai et al. 2017; Parvanov et al. 2017) is a subunit of the SET1A/B COMPASS, and removal of its yeast counterpart Spp1 leads to 80% reduction in the methyltransferase activity of the yeast COMPASS complex (Schneider et al. 2005). Therefore, it is conceivable that PRDM9 can also assemble into a COMPASS-like complex and recruit a CXXC1 subunit through its KRAB domain. In this case, removal of the KRAB domain would result in dissociation of CXXC1 and loss of methyltransferase activity. Whether PRDM9 is indeed a part of a meiosis-specific canonical COMPASS complex or a part of an alternative complex, it looks like the KRAB domain has an earlier role in meiosis by facilitating H3K4 trimethylation and possibly other chromatin remodeling steps required for subsequent DSB formation and/or repair.
The deletion in one of our KRAB mutants (KRAB-2) included the entire KRAB domain and 24 out of 32 amino acids of the SSXRD. Since the resulting protein showed similar methyltransferase activity to the KRAB-1 mutant in cell culture, it appeared that the SSXRD did not have an additional role in the H3K4 trimethylation activity of PRDM9, at least in a nonmeiotic environment. It was therefore surprising that deletion of the entire SSXRD resulted in a complete loss of methyltransferase activity. Although it is possible that the last eight amino acids of the SSXRD are critical for its function, we cannot rule out the possibility that this deletion resulted in misfolding of the protein and indirect inhibition of the PR/SET domain.
In addition to the unknown roles of the KRAB domain and the SSXRD of PRDM9, the function of the single post-SET ZnF located right downstream of the SET domain was also unclear. While the other ZnFs in PRDM9 recognize and bind DNA, removal of this lone zinc finger does not affect the DNA-binding ability of PRDM9 in in vitro assays (Walker et al. 2015). At the same time, the post-SET ZnF was implicated in the autoregulation of the methyltransferase activity of PRDM9 through blocking the substrate binding site (Wu et al. 2013). If such regulation indeed occurred, removal of the post-SET ZnF would either increase the methyltransferase activity of PRDM9 (due to opening of the substrate binding site) or decrease it (if, for example, PRDM9 were automethylated and this inhibited the PR/SET activity and/or PRDM9 interactions with other proteins or DNA). As we found no significant changes in methyltransferase activity of PRDM9 with or without the post-SET ZnF in GC-1 cells, the regulatory role of the post-SET ZnF may be not that straightforward and needs to be evaluated in vivo using mouse models.
A recent study described the development of the Prdm9 mutant mouse strain that lacked the N-terminal part of the protein including a portion of the KRAB domain (Imai et al. 2017). Although it was not clear whether the mutant phenotype was due to this truncation or because of substantially reduced expression level of the product, it matched the phenotype observed in our Prdm9K/K strain. We could not assess the level of PRDM9 protein in our mutant due to lack of antibodies of sufficient quality. However, the mRNA levels of Prdm9K/K and wild type Prdm9 were similar (Figure S2), and, in cell culture, wild type Prdm9 and the same KRAB mutant produced a comparable amount of protein product (Figure S1). Furthermore, we were able to detect residual H3K4me3 activity at the sites of expected hotspots and the evidence of DSB formation (Figure 4). If the KRAB domain of PRDM9 is required to communicate with the DSB formation machinery through interaction with CXXC1 (Imai et al. 2017) then deletion of this domain should prevent DSB formation at PRDM9 binding sites. The fact that these DSBs still form suggests that either (1) PRDM9 communicates with the DSB machinery through additional regions, outside the KRAB domain or (2) CXXC1 alone or within the SET1A/B COMPASS can interact with any H3K4me3 sites in the genome, regardless of whether they were introduced by PRDM9 or SET1A/B, and tether these sites to chromosome cores to enable DSB formation (Imai et al. 2017). Further studies will help to distinguish between these possibilities.
To summarize, we found that the PRDM9 protein is not involved in regulation of gene expression. Instead, the KRAB domain of PRDM9 appears to be required for the efficient H3K4 trimethylation activity of PRDM9 in vivo. The vast majority of meiotic DSBs in the Prdm9K/K mutants are formed at default hotspots, but those few H3K4me3 marks that are introduced by the KRAB-less PRDM9 are also recognized by the DSB formation machinery.
Acknowledgments
We thank Florencia Pratto for critical feedback on the manuscript, and members of the Camerini-Otero and Petukhova laboratories for helpful discussions and suggestions. We thank Jon Neumann and University of California (UC) Irvine Transgenic Mouse Facility for CRISPR targeting design and pronuclear injections. We thank both the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) genomics core and the American Genome Center at Uniformed Services University of Health Sciences (USUHS) for assistance with sequencing. This study used the high-performance computational capabilities of the Biowulf Linux cluster at the National Institutes of Health (http://biowulf.nih.gov.lrc1.usuhs.edu). This study was supported by grants R01GM084104 and R01GM124402 from the National Institute of General Medical Sciences (G.V.P.) and the NIDDK Intramural Research Program (R.D.C.-O.).
Author contributions: S.T.-S. generated the mutant Prdm9 constructs, performed the cell culture experiments and characterized the Prdm9K mouse model. F.S. cloned the PRDM9 cDNA and prepared RNA-seq libraries. J.C. constructed the miRNA libraries. S.T.-S. analyzed ChIP-seq data, Q.Y. analyzed the RNA-seq and microRNA-seq data. K.B. performed and supervised computational analyses. S.T.-S. and Q.Y. drafted the manuscript; S.T.-S., Q.Y., and K.B. designed and prepared the figures. All authors reviewed and commented on the manuscript. R.D.C.-O. and G.V.P. supervised the study.
Footnotes
Supplemental material available at Figshare: https://doi.org/10.25386/genetics.6157487.
Communicating editor: D. Bishop
Literature Cited
Author notes
These authors contributed equally to this work.
Present address: 9 Avenue du Professeur Léon Bernard, 35000 Rennes, France.