Substrate selectivity of the human RNA m5C methyltransferase NSUN2
Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
NSUN2 is the primary m5C writer for mRNAs, and its dysregulation has been implicated in numerous human diseases10,11,12,13. Loss-of-function mutations in NSUN2 cause intellectual disability and Dubowitz-like syndrome, as NSUN2 is important for neural circuit formation and synaptic function14,15,16,17. NSUN2 also regulates ageing, tissue regeneration and tumourigenesis18,19,20,21,22,23,24. Notably, NSUN2 copy number is amplified in more than 10% of non-small-cell lung cancers25 (Extended Data Fig. 1a), and dysregulated m5C metabolism also promotes leukaemogenesis26. Therefore, NSUN2 is an important biological regulator and a potential therapeutic target.
Each m5C writer exhibits distinct substrate preferences5. NSUN2 is an especially versatile methyltransferase that can modify many types of RNA, including mRNAs, tRNAs, various ncRNAs and viral RNAs27,28,29. Even within tRNAs, NSUN2 can methylate multiple positions30,31,32. However, the mechanism underlying the specific and regulated methylation activity of NSUN2 remains unknown, and transcriptome-wide studies yield no clear consensus motif10,11,13,33. Deciphering the molecular mechanisms underlying NSUN2 activity is important for understanding its broader impact on human health and pathology.
Here, we describe the mechanism by which NSUN2 specifically methylates RNA. We present a high-resolution cryogenic electron microscopy (cryo-EM) structure (2.57 Å) of an NSUN2–RNA reaction intermediate. Moreover, we provide eight more cryo-EM structures of NSUN2–substrate complexes at various stages of the catalytic cycle to explain how an interplay among different RNA conformational states, cofactor binding and specific protein–RNA contacts drives reaction progress. Furthermore, we show how the methyltransferase can modify multiple sites within the same tRNA scaffold while maintaining specificity. We identify a dual-stem structure with a bulge-like linker and an m5CNNRR motif as the preferred features of NSUN2 substrates. We introduce a simplified, minimal substrate to demonstrate how NSUN2 targets are more generally recognized. Overall, our findings show the molecular basis of m5C modification, providing a framework for understanding its role in biology and disease.
The alternative text for this image may have been generated using AI.Full size imagea, In vitro methylation activity of full-length wild-type (WT) human NSUN2 with indicated tRNA substrates, shown as mean ± standard deviation (s.d.) (n = 3). Statistical significance is indicated (two-tailed unpaired t-test). DPM, disintegrations per minute. b, In vitro crosslinking activity of NSUN2C271A with indicated tRNA substrates. Representative SDS–PAGE gel from three replicates is shown. For gel source data, see Supplementary Fig. 1. c, Domain organization of human NSUN2. d, Sharpened cryo-EM density map of the NSUN2C271A–tRNALysCTT complex with bound SAH (pink) and RNA (orange) in the D-arm conformation. The NSUN2 portion is coloured by domain as marked in c. e, Cloverleaf diagram of tRNALysCTT used in cryo-EM. Protein–RNA contacts (distance <3.8 Å) are coloured by the interacting protein domain, as in c, with the target cytosine highlighted in yellow. Unresolved nucleotide 76 is shown in grey. f–h, Models of the NSUN2C271A–tRNALysCTT–SAH complex in the D-arm conformation in cartoon representation from three different views, with SAH as pink sticks and m5Cyt48 as cyan sticks.
To gain insight into the catalytic activity of NSUN2, we sought to stabilize the reaction intermediate by leveraging its unique chemical mechanism. According to the proposed mechanism conserved for m5C methyltransferases34,35, NSUN2 forms a covalent linkage with the substrate cytosine base through a cysteine (C321), activating the methyl transfer from S-adenosylmethionine (SAM) to the cytosine and yielding S-adenosylhomocysteine (SAH) as a by-product; subsequently, a second cysteine (C271) is necessary to resolve the catalytic intermediate (Extended Data Fig. 1e,f). We mutated C271 to alanine (C271A) to inhibit the dissolution of the protein–RNA conjugate, as an intermolecular crosslink would increase the stability of the protein–RNA complex, enabling us to determine a higher-resolution complex structure and reveal a view of the catalytic intermediate. NSUN2–tRNA conjugate formation requires SAM and Cyt48, and the specific crosslinking activity is readily detectable as the conjugate is resistant to denaturing conditions (Fig. 1b).
We used single-particle cryo-EM to determine the structure of an NSUN2C271A–tRNA crosslinked complex at 2.57 Å resolution (Fig. 1c–e, Extended Data Table 1 and Extended Data Figs. 1 and 2). The overall architecture of the complex shows that NSUN2 contains four domains: an N-terminal domain (NTD), a methyltransferase domain (MTD) and two C-terminal domains (CTD1 and CTD2). The catalytic MTD is the largest domain and organizes the protein by making direct contact with NTD and CTD1, and CTD2 is wedged between CTD1 and NTD (Fig. 1f). NSUN2 uses all the domains to make extensive contact (2,878 Å2) with the tRNA, and most of the tRNA interacts with NSUN2, except for the D-arm (Fig. 1g,h). The lack of substantial protein contact probably leads to conformational variability in the D-arm; we observe two main classes of particles: one with a visible D-arm and the other with a disordered D-arm (Extended Data Fig. 1g,h). We also observe that the interactions between NSUN2 and the anticodon arm are distinct in these two conformations (Extended Data Fig. 1i,j), suggesting that the anticodon arm interacts with the protein more weakly. Nevertheless, despite weaker contact with D- and anticodon arms, the tRNA molecule is stably bound to NSUN2 to produce a high-resolution map of the RNA–protein complex (Extended Data Fig. 2).
A striking feature of the NSUN2–tRNA conjugate structure is the unusual conformation of the bound tRNA. In the NSUN2-bound state, the tRNA no longer adopts its canonical L-shape that depends on the contact between the T- and D-loops to form the elbow (Fig. 2a and Extended Data Fig. 1k). Structural superimposition indicates a large amount of unfolding, in which Gua19N1 and Cyt56N3, which typically form a hydrogen bond as part of the kissing loop interaction between the D- and T-loops, become separated by about 35 Å. This conformational change of the tRNA is probably stabilized by extensive protein–RNA contacts. Although NSUN2 envelops most of the tRNA by a large basic surface (Fig. 2b), the most intimate contact is observed near the NSUN2 regions of higher evolutionary conservation that bind the T-arm and parts of the acceptor arm (Fig. 2c–e). The T-stem is recognized by a row of basic side chains from the NTD that recognizes the pattern of phosphates from the duplexed RNA, ensuring a stem structure adjacent to the methylated cytidine (Fig. 2f and Extended Data Fig. 1l). Here, the guanidinium groups of R133 and R137 also contact N7 of guanine bases, which would cause purines to be preferred at these positions. The unpaired nucleotides in the T-loop form several favourable contacts with NSUN2, through ring-stacking and hydrophobic interactions, along with a few electrostatic contacts (Fig. 2g and Extended Data Fig. 1m). The base of the acceptor arm is buttressed by the MTD helix (α3) that provides a platform for the RNA duplex, and the phenol group of Y222 stabilizes Gua66 by ring stacking (Fig. 2h and Extended Data Fig. 1n). The backbone edge of the duplex RNA in the acceptor arm makes favourable contacts with the CTD1 (Fig. 2i and Extended Data Fig. 1o). At the 3′ end of the acceptor arm, the single-stranded RNA (ssRNA) is ordered enough to be modelled, although less defined than most of the RNA, suggesting that it may have loose interactions with the CTDs (Extended Data Fig. 1p). The key RNA–protein contacts can be summarized using a schematic (Fig. 2j). Several point mutations in these contact areas can diminish productive methylation, in agreement with our structural observations (Fig. 2k). The R133Q mutation has been observed in tumour samples36 (Fig. 2f), and reduced methylation activity may contribute to certain cancers. Therefore, the structure-guided biochemical analysis suggests that the NSUN2–tRNA complex is primarily stabilized by RNA structure-dependent interactions, including electrostatic interactions with the phosphate edges of the duplex (acceptor arm and T-arm) and sequence-independent stacking interactions with the T-loop.
The alternative text for this image may have been generated using AI.Full size imagea, Alignment of unbound (PDB: 1FIR, grey) and NSUN2-bound (PDB: 9Z2N from this study, orange) tRNALysTTT by their acceptor arms. Dashed arrows show the conformational differences in key structural elements. b,c, Models of the NSUN2C271A–tRNALysCTT complex with RNA shown as ribbon and protein in surface representation, coloured by vacuum electrostatic potential (b) or evolutionary conservation score (c). Conservation score per residue calculated using ConSurf, with yellow indicating insufficient data. d, NSUN2 surface representation coloured by domain, except for the residues within 3.8 Å of RNA that are shown in orange. e, Similar to d, protein coloured by domain, with RNA–protein interfaces highlighted with black boxes and panel labels. f–i, Close-up views of protein–RNA interfaces highlighted in e. Protein side chains within 3.8 Å of RNA are shown as sticks, and electrostatic contacts (<3.8 Å) are indicated by black dashed lines. Cyt48 is shown in cyan. j, Schematic of protein–RNA contacts (<3.8 Å) in T- and acceptor arms with protein residues coloured according to d. Arrows indicate electrostatic (triangle) or hydrophobic (circle) interactions between main chain (solid) or side chain (dashed) protein atoms with RNA. k, In vitro methylation activity of NSUN2 mutants with substitutions in NTD (blue) or MTD (purple), with wild-type tRNALysCTT, shown as mean ± s.d. (n = 3). Ctr, no enzyme control. Significance tests are comparisons with NSUN2WT (two-tailed unpaired t-test).
Our NSUN2–tRNA conjugate structures provide a high-resolution view of the NSUN2 catalytic pocket in action, revealing the reaction intermediate (Fig. 3a and Extended Data Fig. 1q). The methylated cytosine ring and the by-product SAH are deeply buried in the catalytic pocket. Our maps are sufficiently resolved to model the SAH and the protein–RNA crosslink between Cyt48 and the catalytic residue C321. An intricate network of polar contacts and van der Waals interactions determines the conformations of the cofactor and the methylated cytosine ring. The specificity for the cytosine base is conferred through recognition of N4 by main-chain carbonyl interactions with V269 and S319 (Extended Data Fig. 1r). RNA drapes over the bound SAH, suggesting the bound cofactor helps organize the RNA near the catalytic pocket (Extended Data Fig. 1s). As expected for residues responsible for precise chemistry, point mutations of the amino acids involved in coordinating the catalytic pocket are deleterious for methylation activity, including the R220C mutation found in multiple types of cancer36 (Fig. 3a,b). Thus, a near-atomic-resolution view of the NSUN2–RNA conjugate provides valuable insight into how the key cytidine is recruited into the catalytic pocket to undergo methylation.
The alternative text for this image may have been generated using AI.Full size imagea, View of the NSUN2C271A–tRNALysCTT catalytic pocket with protein side chains within 3.8 Å of SAH or m5Cyt48 shown as sticks. Electrostatic contacts are shown as black dashed lines, and other nucleotides are removed for clarity. b, In vitro methylation activity of NSUN2 catalytic site mutants, shown as mean ± s.d. (n = 3). Ctr, no enzyme control. Significance tests are comparisons with NSUN2WT (two-tailed unpaired t-test). c–e, Models of apo NSUN2WT–tRNALysCTT complex in two conformations (c), NSUN2WT–tRNALysCTT–SFG complex (d) and NSUN2C271A–tRNALysCTT–SAH complex (e). Protein coloured by domain, SFG and SAH shown as pink sticks, and Cyt48 shown as cyan sticks. f–i, Close-up views of the catalytic sites corresponding to the structures in c–e overlaid with their sharpened cryo-EM density map shown as transparent surfaces. Protein side chains, SFG, SAH and Cyt48 are shown as sticks.
Given the dynamic nature of the bound RNA, as evidenced by the two main conformations observed with the NSUN2C271A–tRNALysCTT complex, we investigated changes in RNA–protein interactions throughout the catalytic cycle. We determined the NSUN2 affinity for SAM to be about 10 µM by isothermal titration calorimetry, similar to previously reported values19, whereas that for tRNAs is >100-fold tighter regardless of cofactor-bound state (Extended Data Fig. 1c,d,t–v). We used wild-type NSUN2–tRNALysCTT complexes to determine cryo-EM structures with no cofactor (apo) or bound with a non-reactive SAM-analogue, sinefungin (SFG) (Fig. 3c–e, Extended Data Table 1 and Extended Data Figs. 3 and 4). The apo state exhibits the greatest RNA conformational heterogeneity, particularly near the T-arm and the catalytic pocket (Fig. 3c,f,g). The vacant cofactor-binding pocket allows the RNA to be more flexible near the active site, resulting in two distinct RNA conformations, especially the T-arm. Each RNA segment shift is followed by the corresponding bound protein domain, underscoring the specific and tight interactions between the T-arm and the NTD, and between the acceptor arm and the CTD. Thus, the two structures from the apo state demonstrate that the domains of NSUN2 can recognize and bind the RNA substrate with sufficient specificity and affinity to bring the target base close to the catalytic pocket, though not enough to stabilize the active conformation (Fig. 3f,g). On binding SFG, Cyt48 docks properly into the catalytic pocket with the target carbon atom (C5) moving approximately 4 Å deeper into the catalytic groove, ready for methylation (Fig. 3d,h). These results align with our findings that efficient crosslinking in the presence of mutant NSUN2C271A requires SAM (Fig. 1b). The protein and RNA conformations in the pre-catalytic conformation with SFG are similar to the catalytic intermediate represented by the crosslinked complex structure with NSUN2C271A (Fig. 3e,i, root mean square deviation (RMSD) = 0.31 Å). Therefore, although the overall affinity of NSUN2 for substrate RNA is sufficient to form stable complexes with approximately correct architecture, establishing an active catalytic pocket requires the bound cofactor to position the cytosine ring with proper geometry.
The alternative text for this image may have been generated using AI.Full size imagea,b, In vitro methylation activity of NSUN2 with wild-type human tRNAs containing a single cytosine in tRNA positions 48, 49 or 50 (a), or mutant tRNAs in which the single cytosine was shifted to the other positions (b). Ctr, no enzyme control. Data are shown as mean ± s.d. (n = 3, two-tailed unpaired t-test). c, In vitro crosslinking activity of NSUN2C271A with the same tRNAs used in b. A representative gel from three replicates is shown. For gel source data, see Supplementary Fig. 1. d, Structure of NSUN2C271A–tRNALysTTT–SAH complex, in the D-arm conformation. Protein is shown as a cartoon, RNA as a grey ribbon and SAH as pink sticks. e, Superimposition of NSUN2C271A–tRNALysCTT and NSUN2C271A–tRNALysTTT D-arm conformation structures by MTD. The RNA conformational shift is shown as orange (tRNALysCTT) and grey (tRNALysTTT) ribbons with protein in surface representation. Cyt49 of tRNALysTTT is coloured cyan. f,g, Close-up views of the junctions of acceptor arm and T-arm in the NSUN2C271A–tRNALysCTT (C48, f) and NSUN2C271A–tRNALysTTT (C49, g) structures. Key nucleotides are shown as sticks for orientation, and target cytosines are coloured cyan with protein shown in surface representation.
To investigate how the same enzyme accommodates several tRNA positions given the extensive contacts and intricate atomic geometry observed in the NSUN2–tRNALysCTT (with Cyt48) complexes, we determined cryo-EM structures of NSUN2 crosslinked to wild-type tRNALysTTT with a Cyt49 (Fig. 4d, Extended Data Table 1 and Extended Data Fig. 5). Comparing the two tRNA structures with distinct cytidine positions shows that the broad specificity of NSUN2 as a methyltransferase arises from its ability to bind the tRNA in several distinct conformations (Fig. 4e). We find that the first base pair of the tRNA T-stem (position 49:65) is dissolved to allow for Cyt49 to enter the catalytic pocket, and the released position 65 pairs with the Ura8 to lengthen the acceptor arm by 1 base pair; the protein conformation remains constant (RMSD = 0.123 Å) in the two structures (Fig. 4f,g and Extended Data Fig. 5h,i). In both structures, NSUN2 recognizes the duplex RNAs surrounding the methylated cytidine, and the protein–RNA contact locations are determined by their distance from the methylated cytidine. To enable Cyt50 to access the catalytic pocket in a similar manner, the resulting three-base-pair T-stem may be too short to fully retain the observed favourable interactions in the structures (Fig. 2j and Extended Data Fig. 5j). Therefore, by comparing how NSUN2 methylates distinct tRNA positions, we identified key RNA features for productive m5C modification.
By combining insights from structures across multiple states and different substrates, we could identify which protein–RNA contacts are important for productive m5C modification. We postulated that RNA structural features drive NSUN2 recognition because of the many interactions with the dsRNA. A linear tRNALysTTT segment (11 nt) containing the m5C site and its neighbouring primary sequence exhibits no detectable methylation activity, indicating the importance of a larger structural context (Fig. 5a,b). Disrupting the secondary structure of the T- and acceptor arms by introducing Cyt-to-Ade mutations that create mismatches in the RNA stems impairs methylation activity. Reinstating the Watson–Crick base pairing while changing the primary sequence (from CAGGG to CAAAG) adjacent to the methylated cytidine rescues the methylation activity, suggesting that the secondary structure is crucial for recognition. However, a mutant tRNALysTTT (DLoopMut) that cannot form proper kissing loop interactions between D- and T-loops required for tRNA tertiary structure37 is methylated with similar efficiency as the wild-type tRNA, suggesting that the full tRNA fold is not required for recognition by NSUN2 (Extended Data Fig. 6a–c). Another tRNA variable region methyltransferase, METTL1–WDR4, is known to recognize the intact tRNA tertiary fold through the elbow structure to methylate Gua46 (refs. 38,39). In contrast to NSUN2, methylation of the DLoopMut mutant by METTL1–WDR4 is undetectable, indicating that the tRNA elbow structure is disrupted (Extended Data Fig. 6d). These data suggest that NSUN2 recognizes RNA duplexes but does not require a full tRNA structure.
The alternative text for this image may have been generated using AI.Full size imagea, Schematics of tRNALysTTT variants. The 11-nt ssRNA ‘Oligo’ segment spans the region marked in blue, and ‘AC-arm’ refers to the anticodon arm. Mutations are coloured red, and compensatory mutations to preserve base pairing are coloured grey. b, In vitro methylation activity of NSUN2 with tRNA substrates shown in a (light grey) and Extended Data Fig. 6a (dark grey). The dashed line separates the two mutagenesis series. RNA sequence following the methylated Cyt (arrow) is shown on the right with position numbers. Data are shown as mean ± s.d. (n = 3, two-tailed unpaired t-test). c, Schematics of tRNALysTTT with arm deletions. d,e, In vitro methylation activity of NSUN2 (d) and the m7G methyltransferase METTL1–WDR4 complex (e) with the tRNAs from c. Significance tests are comparisons with wild-type tRNALysTTT. Data are shown as mean ± s.d. (n = 3, two-tailed unpaired t-test). f, Schematics of Mini NSUN2 substrate derived from NSUN2C271A–tRNALysTTT structures. g, In vitro methylation activity of NSUN2 with Mini (f), with additional deletions or modifications as described. Data are shown as mean ± s.d. (n = 3, two-tailed unpaired t-test). h,i, Schematics of two variants of Mini (f), with a closed C-stem and open bulge (h) or open N-loop (i). Ctr, no enzyme control.
Apart from structural recognition, we consistently observe ordered arginine side chains (R133 and R137) interacting with guanine bases 3′ of the methylated Cyt (Fig. 2f and Extended Data Fig. 1l). Previous transcriptome-wide studies and consensus analyses also suggested a preference for guanosines near NSUN2-dependent m5C sites10,11,12,13,33. We tested the importance of nucleotide sequence by systematically replacing each nucleotide position with Cyt (Fig. 5b and Extended Data Fig. 6e). Uridine was tested alongside cytidine at position 50 because Cyt50 is susceptible to methylation. Although the two nucleotides immediately following the m5C modification site could be replaced with a pyrimidine without altering methylation activity, purines are preferred at the positions that interact with the arginines (Gua52 and Gua53 in tRNALysTTT). Therefore, although single pyrimidine substitutions can still support m5C modification, the CNNRR sequence is preferred by NSUN2.
The uneven distribution of protein contacts on the tRNA in our complex structures suggests that not every arm of the tRNA is important for methylation. We measured changes in enzymatic activity with systematic RNA deletions (Fig. 5c). When we delete each arm of tRNALysTTT, D- and anticodon arms are dispensable whereas T- and acceptor arms are necessary for NSUN2-mediated methylation (Fig. 5d). To examine whether the arm requirement was specific to tRNALysTTT, we made an equivalent deletion series for tRNAMetCAT and observed a similar trend (Extended Data Fig. 6f). We tested the truncated tRNALysTTT constructs with METTL1–WDR4 and found that any arm deletion abrogates m7G methylation by METTL1–WDR4, drawing a stark contrast between the enzymes in their substrate requirements, despite their similar target location (Fig. 5e). These findings corroborate with previous reports that NSUN2 can methylate the mitochondrial tRNASerGCT that lacks a D-arm40,41 (Extended Data Fig. 6g). The in vitro methylation assay results can also be recapitulated using the orthogonal crosslinking assay with NSUN2C271A (Extended Data Fig. 6h). Therefore, unlike some other tRNA modification enzymes, NSUN2 does not require a full tRNA structure to catalyse m5C modification.
We aimed to design a minimal RNA substrate for NSUN2 to further dissect the key features that are important for methylation. Deleting both the D- and anticodon arms preserves methylation at levels comparable to those of the full-length tRNA (Fig. 5f,g). We term this double-deletion construct a Mini substrate for NSUN2, and it contains the two necessary RNA stem structures—N-stem and C-stem—which bind to the NTD or CTDs of NSUN2, respectively. Mini also binds NSUN2 significantly better than the constructs missing the T- or acceptor arm (Extended Data Fig. 7a,b). Shortening the Mini substrate by deleting the overhanging 3′ ssRNA causes a modest decrease in methylation (Mini∆ssRNA; Fig. 5g). We further truncated the blunt-ended RNA by shortening the C-stem to 7 bp, as observed in tRNALysCTT, and this shorter stem resulted in a minor decrease in methylation (Mini7bp). We then tested whether the location of the 5′ and 3′ ends in Mini is important. Closing the C-stem with a tetraloop while opening the bulge region (MiniOpen-Bulge; Fig. 5g, h) preserved methylation activity comparable to the Mini7bp construct, demonstrating that NSUN2 can recognize and methylate multiple types of dual-stem scaffold. However, closing the C-stem with the same tetraloop while opening the N-loop dramatically reduces methylation activity, indicating that the N-stem-loop structure is important to interact with the NTD properly (Fig. 5g,i). We tested the importance of the N-loop sequence that is derived from the tRNA T-loop. Mutating the conserved AG motif to GA does not alter methylation propensity, suggesting that the stem-loop conformation is important but not the sequence (Extended Data Fig. 7c,d). The shortest RNA fragments with good reactivity (Mini7bp and MiniOpen-Bulge) exhibit about half the level of methylation as the full-length tRNALysTTT. Nevertheless, given the wide activity variance among wild-type tRNAs (Fig. 4a), they serve as reasonable minimal substrates for NSUN2, reducing tRNALysTTT from 76 to 39 nucleotides. Considering the important relative orientation of the two stems, we examined how bulge length affects their function. Shorter bulges are increasingly poor NSUN2 substrates, indicating that a sufficient length is required to correctly position the two stems at an angle that allows each to bind its respective protein domain (Extended Data Fig. 7e). These experiments indicate that a dual-stem structure with a bulge or flexible linker on one side is a key RNA element for NSUN2 recognition.
Leveraging our ability to use a minimal substrate, we investigated potential interactions between adjacent m5C sites by synthesizing oligonucleotides with a unique m5C modification adjacent to the target cytosine, analogous to positions 48 and 49 in tRNAs (Extended Data Fig. 7f). Methylation activity is similar between premethylated and unmethylated substrates when the pre-existing m5C is 5′ to the target cytidine and moderately higher when m5C is on the 3′ side (Extended Data Fig. 7g). This apparent activation is probably due to structural variation (Fig. 4), as the two premethylated oligonucleotides also differ in methylation activity. As the first m5C modification can be added to Cyt48 and Cyt49 with similar efficiencies (Fig. 4a,b), our combined data suggest that each NSUN2 methylation event on a tRNA is probably stochastic rather than sequential. Modelling the premethylated nucleotides by substituting the base with m5C does not generate notable clashes or favourable contacts, consistent with the model in which adjacent m5C modifications are independent of each other (Extended Data Fig. 7h).
In summary, through a structure-guided series of biochemical studies, we have identified a small RNA fragment that encapsulates the important features of an RNA substrate for m5C methylation: a dual-stem structure with a bulged connection that can support the stems to bind at an angle, with a preference for the CNNRR motif at the 5′ end of the N-stem loop.
Our ability to design a minimal substrate for NSUN2 suggests that the dual-stem model could be applied to other substrate RNAs lacking a tRNA fold. To extend our understanding of NSUN2 substrates, we investigated how sites outside the tRNA variable region are recognized. Human long noncoding RNA RP11 contains a high-confidence m5C site with a top methylation score, which has been reproduced across multiple studies11,42. Applying the dual-stem framework, we identified a small RP11 fragment that recapitulates methylation and binding at the same site in vitro (Extended Data Fig. 7i,j). Mutating the methylated cytidine to uridine (RP11CtoU) abrogates methylation. When the purines in the CNNRR motif of the RP11 fragment are mutated to pyrimidines (RP11GAGCU), with compensatory mutations to maintain the stem structure, we observe a loss in methylation activity, underscoring the preference for the CNNRR motif. Therefore, our dual-stem substrate-recognition model for NSUN2 can be applied to non-tRNA substrates.
Previous studies have shown that pre-tRNALeuCAA can be modified by NSUN2 in the anticodon loop at position 34, apart from position 48 (refs. 27,32) (Fig. 6a). Cyt34 methylation may appear inconsistent with our model, in which NSUN2 methylates dual-stem RNAs resembling T- and acceptor arms (Fig. 5d). However, the stems important for NSUN2 recognition are determined by their relative positions to the potential m5C site. In pre-tRNALeuCAA, the intron introduces two extra stem structures into the tRNA. As a result, pre-tRNALeuCAA has two separate dual-stem elements: one for Cyt34 and one for Cyt48. We can reconstitute the intron-dependent m5C modification of Cyt34 and the constitutive methylation of Cyt48 for pre-tRNALeuCAA with purified NSUN2, in agreement with previous studies27,32 (Fig. 6b). Moreover, we show that a dual-stem RNA fragment that mostly consists of the intron sequence (Intron, for clarity) can be methylated with similar efficiency as the full-length pre-tRNALeuCAA (Fig. 6c,d). Mutating Intron at the m5C site (IntronC34U) or the CNNRR motif (IntronCAACC) substantially diminished methylation activity, consistent with our NSUN2 substrate-recognition model.
The alternative text for this image may have been generated using AI.Full size imagea, Cloverleaf diagram of pre-tRNALeuCAA with the intron coloured blue and target cytosines highlighted in yellow. Stems recognized by NSUN2 for each m5C site are marked with dashed lines. b, In vitro methylation activity of NSUN2 with tRNALeuCAA mutants. Mature tRNA is without the intron. Data are shown as mean ± s.d. (n = 3, two-tailed unpaired t-test). c, Secondary structure diagrams of wild-type (top) and mutant (bottom) pre-tRNALeuCAA intron fragments. Mutations in the 5′ stem are coloured red, compensatory mutations to preserve base pairing are coloured grey and the intron sequence is coloured blue. d, In vitro methylation activity of NSUN2 with the substrates described in c and the same fragment with a C34U mutation. Data are shown as mean ± s.d. (n = 3, two-tailed unpaired t-test). e, Cryo-EM structure of the NSUN2C271A–IntronWT crosslinked complex. Protein shown as cartoon coloured by domain, RNA as orange ribbon, and SAH as pink sticks. f, Model of NSUN2–RNA recognition and methylation. NSUN2 recognizes dual-stem RNA structures. NTD of NSUN2 binds the N-stem-loop, and CTD of NSUN2 binds the C-stem. At the 5′ end of the N-stem, the CNNRR motif is preferred for efficient methylation of the first Cyt. NSUN2 can recognize the RNA in the absence of the SAM cofactor (apo state), although NSUN2 may also bind SAM first. The empty SAM-binding pocket adjacent to the α3-helix of the MTD allows RNA conformational variability in the apo state. Cofactor binding to NSUN2 (pre-catalytic state) rigidifies the protein–RNA complex and organizes the reaction centre to be poised for catalysis. NSUN2 forms a covalent bond (through C321) with the cytidine to initiate the catalytic reaction (catalytic intermediate state), enabling methyl transfer even in the absence of the second cysteine (C271).
To provide a structural model of NSUN2 recognizing an isolated dual-stem structure, we used cryo-EM to visualize its binding to RP11 or Intron. Although the NSUN2C271A–RP11 complex data did not converge on an interpretable three-dimensional reconstruction, we successfully obtained a cryo-EM structure of the NSUN2C271A–IntronWT complex (Fig. 6e, Extended Data Fig. 8 and Extended Data Table 1). The 30-nucleotide Intron fragment binds NSUN2 with the N- and C-stems positioned similarly to the corresponding stems of intact tRNAs (RMSD, overall: 0.1 Å, RNA: 1.3 Å; Extended Data Fig. 8e), and the purines of CNNRR are similarly recognized by the side chains of R133 and R137 (Extended Data Fig. 8f). During data processing, we observed another similar conformation of the Intron complex with a longer C-stem density (Extended Data Fig. 8b). We did not build a model for the second conformation, as the shorter oligonucleotides are more likely than tRNAs to adopt alternative conformations, including forming artificial concatemers. In summary, we demonstrated that the substrate-recognition model for NSUN2—based on detecting a bulged dual-stem RNA feature with a preference for the CNNRR sequence at the 5′ end of the N-stem—can be applied more generally to NSUN2 methylation sites other than in the tRNA variable region.
Through a series of cryo-EM structures and complementary biochemical investigations, our work sheds light on the dynamic interactions between NSUN2, RNA and SAM cofactor across multiple stages of the methylation catalytic cycle and for different types of substrate (Fig. 6f and Supplementary Video 1). We demonstrate that although the NTD and CTDs robustly bind RNA stems independently of cofactors, the SAM cofactor is essential for precisely docking the target cytosine into the active site. By comparing diverse substrates, we establish that NSUN2 selectivity is primarily driven by a dual-stem RNA element, with a preference for the CNNRR sequence starting with the methylated Cyt, at the 5′ end of the N-stem.
Structure-dependent substrate recognition by NSUN2 explains why a clear consensus motif is difficult to identify with sequence-based methods. Deleting approximately 30 nucleotides in a tRNA between NSUN2 recognition elements maintains similar methylation activity, indicating that large gaps are allowed in sequence space. Nevertheless, our structural evidence for purine preference at positions 4 and 5 corroborates existing transcriptome-wide sequence analyses that are G-rich6,10,11,13. Therefore, by analysing the contributions of sequence within the structural framework, our work provides a unifying model for interpreting data on NSUN2-dependent m5C methylation of RNAs.
Our findings also provide a mechanistic understanding of clinically relevant mutations and regulatory modifications of NSUN2. The G679R mutation is linked to intellectual disability and probably destabilizes the CTD2 hydrophobic core, leading to the protein aggregation we observed in vitro15,43 (Extended Data Fig. 9a). Similarly, cancer-associated mutations and regulatory SUMOylation/ubiquitination sites are positioned at the substrate–enzyme interface, in which they would interfere with substrate binding23,44 (Extended Data Fig. 9b,c). Conversely, the disordered N-terminal region reported to mediate glucose-mediated metabolic regulation did not influence in vitro activity, suggesting that this regulation requires additional cellular factors19 (Extended Data Fig. 9d).
Comparison of NSUN2 with the rest of the enzyme family highlights a divergent model35 (Extended Data Fig. 9e–g). Although the catalytic domains are conserved, substrate specificity is dictated by the unique arrangement of peripheral RNA-binding domains. Although our study primarily used unmodified in vitro transcripts and left the potential influence of crosstalk with other RNA modifications for future study, the series of snapshots provides a robust framework for understanding the catalytic cycle. Ultimately, defining the minimal dual-stem substrate with a CNNRR motif enables the precise identification of direct NSUN2 targets and the design of small-molecule therapeutics to modulate its activity in disease.
DNA templates containing human tRNA genes for in vitro transcription were assembled using primer extension polymerase chain reaction (PCR) with a 5′ extended T7 promoter sequence (GCGAAATTAATACGACTCACTATA-[tRNA gene]). The PCR products were purified using a GeneJET PCR purification kit (Thermo Fisher). DNA templates were transcribed in vitro using 0.1 mg ml−1 recombinant T7 RNA polymerase produced in-house in a final buffer containing 50 mM Tris-HCl, pH 8.0, 1 mM spermidine, 0.01% TritonX-100, 5 mM DTT, 2 mM rNTPs, 18–22 mM MgCl2 and 20 ng µl−1 of DNA template. RNA transcripts were extracted using acid phenol:chloroform pH 4.5 (with IAA, 125:24:1) and precipitated in 1.2 volumes of isopropanol supplemented with sodium acetate pH 5.2 at −20 °C. Precipitated RNA was resuspended in a buffer containing 20 mM Tris-HCl, pH 7.5, 100 mM NaCl and 2 mM MgCl2, then further purified using electrophoresis with a 10% urea-PAGE gel. Target RNAs were eluted with a buffer containing 20 mM Tris-HCl, pH 7.5, 100 mM NaCl and 2 mM MgCl2 for 16 h to extract RNA by passive diffusion. Extracted RNA was precipitated again with sodium acetate and isopropanol at −20 °C, and resuspended in a final RNA dissolving buffer of 20 mM Tris-HCl, pH 7.5, 100 mM NaCl and 2 mM MgCl2. Purified RNAs were analysed by native PAGE gels for evaluation (Supplementary Fig. 2). All RNAs in this study were annealed by incubating at 75 °C for 5 min, followed by slow cooling to 25 °C over 30 min before use. A list of DNA primers used for PCR is provided in Supplementary Table 1. Wild-type tRNA sequences and secondary structures were obtained from gtRNAdb46. The mitochondrial tRNASerGCT sequence and its secondary structure are adapted from ref. 47. Secondary structure diagrams of non-tRNA substrates were adapted from RNAfold calculations at 37 °C using the Andronescu 2007 energy model with coaxially stacked dangling ends unless stated otherwise48. For lncRNA RP11, the fragment was designed around the reported NSUN2-dependent m5C site Hg19, Chr1:854196.
In vitro methylation reactions with NSUN2 were performed with 50 nM enzyme and 100 nM RNA in 20 mM Tris-HCl, pH 7.5, 100 mM NaCl, 2 mM MgCl2, 5% glycerol, 5 mM DTT and a 200 nM [3H]-SAM:SAM mixture at a molar ratio of 1:4 (Figs. 1, 4–6 and Extended Data Figs. 6 and 8) or 1:3 (Figs. 2 and 3). Reactions with METTL1–WDR4 were performed under the same conditions for Extended Data Fig. 6d, and with 100 nM enzyme and 150 nM tRNA in 20 mM Tris-HCl, pH 7.5, 100 mM NaCl, 2 mM MgCl2, 5% glycerol, 5 mM DTT and a 1:3 molar ratio of [3H]-SAM:SAM mixture at a total concentration of 300 nM for Fig. 5e. Reactions were incubated at 37 °C for 1 h and blotted onto Hybond-N+ hybridization membranes. Membranes were washed with a buffer containing 20 mM Tris-HCl, pH 7.5, and 0.01% Triton X-100 before being placed in scintillation vials and suspended in 4 ml of a scintillation cocktail (RPI) for counting using an AccuFLEX LSC-8000 scintillation counter (Hitachi). A list of RNA sequences used is available in Supplementary Table 2.
In vitro crosslinking assays were performed with 500 nM NSUN2C271A, 1 µM RNA and 50 µM SAM in 20 mM Tris-HCl, pH 7.5, 100 mM NaCl, 500 µM MgCl2, 5% glycerol and 5 mM DTT. Reactions were incubated at 37 °C for 1 h, then quenched with an SDS–PAGE gel loading buffer containing 300 mM Tris-HCl, pH 6.8, 12% (w/v) SDS and 40% glycerol supplemented with 10 mM DTT and heated at 95 °C for 5 min. About 350 ng of protein from each sample was loaded and separated by SDS–PAGE, then stained with Coomassie Blue.
Dephosphorylated RNAs were 5′ radiolabelled using 0.4 U µl−1 T4 polynucleotide kinase (PNK) in 1X PNK buffer (New England BioLabs) with 133 nM [γ-32P]ATP (Revvity). Radiolabelled RNAs were purified over Bio-Gel P-6 SEC beads (BioRad) equilibrated in 20 mM Tris-HCl, pH 7.5, 100 mM NaCl and 2 mM MgCl2. RNA–protein complexes were formed by mixing the indicated concentrations of protein with 1 nM radiolabelled RNA substrate in 20 mM Tris-HCl, pH 7.5, 150 mM NaCl, 3 mM MgCl2, 10% glycerol, 5 mM DTT and 10 µM of non-specific DNA oligonucleotides at 25 °C for 1 h, then resolved with a native 8% Tris-borate polyacrylamide gel. Gels were dried and imaged using a Typhoon FLA 9500 phosphorimager (GE Healthcare). Band quantitation was performed with ImageLab v.6.1 (BioRad) using a background subtraction disk size of 8.0 mm. KD values were determined by fitting fractional binding data for three replicates to a Hill slope model with a maximum saturation value constrained to 1.0 using Prism 10.
Binding enthalpies for SAM and NSUN2 were measured at 25 °C in 20 mM Tris-HCl, pH 7.5, 500 mM NaCl and 2 mM TCEP using a MicroCal PEAQ-ITC instrument (Malvern Panalytical). For two independent runs, 425 µM SAM was titrated into 400 µl of 42.5 µM NSUN2 using a reference cell power of 5 µcal s−1. Both titrations spanned 21 injections with an initial injection volume of 0.5 µl of titrant over 1 s, and all subsequent titrations injected 1.9 µl over 3.8 s with a 120-s spacing. Peak integrations for both replicates were calculated using NITPIC v.2.0.7 (ref. 49) and subsequent KD determination with confidence intervals was performed in SEDPHAT v.15.2b (ref. 50). Plotting of thermogram data was done in GUSSI v.1.4.2 (ref. 51).
For samples containing NSUN2WT and tRNALysCTT with and without cofactor, a single complex batch was assembled by mixing NSUN2WT and tRNALysCTT in a 1:1.2 molar ratio in a buffer containing 20 mM Tris-HCl, pH 7.5, 100 mM NaCl, 2 mM MgCl2, 5% glycerol and 5 mM DTT at 37 °C for 1 h. Complexes were then separated from excess RNA by gel filtration over a Superdex 200 pg column (Cytiva) equilibrated in 20 mM Tris-HCl, pH 7.5, 100 mM NaCl, 2 mM MgCl2 and 5 mM DTT. Fractions containing the complex were concentrated and then split into two to prepare apo and pre-catalytic (with 500 µM sinefungin) samples. For catalytic intermediate samples with tRNALysCTT and tRNALysTTT, NSUN2C271A–tRNA adducts were assembled by mixing protein and RNA in a 1:1.6 molar ratio in a buffer containing 20 mM Tris-HCl, pH 7.5, 100 mM NaCl, 500 µM MgCl2, 5% glycerol, 5 mM DTT and 500 µM SAM at 37 °C for 2 h. The crosslinked sample with Intron was assembled at a protein:RNA molar ratio of 1:2 under the same conditions. Complexes were then separated from excess RNA by gel filtration.
For all samples, the final enzyme concentration was 4 mg ml−1, estimated using a Bradford assay, and 0.015% (w/v) NP40 was added for cryo-EM grid preparations. A total of 3 µl of each sample was applied to Quantifoil Cu 1.2/1.3 400-mesh grids, which were glow-discharged at 30 mA for 1 min. These samples were blotted for 4.5 s at a blot force of 13 at 4 °C in 100% relative humidity, then plunge-frozen in liquid ethane using a Vitrobot Mark IV (Thermo). For the crosslinked sample with Intron, Quantifoil Cu 1.2/1.3 300-mesh grids were used, and the grids were blotted with a blot force of 10 for 4 s. Data collection was performed on an FEI Titan Krios equipped with a Falcon 4i detector, a Bioquantum energy filter and a cold-FEG using SerialEM v.4.0.8 (ref. 52).
Motion correction, particle picking and three-dimensional analysis were performed using cryoSPARC v.4.6.0 (ref. 53). Conical FSC curves were produced using the Orientation Diagnostics job in cryoSPARC v.4.6.0. Initial models (Extended Data Table 1) were docked into cryo-EM maps using ChimeraX v.1.8 and were further built and refined using COOT v.0.9.8.7 and Phenix v.1.21.1 (refs. 54,55,56). Guinier plot B-factor values from cryoSPARC non-uniform refinement are provided in Extended Data Table 1. Map sharpening B-factors were scaled from B-factor values determined by Phenix.Autosharpen to avoid oversharpening and overinterpretation from noise during modelling. For modelling protein–RNA covalent linkages, monomer restraints for the conjugated cytosine (5DC) were generated in Phenix.eLBOW and link restraints between the 5DC monomer and NSUN2 C321 were generated using AceDRG distributed through the CCP4 software suite v.8.0.011 (refs. 57,58). Buried surface area per residue was calculated using PDBePISA v.2.0 (ref. 59), and conservation score per residue was calculated using the ConSurf web server60. Human NSUN protein multi-sequence alignment was performed with Clustal Omega v.1.2.4 (ref. 61). Structural biology applications used in this project were compiled and configured by SBGrid62. To model pre-existing m5C adjacent to target cytosines (Extended Data Fig. 6o) using COOT v.0.9.8.7, we used NSUN2WT–tRNALysTTT–SAH D-arm structure for m5Cyt48 and the NSUN2WT–tRNALysCTT–SAH D-arm structure for m5Cyt49.
All in vitro methylation, in vitro crosslinking and EMSAs were performed at least three independent times (technical replicates) using protein and RNA reagents whose homogeneity and purity were verified using SDS–PAGE or native PAGE, respectively (Supplementary Fig. 2). Binding affinities determined using isothermal titration calorimetry were calculated from two independent titration series and reported with a 95% confidence interval. For gel-electrophoresis-based experiments, at least three independent experiments were performed. Uncropped gel images are provided in Supplementary Fig. 1. The micrographs shown are representative images. Representative data were used for figures and statistical tests, and replication attempts over all experiments were similar and successful. Statistical analysis for quantitative experiments was performed using GraphPad Prism 10. An unpaired, two-tailed Student’s t-test was used for each comparison, and the P-value is indicated. The number of replicates is included in each figure legend, with centre bars indicating the mean and error bars representing standard deviation.
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
The cryo-EM structures of NSUN2C271A–tRNALysCTT–SAH (D-arm: 9Z2N and no D-arm: 9Z2O), NSUN2WT–tRNALysCTT (conformation 1: 9Z2P and conformation 2: 9Z2Q), NSUN2WT–tRNALysCTT–SFG (D-arm: 9Z2R and No D-arm: 9Z2T), NSUN2C271A–tRNALysTTT–SAH (D-arm: 9Z2U and no D-arm: 9Z2W) and NSUN2C271A–Intron–SAH (9Z3D) are deposited in the Protein Data Bank (PDB; https://www.rcsb.org/). The cryo-EM density maps and masks used for model building of NSUN2C271A–tRNALysCTT–SAH (D-arm: 73772 and no D-arm: 73773), NSUN2WT–tRNALysCTT (conformation 1: 73774 and conformation 2: 73775), NSUN2WT–tRNALysCTT–SFG (D-arm: 73776 and no D-arm: 73777), NSUN2C271A–tRNALysTTT–SAH (D-arm: 73778 and no D-arm: 73780) and NSUN2C271A–Intron–SAH (73784) are deposited in the Electron Microscopy Data Bank (https://www.ebi.ac.uk/pdbe/emdb/). The crystal structure of Bos Taurus tRNALysTTT used for initial model building was accessed from the PDB: 1FIR. The AlphaFold-predicted structure of Homo sapiens NSUN2 used for initial model building was accessed from the AlphaFold Protein Structure Database (https://alphafold.ebi.ac.uk/, AFDB: AF-Q08J23-3-F1). Source data are provided with this paper.
Suzuki, T. The expanding world of tRNA modifications and their disease relevance. Nat. Rev. Mol. Cell Biol. 22, 375–392 (2021).
Delaunay, S., Helm, M. & Frye, M. RNA modifications in physiology and disease: towards clinical applications. Nat. Rev. Genet. 25, 104–122 (2024).
Sun, H., Li, K., Liu, C. & Yi, C. Regulation and functions of non-m6A mRNA modifications. Nat. Rev. Mol. Cell Biol. 24, 714–731 (2023).
Bohnsack, K. E., Höbartner, C. & Bohnsack, M. T. Eukaryotic 5-methylcytosine (m5C) RNA methyltransferases: mechanisms, cellular functions, and links to disease. Genes 10, 102 (2019).
Yang, X. et al. 5-methylcytosine promotes mRNA export — NSUN2 as the methyltransferase and ALYREF as an m5C reader. Cell Res. 27, 606–625 (2017).
Liu, F. et al. RNA m5C methylation mediated by Ybx1 ensures hematopoietic stem and progenitor cell expansion. Cell Rep. 44, 115324 (2025).
Ma, H. L. et al. SRSF2 plays an unexpected role as reader of m5C on mRNA, linking epitranscriptomics to cancer. Mol. Cell 83, 4239–4254.e10 (2023).
Chen, Y. -S., Yang, W. -L., Zhao, Y. -L. & Yang, Y. -G. Dynamic transcriptomic m5 C and its regulatory role in RNA processing. Wiley Interdiscip. Rev. RNA 12, e1639 (2021).
Guarnacci, M. et al. Substrate diversity of NSUN enzymes and links of 5-methylcytosine to mRNA translation and turnover. Life Sci. Alliance 7, e202402613 (2024).
Huang, T., Chen, W., Liu, J., Gu, N. & Zhang, R. Genome-wide identification of mRNA 5-methylcytosine in mammals. Nat. Struct. Mol. Biol. 26, 380–388 (2019).
Lu, L. et al. Base-resolution m5C profiling across the mammalian transcriptome by bisulfite-free enzyme-assisted chemical labeling approach. Mol. Cell 84, 2984–3000 (2024).
Abbasi-Moheb, L. et al. Mutations in NSUN2 cause autosomal-recessive intellectual disability. Am. J. Hum. Genet. 90, 847–855 (2012).
Khan, M. A. et al. Mutation in NSUN2, which encodes an RNA methyltransferase, causes autosomal-recessive intellectual disability. Am. J. Hum. Genet. 90, 856–863 (2012).
Martinez, F. J. et al. Whole exome sequencing identifies a splicing mutation in NSUN2 as a cause of a Dubowitz-like syndrome. J. Med. Genet. 49, 380–385 (2012).
Yang, Q., Zhang, Q., Qin, Z., Yi, S. & Luo, J. A novel variant in NSUN2 causes intellectual disability in a Chinese family. BMC Med. Genom. 17, 95 (2024).
Zhou, J. et al. Unveiling the potential impact of RNA m5C methyltransferases NSUN2 and NSUN6 on cellular aging. Front. Genet. 16, 1477542 (2025).
Miao, W. et al. Glucose binds and activates NSUN2 to promote translation and epidermal differentiation. Nucleic Acids Res. 52, 13577–13593 (2024).
Chen, Y. et al. 5-Methylcytosine transferase NSUN2 drives NRF2-mediated ferroptosis resistance in non-small cell lung cancer. J. Biol. Chem. 300, 106793 (2024).
Zhang, X. et al. NSUN2/YBX1 promotes the progression of breast cancer by enhancing HGH1 mRNA stability through m5C methylation. Breast Cancer Res. 26, 94 (2024).
Hu, Y. et al. NSUN2 promotes colorectal cancer progression and increases lapatinib sensitivity by enhancing CUL4B/ErbB-STAT3 signalling in a non-m5C manner. Clin. Transl. Med. 15, e70282 (2025).
Hu, Y. et al. NSUN2 modified by SUMO-2/3 promotes gastric cancer progression and regulates mRNA m5C methylation. Cell Death Dis. 12, 842 (2021).
Huang, H. et al. YBX1 promotes 5-fluorouracil resistance in gastric cancer via m5C-dependent ATG9A mRNA stabilization through autophagy. Oncogene 44, 2357–2371 (2025).
Cerami, E. et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2, 401–404 (2012).
Zou, Z. et al. RNA m5C oxidation by TET2 regulates chromatin state and leukaemogenesis. Nature 634, 986–994 (2024).
Auxilien, S., Guérineau, V., Szweykowska-Kulińska, Z. & Golinelli-Pimpaneau, B. The human tRNA m5C methyltransferase Misu is multisite-specific. RNA Biol. 9, 1331–1338 (2012).
Squires, J. E. et al. Widespread occurrence of 5-methylcytosine in human coding and non-coding RNA. Nucleic Acids Res. 40, 5023–5033 (2012).
Li, Z. -L. et al. NSUN2-mediated HCV RNA m5C methylation facilitates viral RNA stability and replication. Genomics Proteom. Bioinform. 23, qzaf008 (2025).
Blanco, S. et al. Aberrant methylation of tRNAs links cellular stress to neuro-developmental disorders. EMBO J. 33, 2020–2039 (2014).
Tuorto, F. et al. RNA cytosine methylation by Dnmt2 and NSun2 promotes tRNA stability and protein synthesis. Nat. Struct. Mol. Biol. 19, 900–905 (2012).
Brzezicha, B. et al. Identification of human tRNA:m5C methyltransferase catalysing intron-dependent m5C formation in the first position of the anticodon of the pre-tRNA \(\left(\begin{array}{c}\mathrm{Leu}\\ \mathrm{CAA}\end{array}\right)\). Nucleic Acids Res. 34, 6034–6043 (2006).
Selmi, T. et al. Sequence- and structure-specific cytosine-5 mRNA methylation by NSUN6. Nucleic Acids Res. 49, 1006–1022 (2021).
King, M. Y. & Redman, K. L. RNA methyltransferases utilize two cysteine residues in the formation of 5-methylcytosine. Biochemistry 41, 11218–11225 (2002).
Liu, R.-J., Long, T., Li, J., Li, H. & Wang, E.-D. Structural basis for substrate binding and catalytic mechanism of a human RNA:m5C methyltransferase NSun6. Nucleic Acids Res. 45, 6684–6697 (2017).
Sondka, Z. et al. COSMIC: a curated database of somatic variants and clinical data for cancer. Nucleic Acids Res. 52, D1210–D1217 (2024).
Antika, T. R. et al. Sequence-specific targeting of Caenorhabditis elegans C-Ala to the D-loop of tRNAAla. J. Biol. Chem. 299, 105149 (2023).
Ruiz-Arroyo, V. M. et al. Structures and mechanisms of tRNA methylation by METTL1–WDR4. Nature 613, 383–390 (2023).
Li, J. et al. Structural basis of regulated m7G tRNA modification by METTL1–WDR4. Nature 613, 391–397 (2023).
Shinoda, S. et al. Mammalian NSUN2 introduces 5-methylcytidines into mitochondrial tRNAs. Nucleic Acids Res. 47, 8734–8745 (2019).
Van Haute, L. et al. NSUN2 introduces 5-methylcytosines in mammalian mitochondrial tRNAs. Nucleic Acids Res. 47, 8720–8733 (2019).
Liu, J. et al. Sequence- and structure-selective mRNA m5C methylation by NSUN6 in animals. Natl Sci. Rev. 8, nwaa273 (2021).
Gonskikh, Y. et al. Spatial regulation of NSUN2-mediated tRNA m5C installation in cognitive function. Nucleic Acids Res. 53, gkae1169 (2025).
Li, Y. et al. Long noncoding RNA DIAPH2-AS1 promotes neural invasion of gastric cancer via stabilizing NSUN2 to enhance the m5C modification of NTN1. Cell Death Dis. 14, 260 (2023).
Studier, F. W. Protein production by auto-induction in high-density shaking cultures. Protein Expr. Purif. 41, 207–234 (2005).
Chan, P. P. & Lowe, T. M. GtRNAdb 2.0: an expanded database of transfer RNA genes identified in complete and draft genomes. Nucleic Acids Res. 44, D184–D189 (2016).
Kuhle, B., Hirschi, M., Doerfel, L. K., Lander, G. C. & Schimmel, P. Structural basis for shape-selective recognition and aminoacylation of a D-armless human mitochondrial tRNA. Nat. Commun. 13, 5100 (2022).
Keller, S. et al. High-precision isothermal titration calorimetry with automated peak shape analysis. Anal. Chem. 84, 5066–5073 (2012).
Zhao, H., Piszczek, G. & Schuck, P. SEDPHAT – a platform for global ITC analysis and global multi-method analysis of molecular interactions. Methods 76, 137–148 (2015).
Brautigam, C. A., Zhao, H., Vargas, C., Keller, S. & Schuck, P. Integration and global analysis of isothermal titration calorimetry data for studying macromolecular interactions. Nat. Protoc. 11, 882–894 (2016).
Mastronarde, D. N. Automated electron microscope tomography using robust prediction of specimen movements. J. Struct. Biol. 152, 36–51 (2005).
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
Pettersen, E. F. et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci. 30, 70–82 (2021).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004).
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213–221 (2010).
Moriarty, N. W., Grosse-Kunstleve, R. W. & Adams, P. D. electronic Ligand Builder and Optimization Workbench (eLBOW): a tool for ligand coordinate and restraint generation. Acta Crystallogr. D Biol. Crystallogr. 65, 1074–1080 (2009).
Nicholls, R. A. et al. Modelling covalent linkages in CCP4. Acta Crystallogr. D Struct. Biol. 77, 712–726 (2021).
Krissinel, E. & Henrick, K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 372, 774–797 (2007).
Armon, A., Graur, D. & Ben-Tal, N. ConSurf: an algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information. J. Mol. Biol. 307, 447–463 (2001).
Madeira, F. et al. The EMBL-EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Res. 52, W521–W525 (2024).
Morin, A. et al. Cutting edge: collaboration gets the most out of software. eLife 2, e01456 (2013).
This work was supported by the US National Institutes of Health (R01GM122960, R01CA258589 and R01ES038329 to Y.N. and T32GM131963 to J.C.), the Welch Foundation (I-2115-20220331 to Y.N.), the Cancer Prevention and Research Institute of Texas (CPRIT, RP210041 to N.S.S.) and the American Cancer Society (PF-24-1308804-01-RMC to V.M.R.-A.). We also thank the Cryo-Electron Microscopy Facility (CPRIT RP170644) at UT Southwestern for support with cryo-EM data collection. Y.N. is a Packard Fellow, Pew Scholar, Southwestern Medical Foundation Scholar in Biomedical Research, Doris and Bryan Wildenthal Distinguished Chair in Medical Science, and UT Southwestern Presidential Scholar.
Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX, USA
Jacob Canepa, Victor M. Ruiz-Arroyo, Netanya S. Schlamowitz & Yunsun Nam
Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA
Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
Y.N. conceived and supervised the study. J.C. produced recombinant proteins, prepared the cryo-EM samples, processed the cryo-EM data, determined the cryo-EM structures and refined the atomic models. J.C. and V.M.R.-A. prepared the cryo-EM grids and collected data. J.C. and N.S.S. prepared RNAs and performed biochemical assays. J.C. and Y.N. prepared the paper with help from all other co-authors.
Nature thanks Eva Kowalinski who co-reviewed with Luciano Dolce; Yun-Gui Yang who co-reviewed with Chong Zhang; and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
a, Cryo-EM data processing workflow with a representative micrograph and 2D classes provided. Final map density is colored by domain (NTD: blue, MTD: purple, CTD1: light green, CTD2: dark green, RNA: orange, SAH: pink). b-f, Comparison of the “D-arm Conformation” (left) and “No D-arm Conformation” (right) reconstructions. Unsharpened cryo-EM maps of protein-RNA complexes (top) or protein and RNA map segments (bottom) colored by local resolution (b), gold standard FSC curves with 0.5 and 0.143 thresholds (top) and conical FSC curves with relative distribution frequencies (bottom) (c), relative signal amounts by viewing direction in a 2D heatmap (top) and 3D projections (bottom) (d), and Q-scores calculated per protein (e) and RNA (f) residue. Sigma (σ) denotes the threshold level in standard deviations from the mean.
a, Cryo-EM data processing workflow with a representative micrograph and 2D classes provided. Final map density is colored by domain (NTD: blue, MTD: purple, CTD1: light green, CTD2: dark green, RNA: orange). b-g, Comparison of the “Conformation 1” (left) and “Conformation 2” (right) reconstructions. Unsharpened cryo-EM maps of protein-RNA complexes (top) or protein and RNA map segments (bottom) colored by local resolution (b), gold standard FSC curves with 0.5 and 0.143 thresholds (top) and conical FSC curves with relative distribution frequencies (bottom) (c), relative signal amounts by viewing direction in a 2D heatmap (top) and 3D projections (bottom) (d), molecular models for each reconstruction (e), and Q-scores calculated per protein (f) and RNA (g) residue. For models, protein is shown in cartoon representation and colored by domain, with RNA as orange ribbons. Sigma (σ) denotes the threshold level in standard deviations from the mean.
a, Cryo-EM data processing workflow with a representative micrograph and 2D classes provided. Final map density is colored by domain (NTD: blue, MTD: purple, CTD1: light green, CTD2: dark green, RNA: orange, SFG: pink). b-g, Comparison of the “Conformation 1” (left) and “Conformation 2” (right) reconstructions. Unsharpened cryo-EM maps of protein-RNA complexes (top) or protein and RNA map segments (bottom) colored by local resolution (b), gold standard FSC curves with 0.5 and 0.143 thresholds (top) and conical FSC curves with relative distribution frequencies (bottom) (c), relative signal amounts by viewing direction in a 2D heatmap (top) and 3D projections (bottom) (d), molecular models for each reconstruction (e), and Q-scores calculated per protein (f) and RNA (g) residue. For models, protein is shown in cartoon representation and colored by domain, with RNA as orange ribbons and the SFG cofactor as pink sticks. Sigma (σ) denotes the threshold level in standard deviations from the mean.
a, Cryo-EM data processing workflow with a representative micrograph and 2D classes provided. Final map density is colored by domain (NTD: blue, MTD: purple, CTD1: light green, CTD2: dark green, RNA: orange, SAH: pink). b-g, Comparison of the “Conformation 1” (left) and “Conformation 2” (right) reconstructions. Unsharpened cryo-EM maps of protein-RNA complexes (top) or protein and RNA map segments (bottom) colored by local resolution (b), gold standard FSC curves with 0.5 and 0.143 thresholds (top) and conical FSC curves with relative distribution frequencies (bottom) (c), relative signal amounts by viewing direction in a 2D heatmap (top) and 3D projections (bottom) (d), molecular models for each reconstruction (e), and Q-scores calculated per protein (f) and RNA (g) residue. For models, protein is shown in cartoon representation and colored by domain, with RNA as orange ribbons and the SAH cofactor as pink sticks. Sigma (σ) denotes the threshold level in standard deviations from the mean. h-i, Secondary structure diagrams of the Acceptor- T-arm junction in tRNALysCTT (h), tRNALysTTT (i) conformations when complexed with NSUN2. Target cytosines are highlighted in yellow. j, Proposed secondary structure diagram of the Acceptor- T-arm junction in tRNALysTTT(A50C) when complexed with NSUN2. Target cytosines are highlighted in yellow.
a,b, EMSA of NSUN2 binding tRNALysTTT, tRNALysTTT(∆Acceptor), tRNALysTTT(∆T-arm), and the Mini substrate. Representative gels among 3 replicates (a), and quantitation (b) are shown. Protein concentrations from left to right are: 0, 16, 32, 65, 130, 260, and 520 nM. For source data, see Supplementary Fig. 1. c, Diagram of Mini (Fig. 5f) and MiniN-LoopMut substrates with mutated nucleotides shown in red and the target cytosine highlighted yellow. d, Methylation activity of NSUN2 with the substrates in c. Ctr, no enzyme control. Data are shown as mean ± s.d (n = 3, two-tailed unpaired t-test). e, In vitro methylation activity of NSUN2 with a series of Mini bulge truncations. Ctr, no enzyme control. Data are shown as mean ± s.d (n = 3, two-tailed unpaired t-test). f, Secondary structure diagrams of minimized oligonucleotide substrates with m5C or U at specific positions. Target cytosines are highlighted in yellow. g, Methylation activity of NSUN2 with the substrates in f at different time points. Data are shown as mean ± s.d (n = 3, two-tailed unpaired t-test). h, Close-up views of NSUN2:tRNA complexes modeled with a pre-existing m5Cyt48 (left) or m5Cyt49 (right). Protein (purple) side chains within 3.8 Å of the modeled RNA base (orange) are shown as sticks. Cytosine residues in the catalytic pocket are colored cyan. i, Secondary structure diagrams of WT (top) and mutant (bottom) RP11 fragments. CNNRR nucleotides are bolded, with motif mutations colored red and complementary mutations colored gray. Target cytosines are highlighted in yellow. j, In vitro methylation activity of NSUN2 with the substrates described in i and RP11CtoU, with the m5C site mutated. Data are shown as mean ± s.d (n = 3, two-tailed unpaired t-test).
a, Cryo-EM data processing workflow with a representative micrograph and 2D classes provided. Final map density for Intron Conformation 1 is colored by domain (NTD: blue, MTD: purple, CTD1: light green, CTD2: dark green, RNA: orange, SAH: pink). b-d, Comparison of the “Intron Conformation 1” (left) and “Intron Conformation 2” (right) reconstructions. Unsharpened cryo-EM maps of protein-RNA complexes (top) or protein and RNA map segments (bottom) colored by local resolution (b), gold standard FSC curves with 0.5 and 0.143 thresholds (top) and conical FSC curves with relative distribution frequencies (bottom) (c), and relative signal amounts by viewing direction in a 2D heatmap (top) and 3D projections (bottom) (d). e, Superimposition of NSUN2C271A-tRNALysCTT “D-arm Conformation” and NSUN2C271A-Intron structures by MTD, and the RNA conformational shift is depicted with orange (tRNALysCTT) and gray (Intron) ribbons. Cyt34 of Intron is colored cyan. f, Close-up view of the purines of CNNRR binding R133 and R137 in NSUN2C271A-tRNALysCTT (left) and NSUN2C271A-Intron (right) structures. Protein side chains (blue) and RNA nucleotides (orange) are shown as sticks with sharpened cryo-EM maps as transparent surfaces at σ = 6. g,h, Q-scores calculated for protein (g) and RNA (h) residues for the NSUN2C271A-Intron structure. Sigma (σ) denotes the threshold level in standard deviations from the mean.
a, Close-up view of the NSUN2 CTD2 (dark green) with a modeled G679R disease mutation (red), with nearby residues (<3.8 Å) shown as sticks. b, Table of disease-relevant NSUN2 mutations referenced in this text. c, Reported ubiquitination sites of NSUN2 are shown as spheres colored by domain (CTD1, light green; CTD2, dark green), and RNA is shown as an orange ribbon. d, In vitro methylation activity of NSUN2 with WT tRNAMetCAT or tRNAArgACG over different concentrations of glucose in the reaction. Data are shown as mean ± s.d (n = 3). e, Superimposition of the methyltransferase domains of structurally characterized NSUN family enzymes NSUN4 (green, PDB: 4FP9), NSUN5 (blue, PDB: 2B9E), NSUN6 (gray, PDB: 5WWR), and NSUN2 (purple, PDB: 9Z2R, this study) from the NSUN2-tRNALysCTT-SFG “D-arm Conformation” structure in cartoon representation. Sinefungin and cytosine 48 from the NSUN2 structure are shown as pink and orange ball-and-sticks, respectively. f, Comparison of tRNA recognition between sinefungin-bound NSUN2 (left, PDB: 9Z2R, this study) and sinefungin-bound NSUN6 (PDB: 5WWR). Protein is shown in cartoon representation, colored by domain, with RNA as orange ribbons and the target cytosine as cyan sticks. g, Multiple sequence alignment of human NSUN family enzymes (with Uniprot Accession numbers): NOP2/NSUN1 (P46087), NSUN2 (Q08J23), NSUN3 (Q9H649), NSUN4 (Q96CB9), NSUN5 (Q96P11), NSUN6 (Q8TEA1), and NSUN7 (Q8NE18). Sequence indices are colored by NSUN2 domain (NTD: blue, MTD: purple, CTD1: light green, CTD2: dark green) and amino acid codes are colored by similarity. Sequence alignment was performed with Clustal Omega and displayed using Geneious Prime.
Supplementary Fig. 1: Source gel images for figures. Supplementary Fig. 2: Quality control for tRNA and protein purifications. Supplementary Table 1: List of DNA primers used for RNA in vitro transcription. Supplementary Table 2: Complete list of RNA sequences used in this study.
The structural landscape of NSUN2 tRNA m5C methylation. The unbound state shows an AlphaFold model of NSUN2 (AFDB: AF-Q08J23-F1-v6) with a crystal structure of B. Taurus tRNALysTTT (PDB: 1FIR). The bound state shows NSUN2WT–tRNALysCTT conformation 1 structure (PDB: 9Z2P) and is identical to Apo conformation 1. The Apo conformation 2 state is NSUN2WT–tRNALysCTT conformation 2 (PDB: 9Z2Q). The SFG-bound state is NSUN2WT–tRNALysCTT–SFG D-arm conformation (PDB: 9Z2R). The catalytic intermediate and C48 conformation state is NSUN2C271A–tRNALysCTT–SAH D-arm conformation (PDB: 9Z2N). The C49 conformation state shows NSUN2C271A–tRNALysTTT–SAH D-arm conformation (PDB: 9Z2U). Morphs were generated in ChimeraX using linear interpolation and are not molecular dynamics data.
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Original Source
This content was distilled for a focused reading experience. All rights belong to Nature.
Read original publication