RNAs play a multitude of diverse cellular roles in many biological reactions, from catalysis,[
Chinese Physics B, Volume. 29, Issue 10, (2020)
The theory of helix-based RNA folding kinetics and its application
RNAs carry out diverse biological functions, partly because different conformations of the same RNA sequence can play different roles in cellular activities. To fully understand the biological functions of RNAs requires a conceptual framework to investigate the folding kinetics of RNA molecules, instead of native structures alone. Over the past several decades, many experimental and theoretical methods have been developed to address RNA folding. The helix-based RNA folding theory is the one which uses helices as building blocks, to calculate folding kinetics of secondary structures with pseudoknots of long RNA in two different folding scenarios. Here, we will briefly review the helix-based RNA folding theory and its application in exploring regulation mechanisms of several riboswitches and self-cleavage activities of the hepatitis delta virus (HDV) ribozyme.
1. Introduction
RNAs play a multitude of diverse cellular roles in many biological reactions, from catalysis,[
RNA folding is one of the core issues to comprehensively understand the cellular activities of RNA. There are two kinds of folding manners:[
Under a transcription context, many naturally evolved RNA molecules can effectively avoid misfolded intermediates and form correct structures on a biologically reasonable timescale.[
The recently developed helix-based RNA folding theory are suitable to calculate folding kinetics of RNA secondary structure with pseudoknots for long RNA sequences.[
2. Helix-based RNA folding kinetics
Due to the incredible complexity of the cellular environment, studying RNA folding almost exclusively starts in vitro, with a random, unfolded chain in an optimal condition (a suitable ionic concentration and temperature).[
2.1. Refolding kinetics
Opening/closing a base stack is the most elementary step in RNA folding, which has been studied by molecule dynamic simulations.[
A basic process in RNA folding is helix formation, which includes closing several consecutive stacks. After the first few stacks are formed, closing the subsequent stacks in the helix could be fast, as the rate of stack formation is larger than that of stack disruption (except the first stack). It is reasonable and efficient to use helices as elementary units for studying the overall folding kinetics of RNAs.[
In the conformation space, an elementary kinetic move between two structures is forming, disrupting a helix or exchange between two helices. If two structures only have one different helix, they can directly transit to each other by formation or disruption of the helix via the zipping pathway. This kind of pathway is the most probable pathway for helix formation, because breaking an existing stack or forming another distant stack is much slower than formation of a neighboring stack. For example, by forming the red helix, structure A can fold to B through the zipping pathway in Fig. 1 with a rate of
Figure 1.Transitions between states (A, B, C) through formation (A to B), disruption (B to A) of a helix (red), and exchange between two helices in A (green) and C (the left/right shoulder of the helix is colored black/green). The relevant pathways labeled along the arrow are shown in the bottom boxes, where the dotted dark lines denote the schematic energy landscape of zipping and tunneling pathways. The unfolding-refolding pathway are shown with gray color, U is the unfolded, open chain.
When two helices overlap with each other, direct transition between them is helix exchange through the unfolding-refolding or tunneling pathways (see Fig. 1). Compared to completely unfolding the green helix and then refolding the other, the tunneling pathway where disrupting a stack in A is followed by concurrently forming a stack in C after breaking several stacks in A, returns a much lower transition barrier. The rate constants to disrupt (form) a stack in A (C) are supposed to be ki and
According to the detailed balance condition, all reverse transition rates are equal to the product of relevant forwards transition rates and e−ΔG / kBT, where ΔG is the free energy difference of the two states. After all transition rates are calculated, the population pi (t) of state i over time t can be obtained by solving the master equation
Based on the calculated population distribution, the detailed refolding pathway is identified as follows.[
2.2. Co-transcriptional folding kinetics and transition node approximation
The basic idea to deal with the co-transcriptional folding is dividing the whole transcription process into a series of transcription steps, each of which corresponds to synthesis one nucleotide.[
The folding kinetics of each step is calculated in a similar way to that in refolding. Here, we use a certain step M as an example to illustrate. At the M-th transcription step, the RNA chain has M nucleotides available to form structures. The newly transcribed nucleotide is the M-th nucleotide, which could extend the 3’ single-strand tail, pair with an upstream nucleotide to elongate a helix or form a new helix. The conformation space for this M-nt chain, free energies of all possible states and transition rates are first obtained as described earlier. Then, the master equation is solved to get the population kinetics within this step, where the initial condition is determined by the structure relationship between the two adjacent steps (step M and M − 1). If a state has one or more newly formed helices, its initial population at this step will be zero. Otherwise, the initial population is equal to its end population of step M − 1.
When RNA molecule increases in size, it still generates a large conformation space, which could low the calculation efficiency. In general, when more nucleotides are released, the initial population of each following step mostly concentrated in several metastable states. If these states formed before the current step, are much more stable than the newly formed states, it will be impossible for RNA to fold the new states. By searching the possible transitions, we can therefore neglect these structures except those at the main folding pathways, which can contribute to population flow. The approximation can efficiently reduce the conformation space especially after around the 120-th transcription step. It makes predictions for long RNA sequences with large conformation ensembles become possible and computationally viable.
3. The application of the helix-based RNA folding kinetics
For a certain RNA molecule, its biological function relies heavily upon the folding process. To make a careful analysis of RNA folding process therefore becomes a core issue and prerequisite in exploring the cellular activities. The analysis on RNA folding process inevitably concerns the information of main folding pathways and associated structures. For functional mRNAs, such as riboswitches and HDV, all these mentioned information can be provided by using the helix-based RNA folding theory.
3.1. Refolding and co-transcriptional folding of HDV ribozyme
The virulence of hepatitis B virus infections could be accelerated and enhanced by co-infection or super-infection with HDV, a human pathogen.[
Figure 2.The main pathways of HDV ribozyme under two different scenarios: refolding (a) and co-transcriptional folding (b). Upper and lowercase letters denote the ribozyme region and the flanking region. The unpaired nucleotides in the external loop are simply described by dotted lines in panel (a). The rate-limited transition in the slow refolding pathway panel (a) and the main co-transcriptional transition with net flux about 90% (b) are shown with red and green arrows respectively. Except the different RNA lengths in panels (a) and (b), structure model of states denoted inside and outside parentheses in panel (b) are the same.
Early experiment suggested that the self-cleavage of HDV in vitro is bi-phasic: about 30% RNAs fold into the native structure N in around 15 s and the rest slowly cleavages in the next 30 minute (min).[
Different from the refolding behaviors, the main folding pathway is from C4 to C6 then to state N with flowing population of 90% under a transcription of 15 nt/s (see Fig. 2(b)).[
In vivo, ribozymes are often embedded in large molecules with flanking sequences. These sequences are not essential for catalysis, but their presence has a significant effect on the folding of HDV and other ribozymes.[
3.2. The regulation mechanisms of riboswitches
As genetic control elements, riboswitches can regulate gene expression via a signal-dependent change in RNA structure.[
To mimic the effect of external triggers, ligand binding kinetics is incorporated into the helix-based RNA folding theory.[
3.2.1. Kinetically controlled riboswitches
Among the more than 30 discovered riboswitch species, the yjdF riboswitch belongs to a new riboswitch class which senses natural azaaromatics that are toxic to the host cells.[
According to the predicted co-transcriptional folding behaviors,[
Figure 3.The co-transcriptional folding behaviors of the yjdF riboswitch from
As the transition rate from OFF state to the pocket structure closes to the mRNA decay rate (kdecay = 3 min−1),[
Unlike the translational addA riboswitch,[
Figure 4.Structure transitions on main co-transcriptional folding pathways of the pbuE riboswitch. T is the terminator hairpin. Nucleotides within helix regions of the aptamer structure and the pause site are colored differently.
3.2.2. Thermodynamically driven riboswitches
The thiamine pyrophosphate (TPP) riboswitch in NMT1 mRNA from N. crassa is a typical representative that regulates gene expression by controlling mRNA splicing.[
Figure 5.Regulatory behaviors of the TPP (a) and
According to the experimental observations, TPP and the mutation in helix P3 can switch genetic off separately.[
In addition to these common features shared by thermodynamically controlled riboswitches, the TPP and E. faecalisSMK riboswitch own some unique characters because they only have one single domain.[
4. Conclusion and perspectives
RNA folding process is a crucial step in functional characterization and structural biology. As intermediate structures formed and transited fast in this process, it mounts a great challenge to fully monitor folding pathways under different cellar conditions. Based on RNA secondary structure, the helix-based RNA folding theory has been developed to explore folding behaviors of several riboswitches and HDV. The good agreement with experiments suggests it becomes a reliable tool to simulate RNA folding directly in a variety of RNA structures, including structures with pseudoknot. Compared to the recently developed CRKR resampling algorithm which needs to run the master equation for the whole chain,[
Acknowledgment
Acknowledgment. The numerical calculations related to our work in this review were performed on the supercomputing system in the Supercomputing Center of Wuhan University.
[1] X Zhuang. Science, 288, 2048(2000).
[2] C A Strulson, R C Molden, C D Keating, P C Bevilacqua. Nat. Chem., 4, 941(2012).
[3] R Das, J Karanicolas, D Baker. Nat. Methods, 7, 291(2010).
[4] U Förster, J E Weigand, P Trojanowski, B Suess, J Wachtveitl. Nucleic Acids Res., 40, 1807(2012).
[5] B Gong, D Klein. J. Am. Chem. Soc., 133(2011).
[6] L A Marraffini, E J Sontheimer. Nat. Rev. Genet., 11, 181(2010).
[7] F Schluenzen, A Tocilj, R Zarivach, J Harms, M Gluehmann, D Janell, A Bashan, H Bartels, I Agmon, F Franceschi, A Yonath. Cell, 102, 615(2000).
[8] P Nissen, J Hansen, N Ban, P B Moore, T A Steitz. Science, 289, 920(2000).
[9] S Ahmad, S Muthukumar, S K Kuncha, S B Routh, A S K Yerabham, T Hussain, V Kamarthapu, S P Kruparani, R Sankaranarayanan. Nat. Commun., 6, 1(2015).
[10] G Zhong, H Wang, W He, Y Li, H Mou, Z J Tickner, M H Tran, T Ou, Y Yin, H Diao, M Farzan. Nat. Biotechnol., 38, 169(2020).
[11] B T Wimberly, D E Brodersen, W M Clemons, R J Morgan-Warren, A P Carter, C Vonrhein, T Hartsch, V Ramakrishnan. Nature, 407, 327(2000).
[12] D Herschlag. J. Biol. Chem., 270(1995).
[13] M Geis, C Flamm, M T Wolfinger, A Tanzer, I L Hofacker, M Middendorf, C Mandl, P F Stadler, C Thurner. J. Mol. Biol., 379, 160(2008).
[14] D Thirumalai, C Hyeon. Biochemistry, 44, 4957(2005).
[15] K L Frieda, S M Block. Science, 338, 397(2012).
[16] S Gong, Y J Wang, W B Zhang. J. Chem. Phys., 143(2015).
[17] R A Poot, N V Tsareva, I V Boni, J van Duin. Proc. Natl. Acad. Sci., 94(1997).
[18] K Gerdes, E G H Wagner. Curr. Opin. Microbiol., 10, 117(2007).
[19] A Ren, K R Rajashankar, D J Patel. Nature, 486, 85(2012).
[20] G Zemora, C Waldsich. RNA Biol., 7, 634(2010).
[21] S DebRoy, M Gebbie, A Ramesh, J R Goodson, M R Cruz, A van Hoof, W C Winkler, D A Garsin. Science, 345, 937(2014).
[22] J F Lemay, G Desnoyers, S Blouin, B Heppell, L Bastet, P St-Pierre, E Massé, D A Lafontaine. PLoS Genet., 7(2011).
[23] R Schroeder, R Grossberger, A Pichler, C Waldsich. Curr. Opin. Struct. Biol., 12, 296(2002).
[24] T N Wong, T Pan. Methods Enzymol., 468, 167(2009).
[25] L Lubkowska, A S Maharjan, N Komissarova. J. Biol. Chem., 286(2011).
[26] J Boyle, G T Robillard, S H Kim. J. Mol. Biol., 139, 601(1980).
[27] R Nussinov, I Tinoco. J. Mol. Biol., 151, 519(1981).
[28] L Zhang, P Bao, M J Leibowitz, Y Zhang. RNA, 15, 1986(2009).
[29] T N Wong, T R Sosnick, T Pan. Proc. Natl. Acad. Sci., 104(2007).
[30] T Pan, I Artsimovitch, X W Fang, R Landick, T R Sosnick. Proc. Natl. Acad. Sci., 96, 9545(1999).
[31] S L Heilman-Miller, S A Woodson. RNA, 9, 722(2003).
[32] T R Cech. Ann. Rev. Biochem., 59, 543(1990).
[33] F Michel. Ann. Rev. Biochem., 64, 435(1995).
[34] B Lutz, M Faber, A Verma, S Klumpp, A Schug. Nucleic Acids Res., 42, 2687(2014).
[35] B Sauerwine, M Widom. Phys. Rev. E, 84(2011).
[36] M Faber, S Klumpp. Phys. Rev. E, 88(2013).
[37] L V Danilova, D D Pervouchine, A V Favorov, A A Mironov. J. Bioinform. Comput. Biol., 4, 589(2006).
[38] I L Hofacker, C Flamm, C Heine, M T Wolfinger, G Scheuermann, P F Stadler. RNA, 16, 1308(2010).
[39] A Xayaphoummine, T Bucher, F Thalmann, H Isambert. Proc. Natl. Acad. Sci., 100(2003).
[40] A Xayaphoummine, T Bucher, H Isambert. Nucleic Acids Res., 33, 605(2005).
[41] S Gong, Y J Wang, W B Zhang. J. Chem. Phys., 142(2015).
[42] J Zhao, L Hyman, C Moore. Microbiol. Mol. Biol. Rev., 63, 405(1999).
[43] P N Zhao, W B Zhang, S J Chen. J. Chem. Phys., 135(2011).
[44] S Gong, Y L Wang, Z Wang, Y J Wang, W B Zhang. J. Theor. Biol., 439, 152(2018).
[45] J W Chen, W B Zhang. J. Chem. Phys., 137(2012).
[46] J W Chen, S Gong, Y J Wang, W B Zhang. J. Chem. Phys., 140(2014).
[47] Y L Wang, Z Wang, T G Liu, S Gong, W B Zhang. RNA, 24, 1229(2018).
[48] S Gong, Y J Wang, Z Wang, Y L Wang, W B Zhang. J. Phys. Chem. B, 120(2016).
[49] F Colizzi, G Bussi. J. Am. Chem. Soc., 134, 5173(2012).
[50] X Xu, T Yu, S J Chen. Proc. Natl. Acad. Sci. USA, 113, 116(2016).
[51] Y J Wang, Z Wang, Y L Wang, W B Zhang. Chin. Phys. B, 26(2017).
[52] Y J Wang, S Gong, Z Wang, W B Zhang. J. Chem. Phys., 144(2016).
[53] Y J Wang, T G Liu, T Yu, Z J Tan, W B Zhang. RNA, 26, 470(2020).
[54] Y J Wang, Z Wang, Y L Wang, T G Liu, W B Zhang. J. Chem. Phys., 148(2018).
[55] W B Zhang, S J Chen. Biophys. J., 90, 765(2006).
[56] P N Zhao, W B Zhang, S J Chen. Biophys. J., 98, 1617(2010).
[57] T Xia, J SantaLucia, M E Burkard, R Kierzek, S J Schroeder, X Jiao, C Cox, D H Turner. Biochemistry, 37(1998).
[58] S Urban, R Bartenschlager, R Kubitz, F Zoulim. Gastroenterology, 147, 48(2014).
[59] A Diegelman-Parente, P C Bevilacqua. J. Mol. Biol., 324, 1(2002).
[60] D M Chadalavada, S E Senchak, P C Bevilacqua. J. Mol. Biol., 317, 559(2002).
[61] T B Macnaughton, S T Shi, L E Modahl, M M C Lai. J. Virol., 76, 3920(2002).
[62] D M Chadalavada, S M Knudsen, S Nakano, P C Bevilacqua. J. Mol. Biol., 301, 349(2000).
[63] S A Woodson, V L Emerick. Mol. Cell. Biol., 13, 1137(1993).
[64] Y Cao, S A Woodson. RNA, 6, 1248(2000).
[65] D M Chadalavada, A L Cerrone-Szakal, P C Bevilacqua. RNA, 13, 2189(2007).
[66] V Delfosse, P Bouchard, E Bonneau, P Dagenais, S Centre-ville. Nucleic Acids Res., 38, 2057(2010).
[67] S P Hennelly, I V Novikova, K Y Sanbonmatsu. Nucleic Acids Res., 41, 1922(2013).
[68] G A Perdrizet, I Artsimovitch, R Furman, T R Sosnick, T Pan. Proc. Natl. Acad. Sci. USA, 109, 3323(2012).
[69] J R Mellin, M Koutero, D Dar, M-A Nahori, R Sorek, P Cossart. Science, 345, 940(2014).
[70] J Feng, N G Walter, C L Brooks. J. Am. Chem. Soc., 133, 4196(2011).
[71] E Kierzek, R Kierzek. J. Biol. Chem., 295, 2568(2020).
[72] B Strobel, M Spöring, H Klein, D Blazevic, W Rust, S Sayols, J S Hartig, S Kreuz. Nat. Commun., 11, 714(2020).
[73] Z Weinberg, J X Wang, J Bogue, J Yang, K Corbino, R H Moy, R R Breaker. Genome Biol., 11, R31(2010).
[74] R R Breaker. Cold Spring Harb. Perspect. Biol., 4, 1(2012).
[75] S Li, X Y Hwang, S Stav, R R Breaker. RNA, 22, 530(2016).
[76] S M Studer, S Joseph. Mol. Cell, 22, 105(2006).
[77] J C Lin, J Yoon, C Hyeon, D Thirumalai. Methods in Enzymology, 235-258(2015).
[78] A Reining, S Nozinovic, K Schlepckow, F Buhr, B Fürtig, H Schwalbe. Nature, 499, 355(2013).
[79] A Wachter, M Tunc-Ozdemir, B C Grove, P J Green, D K Shintani, R R Breaker. Plant Cell, 19, 3437(2007).
[80] M T Cheah, A Wachter, N Sudarsan, R R Breaker. Nature, 447, 497(2007).
[81] J C Lin, D Thirumalai. J. Am. Chem. Soc., 135(2013).
[82] R Rieder, K Lang, D Graber, R Micura. ChemBioChem, 8, 896(2007).
[83] C Lu, A M Smith, F Ding, A Chowdhury, T M Henkin, A Ke. J. Mol. Biol., 409, 786(2011).
[84] W Huang, J Kim, S Jha, F Aboul-ela. J. Mol. Biol., 418, 331(2012).
[85] T Sun, C Zhao, S J Chen. J. Phys. Chem. B, 122, 7484(2018).
[86] O M Ottink, S M Rampersad, M Tessari, G J R Zaman, H A Heus, S S Wijmenga. RNA, 13, 2202(2007).
[87] A M Soto, V Misra, D E Draper. Biochemistry, 46, 2973(2007).
[88] Z J Tan, S J Chen. Biophys. J., 101, 176(2011).
[89] Z J Tan, S J Chen. Biophys. J., 99, 1565(2010).
Get Citation
Copy Citation Text
Sha Gong, Taigang Liu, Yanli Wang, Wenbing Zhang. The theory of helix-based RNA folding kinetics and its application[J]. Chinese Physics B, 2020, 29(10):
Received: Jun. 29, 2020
Accepted: --
Published Online: Apr. 21, 2021
The Author Email: Wenbing Zhang (wbzhang@whu.edu.cn)