In Silico Perspectives on RNA Structures Modulating Viral Gene Expression and Mechanics of tRNA Transport
Abstract
The repertoire of cellular functions mediated by Ribonucleic acid (RNA) molecules have expanded considerably during the last two decades. The role played by RNA in controlling and regulating gene expression in viruses, prokaryotes and eukaryotes has been a matter of continuous investigations. This interest has arisen primarily due to the discoveries of cisacting RNA structures like riboswitches, ribosensors and frameshift elements, which are found in either the 5’-, 3’-untranslated regions of mRNA or in the open reading frames. These structures control gene expression at the level of translation by either sequestering the Shine-Dalgarno (SD) sequence to regulate translation initiation or modulating ribosomal positions during an active translation process. Very often, these structures comprise of an RNA pseudoknot and it has been observed that these pseudoknots exist in a dynamic equilibrium with other intermediate structures. This equilibrium could be shifted by several factors including presence of ions, metabolites, temperature and external force. RNA pseudoknots represent the most versatile and ubiquitous class of RNA structures in the cell, whose unique folding topology could be exploited in a number of ways by the cellular machinery.
In this thesis, a thorough study of programmed -1 ribosomal frameshifting (-1 PRF) process, which is a well known gene regulation event employed by many RNA viruses, was carried out. -1 PRF is a translation recoding process, necessary for viruses to main-tain a stoichiometric ratio of structural: enzymatic proteins. This ratio varies among different viral species. At the heart of this process, lies an RNA pseudoknot accompanied by a seven nucleotide long sequence motif, which pauses an actively translating ribosome on mRNA and causes it to shift its reading frame. The frameshift inducing efficiency of pseudoknot depends on multiple factors, for example the time scale of ribosomal pause and RNA unfolding, subsequent refolding of structure to native/intermediate states and/or environment conditions. With the aim of illustrating the fundamentals of the process, multiple factors involved in -1 PRF were studied. Chapters 2-4 represent distinct aspects of -1 PRF process, while Chapter 5 discusses a different work concerned with nucleocytoplasmic transport of tRNA carried out by nuclear export receptor Exporting.
Chapter 1 gives an overview of the different regulatory activities with which RNA structures and sequences are found to be associated and the evolution of these stud-ies. It discusses the different types of structural motifs found to constitute tertiary RNA structure and secondary structure prediction and determination techniques. A brief description of ab initio RNA structure modeling and other relevant tools and methodologies used in this work has been presented. Details of techniques used in each study have been provided in relevant chapters.
Chapter 2 describes how local factors like ionic conditions, hydration patterns, presence of protonated residues and single residue mutations affect the structural dynamics of an RNA pseudoknot involved in -1 PRF from a plant luteovirus. Single residue mutations in the loop regions or certain base-pair inversions in the stem regions of pseudoknot increase the frameshift inducing ability of the pseudoknot structure, while some others decrease this efficiency. However, it was not clear how the changes made to the wild-type (WT) RNA pseudoknot from Beet Western Yellow Mosaic virus were affecting the global structure in terms of its dynamics and other parameters. To study this, multiple all-atom molecular dynamics simulations (MD) were performed on WT and mutant structures created in silico. The effect of presence and absence of magnesium ions on the structural geometry was also studied. The analysis was done to identify the increase/decrease in the number of hydrogen bonds formed by Watson-Crick base-pairs in stem region or non Watson-Crick pairs between stem and loop. Ionic and water densities were analyzed and the role of potential ribosome-pseudoknot interaction was elaborated.
With the aim of mimicking ribosome induced unfolding of an RNA pseudoknot, steered molecular dynamics pulling experiments were performed. This work was done primarily to understand the unfolding pathway of Hairpin(H)-type pseudoknots in general and the intermediate structures formed. Chapter 3 describes the thermodynamics and mechanics associated with the mechanical pulling of -1 PRF inducing RNA pseudoknot and its mutants described in previous chapter. Analysis of the trajectories reveal relative unfolding patterns in terms of disruption of various hydrogen bonds. This study allowed us to pinpoint the kind of intermediate structures being formed during pulling and whether these intermediate structures correspond to any known secondary structures, such as simple stem-loops. This information could be used for gaining insights into the folding pathways of these structures.
An RNA pseudoknot stimulates -1 PRF in conjunction with a heptanucleotide “slippery site” and an intervening spacer sequence. A comprehensive study of analyzing the sequence signatures and composition of all overlapping gene segments harboring these frameshift elements from four different RNA virus families was carried out. Chapter 4 describes the sequence composition of all overlapping gene segments in Astroviridae, Coronaviridae, Retroviridae and Luteoviridae viral families which are known to employ -1 PRF process for maintaining their protein products. Sequence analysis revealed preference for GC bases in the structure forming sequence regions. A comparative study between multiple sequence alignment and secondary structure prediction revealed that while pseudoknots have a clear preference for specific base-pairs in their stem regions, viral families that employ a hairpin loop as -1 PRF structure, doesn’t show this preference. Information derived from secondary structure prediction was then used for RNA ab initio modeling to generate tertiary structures. Furthermore, the structural parameters were calculated for the helices of the frameshift inducing pseudoknots and were compared with the values calculated for a set of non -1 PRF inducing H-type pseudo-knots. This study highlighted the differences between -1 PRF pseudoknots and other H-type pseudoknot structures as well as specific sequence and structural preferences of the former.
Chapter 5 discusses the dynamics of a tRNA transport factor Exportint (Xpot), which transports mature tRNA molecules from nucleus to cytoplasm and belongs to Importitβ family of proteins. The global conformational dynamics of other transport receptors has been reported earlier, using coarse-grained modeling and Elastic Network Models (ENMs), but a detailed description of the dynamics at an all-atomic resolution was lacking. This transport requires association of Xpot with RanGTP, a G-protein, in the nucleus and hydrolysis of RanGTP in the cytoplasm. The chain of events leading to tRNA release from Xpot after RanGTP hydrolysis was not studied previously. With these objectives, several molecular complexes containing Xpot bound to Ran or tRNA or both in the GTP and GDP ligand states as well as free Xpot structures in nuclear and cytosolic forms were studied. A combination of conventional and accelerated molecular dynamics simulations was used to study these molecular complexes. The study highlighted various aspects associated with tRNA release and conformational change which occurs in Xpot in cytosolic form. The nuclear to cytosolic state transition in Xpot could be attributed to large fluctuations in C-terminal region and dynamic hinge-points located between specific HEAT repeats. A secondary role of Xpot in controlling the quality of tRNA transport has been proposed based on multiple sequence and structure alignment with Importin-β protein. The loss of critical contacts like hydrogen bonds and salt bridges between Xpot/Ran and Xpot/tRNA interface was evaluated in order to study the initial effects of RanGTP hydrolysis and how it influences receptor-cargo binding. This study revealed various aspects of tRNA transport process by Xpot, not understood previously.
The results presented in this thesis illustrate the role of RNA sequence elements and pseudoknots present in RNA viruses in modulating -1 PRF process and how multiple environmental factors affect -1 PRF inducing ability of the structure. From the studies of Xpot and its complexes, the effects of GTP hydrolysis leading to tRNA dissociation have been presented and the progression of conformational transition in Xpot after tRNA dissociation has been highlighted. Chapter 6 summarizes major conclusions of this thesis work.
The refolding of single stranded RNA chains, subjected to a previous unfolding simulation is studied. Appendix A describes this work and initial results. Appendix B describes the effect of improved molecular dynamics force fields, containing corrections for χ torsion angle for RNA, on the conformation of tertiary RNA structures.
Part of the work presented in this thesis has been reported in the following publications.
1.Asmita Gupta and Manju Bansal. Local Structural and Environmental Factors De-fine the Efficiency of an RNA Pseudoknot Involved in Programmed Ribosomal Frameshift Process. J. Phys. Chem. B. 118 (41), pp 11905-11920. 2014
2.Asmita Gupta, Senthilkumar Kailasam and Manju Bansal. Insights Into Nucleo-cytoplasmic Transport of tRNA by Exportin-t. Manuscript under review.
List of manuscripts that are being prepared from the work reported in Chapter 3 in this thesis.
1 Asmita Gupta and Manju Bansal. The role of sequence effects on altering the un-folding pathway of an RNA pseudoknot: a steered molecular dynamics study. Manuscript in preparation.
2 Asmita Gupta and Manju Bansal. Molecular basis for nucleocytoplasmic transport of tRNA by Exportin-t. Journal of Biomolecular Structure and Dynamics, May;33 Suppl 1:59-60, 2015