Integration of sequences, structures, dynamics to study functional divergence in homologous proteins and their assemblies
Abstract
Functions of proteins are governed by their sequences, structures, dynamics and assembly. Modulation of these factors during evolution renders functional divergence in related proteins that have originated from the common ancestral gene. In this thesis, we present studies with the broad objective of understanding functional divergence based on information derived from sequences, structures and dynamics of homologous systems. Here, homologous systems refer four hierarchical levels namely domains, individual proteins, protein-protein complexes and assemblies. First, we performed exhaustive investigation in the proteomes of evolutionary related species classified under ‘Afrotheria’ superorder to probe for diversity in the protein sequences. From the study, we characterized evolutionary relationships among six Afrotherian species and identified a number of unique molecular features that may potentially associated to phenotypic features.
To relate differences in sequences to functional divergence, information on 3-D structure as well as dynamics are required. However, unavailability of 3-D structures for most proteins of known sequence demands the use of protein structure models generated using comparative modeling. Before employing protein models for practical applications, we assessed the accuracy of the modelled structures and information on dynamics arrived at using them. We find that structure and dynamics features of comparative models are reliable. With the confidence gained on the reliability, we used protein models of two related enzyme systems (RNA endonucleases and EGFR kinase) to identify contrasting features in dynamics that reconcile with observed difference in biochemical functions.
Subsequently, we extended these investigations on a multi-protein complex, SF3b, an assembly of seven protein spliceosomal subcomplex, for understanding diversity acquired in the sequence of each protein during evolution and associated modulations in the integrity of their assembly structure as well as functional regulation. We demonstrate that a non-conserved protein harbours allosteric role to SF3b function. Finally, we characterize functional state-specific dynamics features of multiple domains present within a protein and cooperativity among domains in the trimeric form of spike protein from SARS-CoV-2. In essence, this thesis integrates sequence, structure, dynamics and functional states information to gain insights on functional divergence of proteins and their assemblies.