protein domains
Dotaz
Zobrazit nápovědu
Domains are distinct units within proteins that typically can fold independently into recognizable three-dimensional structures to facilitate their functions. The structural and functional independence of protein domains is reflected by their apparent modularity in the context of multi-domain proteins. In this work, we examined the coupling of evolution of domain sequences co-occurring within multi-domain proteins to see if it proceeds independently, or in a coordinated manner. We used continuous information theory measures to assess the extent of correlated mutations among domains in multi-domain proteins from organisms across the tree of life. In all multi-domain architectures we examined, domains co-occurring within protein sequences had to some degree undergone concerted evolution. This finding challenges the notion of complete modularity and independence of protein domains, providing new perspective on the evolution of protein sequence and function.
- MeSH
- biologické modely * MeSH
- informační teorie MeSH
- molekulární evoluce * MeSH
- proteinové domény * MeSH
- proteiny genetika metabolismus MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- proteiny MeSH
Light-Oxygen-Voltage (LOV) domains are the protein-based light switches used in nature to trigger and regulate various processes. They allow light signals to be converted into metabolic signaling cascades. Various LOV-domain proteins have been characterized in the last few decades and have been used to develop light-sensitive tools in cell biology research. LOV-based applications exploit the light-driven regulation of effector elements to activate signaling pathways, activate genes, or locate proteins within cells. A relatively new application of an engineered small LOV-domain protein called miniSOG (mini singlet oxygen generator) is based on the light-induced formation of reactive oxygen species (ROS). The first miniSOG was engineered from a LOV domain from Arabidopsis thaliana. This engineered 14 kDa light-responsive flavin-containing protein can be exploited as protein tag for the light-triggered localized production of ROS. Such tunable ROS production by miniSOG or similarly redesigned LOV-domains can be of use in studies focused on subcellular phenomena but may also allow new light-fueled catalytic processes. This review provides an overview of the discovery of LOV domains and their development into tools for cell biology. It also highlights recent advancements in engineering LOV domains for various biotechnological applications and cell biology studies.
- Klíčová slova
- LOV domain, Light-responsive proteins, Optogenetics, Protein engineering, Protein localization, Reactive oxygen species, miniSOG,
- MeSH
- Arabidopsis genetika metabolismus MeSH
- lidé MeSH
- proteinové domény MeSH
- proteinové inženýrství * metody MeSH
- proteiny huseníčku genetika chemie metabolismus MeSH
- reaktivní formy kyslíku metabolismus MeSH
- světlo MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- přehledy MeSH
- Názvy látek
- proteiny huseníčku MeSH
- reaktivní formy kyslíku MeSH
Genetic variation occurring within conserved functional protein domains warrants special attention when examining DNA variation in the context of disease causation. Here we introduce a resource, freely available at www.prot2hg.com, that addresses the question of whether a particular variant falls onto an annotated protein domain and directly translates chromosomal coordinates onto protein residues. The tool can perform a multiple-site query in a simple way, and the whole dataset is available for download as well as incorporated into our own accessible pipeline. To create this resource, National Center for Biotechnology Information protein data were retrieved using the Entrez Programming Utilities. After processing all human protein domains, residue positions were reverse translated and mapped to the reference genome hg19 and stored in a MySQL database. In total, 760 487 protein domains from 42 371 protein models were mapped to hg19 coordinates and made publicly available for search or download (www.prot2hg.com). In addition, this annotation was implemented into the genomics research platform GENESIS in order to query nearly 8000 exomes and genomes of families with rare Mendelian disorders (tgp-foundation.org). When applied to patient genetic data, we found that rare (<1%) variants in the Genome Aggregation Database were significantly more annotated onto a protein domain in comparison to common (>1%) variants. Similarly, variants described as pathogenic or likely pathogenic in ClinVar were more likely to be annotated onto a domain. In addition, we tested a dataset consisting of 60 causal variants in a cohort of patients with epileptic encephalopathy and found that 71% of them (43 variants) were propagated onto protein domains. In summary, we developed a resource that annotates variants in the coding part of the genome onto conserved protein domains in order to increase variant prioritization efficiency. Database URL: www.prot2hg.com.
- MeSH
- anotace sekvence metody MeSH
- data mining metody MeSH
- databáze genetické * MeSH
- datové kurátorství metody MeSH
- genetická variace * MeSH
- genom lidský genetika MeSH
- genomika metody MeSH
- internet MeSH
- lidé MeSH
- proteinové domény genetika MeSH
- proteiny chemie genetika metabolismus MeSH
- výpočetní biologie metody MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- proteiny MeSH
Proteins are naturally formed by domains edging their functional and structural properties. A domain out of the context of an entire protein can retain its structure and to some extent also function on its own. These properties rationalize construction of artificial fusion multidomain proteins with unique combination of various functions. Information on the specific functional and structural characteristics of individual domains in the context of new artificial fusion proteins is inevitably encoded in sequential order of composing domains defining their mutual spatial positions. So the challenges in designing new proteins with new domain combinations lie dominantly in structure/function prediction and its context dependency. Despite the enormous body of publications on artificial fusion proteins, the task of their structure/function prediction is complex and nontrivial. The degree of spatial freedom facilitated by a linker between domains and their mutual orientation driven by noncovalent interactions is beyond a simple and straightforward methodology to predict their structure with reasonable accuracy. In the presented manuscript, we tested methodology using available modeling tools and computational methods. We show that the process and methodology of such prediction are not straightforward and must be done with care even when recently introduced AlphaFold II is used. We also addressed a question of benchmarking standards for prediction of multidomain protein structures-x-ray or Nuclear Magnetic Resonance experiments. On the study of six two-domain protein chimeras as well as their composing domains and their x-ray structures selected from PDB, we conclude that the major obstacle for justified prediction is inappropriate sampling of the conformational space by the explored methods. On the other hands, we can still address particular steps of the methodology and improve the process of chimera proteins prediction.
- Klíčová slova
- 3D structure prediction, fusion proteins, molecular simulations, x-ray crystallography,
- MeSH
- proteinové domény MeSH
- proteiny * chemie MeSH
- rekombinantní fúzní proteiny * chemie MeSH
- rentgenové záření MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- proteiny * MeSH
- rekombinantní fúzní proteiny * MeSH
This review summarizes available data concerning intradomain structures (IS) such as functionally important amino acid residues, short linear motifs, conserved or disordered regions, peptide repeats, broadly occurring secondary structures or folds, etc. IS form structural features (units or elements) necessary for interactions with proteins or non-peptidic ligands, enzyme reactions and some structural properties of proteins. These features have often been related to a single structural level (e.g. primary structure) mostly requiring certain structural context of other levels (e.g. secondary structures or supersecondary folds) as follows also from some examples reported or demonstrated here. In addition, we deal with some functionally important dynamic properties of IS (e.g. flexibility and different forms of accessibility), and more special dynamic changes of IS during enzyme reactions and allosteric regulation. Selected notes concern also some experimental methods, still more necessary tools of bioinformatic processing and clinically interesting relationships.
- MeSH
- katalytická doména MeSH
- lidé MeSH
- proteiny chemie MeSH
- sekundární struktura proteinů * MeSH
- sekvenční seřazení MeSH
- terciární struktura proteinů MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- přehledy MeSH
- Názvy látek
- proteiny MeSH
Clathrin-mediated endocytosis (CME) is the gatekeeper of the plasma membrane. In contrast to animals and yeasts, CME in plants depends on the TPLATE complex (TPC), an evolutionary ancient adaptor complex. However, the mechanistic contribution of the individual TPC subunits to plant CME remains elusive. In this study, we used a multidisciplinary approach to elucidate the structural and functional roles of the evolutionary conserved N-terminal Eps15 homology (EH) domains of the TPC subunit AtEH1/Pan1. By integrating high-resolution structural information obtained by X-ray crystallography and NMR spectroscopy with all-atom molecular dynamics simulations, we provide structural insight into the function of both EH domains. Both domains bind phosphatidic acid with a different strength, and only the second domain binds phosphatidylinositol 4,5-bisphosphate. Unbiased peptidome profiling by mass-spectrometry revealed that the first EH domain preferentially interacts with the double N-terminal NPF motif of a previously unidentified TPC interactor, the integral membrane protein Secretory Carrier Membrane Protein 5 (SCAMP5). Furthermore, we show that AtEH/Pan1 proteins control the internalization of SCAMP5 via this double NPF peptide interaction motif. Collectively, our structural and functional studies reveal distinct but complementary roles of the EH domains of AtEH/Pan1 in plant CME and connect the internalization of SCAMP5 to the TPLATE complex.
- MeSH
- adaptorové proteiny signální transdukční chemie genetika MeSH
- buněčná membrána metabolismus MeSH
- endocytóza * MeSH
- geneticky modifikované rostliny MeSH
- krystalografie rentgenová MeSH
- membránové proteiny chemie MeSH
- proteinové domény MeSH
- proteiny huseníčku MeSH
- proteiny vázající vápník chemie genetika MeSH
- rostlinné proteiny chemie genetika MeSH
- sekvenční seřazení MeSH
- simulace molekulární dynamiky MeSH
- tabák genetika MeSH
- transport proteinů MeSH
- vazba proteinů * MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- adaptorové proteiny signální transdukční MeSH
- membránové proteiny MeSH
- proteiny huseníčku MeSH
- proteiny vázající vápník MeSH
- rostlinné proteiny MeSH
- TPLATE protein, Arabidopsis MeSH Prohlížeč
Neural precursor cell expressed developmentally down-regulated 4 ligase (Nedd4-2) is an E3 ubiquitin ligase that targets proteins for ubiquitination and endocytosis, thereby regulating numerous ion channels, membrane receptors and tumor suppressors. Nedd4-2 activity is regulated by autoinhibition, calcium binding, oxidative stress, substrate binding, phosphorylation and 14-3-3 protein binding. However, the structural basis of 14-3-3-mediated Nedd4-2 regulation remains poorly understood. Here, we combined several techniques of integrative structural biology to characterize Nedd4-2 and its complex with 14-3-3. We demonstrate that phosphorylated Ser342 and Ser448 are the key residues that facilitate 14-3-3 protein binding to Nedd4-2 and that 14-3-3 protein binding induces a structural rearrangement of Nedd4-2 by inhibiting interactions between its structured domains. Overall, our findings provide the structural glimpse into the 14-3-3-mediated Nedd4-2 regulation and highlight the potential of the Nedd4-2:14-3-3 complex as a pharmacological target for Nedd4-2-associated diseases such as hypertension, epilepsy, kidney disease and cancer.
- MeSH
- down regulace MeSH
- fosforylace MeSH
- myši genetika metabolismus MeSH
- proteiny 14-3-3 genetika metabolismus MeSH
- ubikvitinace MeSH
- ubikvitinligasy Nedd4 genetika metabolismus MeSH
- vazba proteinů MeSH
- WW domény * MeSH
- zvířata MeSH
- Check Tag
- myši genetika metabolismus MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- Nedd4l protein, mouse MeSH Prohlížeč
- proteiny 14-3-3 MeSH
- Sfn protein, mouse MeSH Prohlížeč
- ubikvitinligasy Nedd4 MeSH
The assembly of immature retroviral particles is initiated in the cytoplasm by the binding of the structural polyprotein precursor Gag with viral genomic RNA. The protein interactions necessary for assembly are mediated predominantly by the capsid (CA) and nucleocapsid (NC) domains, which have conserved structures. In contrast, the structural arrangement of the CA-NC connecting region differs between retroviral species. In HIV-1 and Rous sarcoma virus, this region forms a rod-like structure that separates the CA and NC domains, whereas in Mason-Pfizer monkey virus, this region is densely packed, thus holding the CA and NC domains in close proximity. Interestingly, the sequence connecting the CA and NC domains in gammaretroviruses, such as murine leukemia virus (MLV), is unique. The sequence is called a charged assembly helix (CAH) due to a high number of positively and negatively charged residues. Although both computational and deletion analyses suggested that the MLV CAH forms a helical conformation, no structural or biochemical data supporting this hypothesis have been published. Using an in vitro assembly assay, alanine scanning mutagenesis, and biophysical techniques (circular dichroism, NMR, microcalorimetry, and electrophoretic mobility shift assay), we have characterized the structure and function of the MLV CAH. We provide experimental evidence that the MLV CAH belongs to a group of charged, E(R/K)-rich, single α-helices. This is the first single α-helix motif identified in viral proteins.
- Klíčová slova
- capsid protein (CA), charged assembly helix (CAH), circular dichroism (CD), electron microscopy (EM), murine leukemia virus (MLV), nuclear magnetic resonance (NMR), retrovirus, single alpha-helix (SAH), spacer peptide (SP), virus assembly,
- MeSH
- mutageneze MeSH
- myši MeSH
- proteinové domény MeSH
- sekundární struktura proteinů MeSH
- virové plášťové proteiny chemie genetika MeSH
- virus myší leukemie chemie genetika MeSH
- zvířata MeSH
- Check Tag
- myši MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- virové plášťové proteiny MeSH
Transient receptor potential (TRPs) channels are crucial downstream targets of calcium signalling cascades. They can be modulated either by calcium itself and/or by calcium-binding proteins (CBPs). Intracellular messengers usually interact with binding domains present at the most variable TRP regions-N- and C-cytoplasmic termini. Calmodulin (CaM) is a calcium-dependent cytosolic protein serving as a modulator of most transmembrane receptors. Although CaM-binding domains are widespread within intracellular parts of TRPs, no such binding domain has been characterised at the TRP melastatin member-the transient receptor potential melastatin 6 (TRPM6) channel. Another CBP, the S100 calcium-binding protein A1 (S100A1), is also known for its modulatory activities towards receptors. S100A1 commonly shares a CaM-binding domain. Here, we present the first identified CaM and S100A1 binding sites at the N-terminal of TRPM6. We have confirmed the L520-R535 N-terminal TRPM6 domain as a shared binding site for CaM and S100A1 using biophysical and molecular modelling methods. A specific domain of basic amino acid residues (R526/R531/K532/R535) present at this TRPM6 domain has been identified as crucial to maintain non-covalent interactions with the ligands. Our data unambiguously confirm that CaM and S100A1 share the same binding domain at the TRPM6 N-terminus although the ligand-binding mechanism is different.
- Klíčová slova
- CaM and S100A1, TRPM6, binding domain, calmodulin binding motif, fluorescence anisotropy, molecular modelling,
- MeSH
- kalmodulin chemie MeSH
- kationtové kanály TRPM chemie MeSH
- lidé MeSH
- molekulární modely * MeSH
- proteinové domény MeSH
- proteiny S100 chemie MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- kalmodulin MeSH
- kationtové kanály TRPM MeSH
- proteiny S100 MeSH
- S100A1 protein MeSH Prohlížeč
- TRPM6 protein, human MeSH Prohlížeč
The RNA-binding protein La is found in most eukaryotes, and despite being essential in many organisms, its function is not completely clear. Trypanosoma brucei, the causative agent of human African trypanosomiasis, encodes a 'classical' La protein (TbLa) composed of a La-motif, two RNA recognition motifs (RRM1 and RRM2α), a C-terminal short basic motif (SBM), and a nuclear localization signal (NLS). In T. brucei, like in most eukaryotes, position 34 of tRNATyr, -Asp, -Asn and -His is modified with queuosine (Q34). The steady-state levels of queuosine-modified tRNA in the insect form (procyclic) of T. brucei can fluctuate dynamically depending on growth conditions, but the mechanism(s) controlling Q34 levels are not well understood. A well-established function of La is in precursor-tRNA 3'-end metabolism, but in this work, we demonstrate that La also controls Q34-tRNA levels. Individual domain deletions showed that while deletion of La motif or RRM1 causes dysregulation of Q34-tRNA levels, no other domain plays a similar role. We also show that La is important for the normal balance of several additional tRNA modifications. These findings are discussed in the context of substrate competition between La and modification enzymes, also highlighting subcellular localization as a key determinant of tRNA function.
- MeSH
- nukleosid Q metabolismus analogy a deriváty MeSH
- posttranskripční úpravy RNA * MeSH
- proteinové domény MeSH
- proteiny vázající RNA * metabolismus chemie genetika MeSH
- protozoální proteiny * metabolismus chemie genetika MeSH
- RNA transferová * metabolismus genetika MeSH
- Trypanosoma brucei brucei * genetika metabolismus MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- nukleosid Q MeSH
- proteiny vázající RNA * MeSH
- protozoální proteiny * MeSH
- RNA transferová * MeSH