Nejvíce citovaný článek - PubMed ID 29372899
A DNA structural alphabet provides new insight into DNA flexibility
Nine new crystal structures of CG-rich DNA 18-mers with the sequence 5'-GGTGGGGGC-XZ-GCCCCACC-3', which are related to the bacterial repetitive extragenic palindromes, are reported. 18-mer oligonucleotides with the central XZ dinucleotide systematically mutated to all 16 sequences show complex behavior in solution, but all ten so far successfully crystallized 18-mers crystallized as A-form duplexes. The refinement protocol benefited from the recurrent use of geometries of the dinucleotide conformer (NtC) classes as refinement restraints in regions of poor electron density. The restraints are automatically generated at the dnatco.datmos.org web service and are available for download. This NtC-driven protocol significantly helped to stabilize the structure refinement. The NtC-driven refinement protocol can be adapted to other low-resolution data such as cryo-EM maps. To test the quality of the final structural models, a novel validation method based on comparison of the electron density and conformational similarity to the NtC classes was employed.
- Klíčová slova
- DNA structure, base pairing, dnatco.datmos.org, structure refinement, structure validation,
- MeSH
- DNA * chemie MeSH
- elektronová kryomikroskopie metody MeSH
- konformace nukleové kyseliny MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- DNA * MeSH
Water plays an important role in stabilizing the structure of DNA and mediating its interactions. Here, the hydration of DNA was analyzed in terms of dinucleotide fragments from an ensemble of 2727 nonredundant DNA chains containing 41 853 dinucleotides and 316 265 associated first-shell water molecules. The dinucleotides were classified into categories based on their 16 sequences and the previously determined structural classes known as nucleotide conformers (NtCs). The construction of hydrated dinucleotide building blocks allowed dinucleotide hydration to be calculated as the probability of water density distributions. Peaks in the water densities, known as hydration sites (HSs), uncovered the interplay between base and sugar-phosphate hydration in the context of sequence and structure. To demonstrate the predictive power of hydrated DNA building blocks, they were then used to predict hydration in an independent set of crystal and NMR structures. In ten tested crystal structures, the positions of predicted HSs and experimental waters were in good agreement (more than 40% were within 0.5 Å) and correctly reproduced the known features of DNA hydration, for example the `spine of hydration' in B-DNA. Therefore, it is proposed that hydrated building blocks can be used to predict DNA hydration in structures solved by NMR and cryo-EM, thus providing a guide to the interpretation of experimental data and computer models. The data for the hydrated building blocks and the predictions are available for browsing and visualization at the website https://watlas.datmos.org/watna/.
- Klíčová slova
- DNA hydration, WatNA, dinucleotide fragments, knowledge-based prediction, water,
- MeSH
- DNA * chemie MeSH
- konformace nukleové kyseliny MeSH
- nukleotidy MeSH
- voda * chemie MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- DNA * MeSH
- nukleotidy MeSH
- voda * MeSH
In this review, we describe the creation of the Nucleic Acid Database (NDB) at Rutgers University and how it became a testbed for the current infrastructure of the RCSB Protein Data Bank. We describe some of the special features of the NDB and how it has been used to enable research. Plans for the next phase as the Nucleic Acid Knowledgebase (NAKB) are summarized.
- Klíčová slova
- DNA, RNA, biological structure database, nucleic acid conformation, nucleic acid structures, validation standards,
- Publikační typ
- časopisecké články MeSH
- přehledy MeSH
Solution and crystal data are reported for DNA 18-mers with sequences related to those of bacterial noncoding single-stranded DNA segments called repetitive extragenic palindromes (REPs). Solution CD and melting data showed that the CG-rich, near-palindromic REPs from various bacterial species exhibit dynamic temperature-dependent and concentration-dependent equilibria, including architectures compatible with not only hairpins, which are expected to be biologically relevant, but also antiparallel duplexes and bimolecular tetraplexes. Three 18-mer oligonucleotides named Hpar-18 (PDB entry 6rou), Chom-18 (PDB entry 6ros) and its brominated variant Chom-18Br (PDB entry 6ror) crystallized as isomorphic right-handed A-like duplexes. The low-resolution crystal structures were solved with the help of experimental phases for Chom-18Br. The center of the duplexes is formed by two successive T-T noncanonical base pairs (mismatches). They do not deform the double-helical geometry. The presence of T-T mismatches prompted an analysis of the geometries of these and other noncanonical pairs in other DNA crystals in terms of their fit to the experimental electron densities (RSCC) and their geometric fit to the NtC (dinucleotide conformational) classes (https://dnatco.datmos.org/). Throughout this work, knowledge of the NtC classes was used to refine and validate the crystal structures, and to analyze the mismatches.
- Klíčová slova
- CD spectra, DNA structure, REPs, T–T mismatch, crystal structure, noncanonical base pairs, repetitive extragenic palindromes,
- MeSH
- Cardiobacterium genetika MeSH
- DNA bakterií chemie MeSH
- Haemophilus parasuis genetika MeSH
- molekulární modely MeSH
- molekulární struktura * MeSH
- nukleotidové motivy * MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- DNA bakterií MeSH
A detailed description of the dnatco.datmos.org web server implementing the universal structural alphabet of nucleic acids is presented. It is capable of processing any mmCIF- or PDB-formatted files containing DNA or RNA molecules; these can either be uploaded by the user or supplied as the wwPDB or PDB-REDO structural database access code. The web server performs an assignment of the nucleic acid conformations and presents the results for the intuitive annotation, validation, modeling and refinement of nucleic acids.
- Klíčová slova
- annotation, nucleic acids, refinement, structural alphabets, validation,
- MeSH
- databáze nukleových kyselin MeSH
- DNA chemie MeSH
- internet MeSH
- konformace nukleové kyseliny MeSH
- molekulární modely MeSH
- RNA chemie MeSH
- software * MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- DNA MeSH
- RNA MeSH
By analyzing almost 120 000 dinucleotides in over 2000 nonredundant nucleic acid crystal structures, we define 96+1 diNucleotide Conformers, NtCs, which describe the geometry of RNA and DNA dinucleotides. NtC classes are grouped into 15 codes of the structural alphabet CANA (Conformational Alphabet of Nucleic Acids) to simplify symbolic annotation of the prominent structural features of NAs and their intuitive graphical display. The search for nontrivial patterns of NtCs resulted in the identification of several types of RNA loops, some of them observed for the first time. Over 30% of the nearly six million dinucleotides in the PDB cannot be assigned to any NtC class but we demonstrate that up to a half of them can be re-refined with the help of proper refinement targets. A statistical analysis of the preferences of NtCs and CANA codes for the 16 dinucleotide sequences showed that neither the NtC class AA00, which forms the scaffold of RNA structures, nor BB00, the DNA most populated class, are sequence neutral but their distributions are significantly biased. The reported automated assignment of the NtC classes and CANA codes available at dnatco.org provides a powerful tool for unbiased analysis of nucleic acid structures by structural and molecular biologists.
- MeSH
- biokatalýza MeSH
- DNA chemie klasifikace MeSH
- konformace nukleové kyseliny * MeSH
- nukleotidové motivy * MeSH
- nukleotidy chemie klasifikace MeSH
- reprodukovatelnost výsledků MeSH
- riboswitch MeSH
- ribozomy chemie metabolismus MeSH
- RNA katalytická chemie metabolismus MeSH
- RNA chemie klasifikace MeSH
- vazebná místa MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- DNA MeSH
- nukleotidy MeSH
- riboswitch MeSH
- RNA katalytická MeSH
- RNA MeSH
Structural bioinformatics provides the scientific methods and tools to analyse, archive, validate, and present the biomolecular structure data generated by the structural biology community. It also provides an important link with the genomics community, as structural bioinformaticians also use the extensive sequence data to predict protein structures and their functional sites. A very broad and active community of structural bioinformaticians exists across Europe, and 3D-Bioinfo will establish formal platforms to address their needs and better integrate their activities and initiatives. Our mission will be to strengthen the ties with the structural biology research communities in Europe covering life sciences, as well as chemistry and physics and to bridge the gap between these researchers in order to fully realize the potential of structural bioinformatics. Our Community will also undertake dedicated educational, training and outreach efforts to facilitate this, bringing new insights and thus facilitating the development of much needed innovative applications e.g. for human health, drug and protein design. Our combined efforts will be of critical importance to keep the European research efforts competitive in this respect. Here we highlight the major European contributions to the field of structural bioinformatics, the most pressing challenges remaining and how Europe-wide interactions, enabled by ELIXIR and its platforms, will help in addressing these challenges and in coordinating structural bioinformatics resources across Europe. In particular, we present recent activities and future plans to consolidate an ELIXIR 3D-Bioinfo Community in structural bioinformatics and propose means to develop better links across the community. These include building new consortia, organising workshops to establish data standards and seeking community agreement on benchmark data sets and strategies. We also highlight existing and planned collaborations with other ELIXIR Communities and other European infrastructures, such as the structural biology community supported by Instruct-ERIC, with whom we have synergies and overlapping common interests.
- Klíčová slova
- ELIXIR, Instruct-ERIC, biomolecular structure, nucleic acids structure, protein structure, structural bioinformatics,
- MeSH
- biologické vědy * MeSH
- genomika MeSH
- lidé MeSH
- proteiny MeSH
- výpočetní biologie organizace a řízení MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Geografické názvy
- Evropa MeSH
- Názvy látek
- proteiny MeSH
In this article, we present a method for the enhanced molecular dynamics simulation of protein and DNA systems called potential of mean force (PMF)-enriched sampling. The method uses partitions derived from the potentials of mean force, which we determined from DNA and protein structures in the Protein Data Bank (PDB). We define a partition function from a set of PDB-derived PMFs, which efficiently compensates for the error introduced by the assumption of a homogeneous partition function from the PDB datasets. The bias based on the PDB-derived partitions is added in the form of a hybrid Hamiltonian using a renormalization method, which adds the PMF-enriched gradient to the system depending on a linear weighting factor and the underlying force field. We validated the method using simulations of dialanine, the folding of TrpCage, and the conformational sampling of the Dickerson⁻Drew DNA dodecamer. Our results show the potential for the PMF-enriched simulation technique to enrich the conformational space of biomolecules along their order parameters, while we also observe a considerable speed increase in the sampling by factors ranging from 13.1 to 82. The novel method can effectively be combined with enhanced sampling or coarse-graining methods to enrich conformational sampling with a partition derived from the PDB.
- Klíčová slova
- DNA simulation, enhanced molecular dynamics simulations, protein folding,
- MeSH
- databáze proteinů * MeSH
- DNA * chemie genetika MeSH
- počítačová simulace * MeSH
- sbalování proteinů * MeSH
- simulace molekulární dynamiky * MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- DNA * MeSH