PDBe: improved findability of macromolecular structure data in the PDB

2020 Jan 08; 48 (D1): D335-D343.

Language: English; Country: Great Britain, England; Medium: print

Document type: journal article, grant-supported research

Persistent link: https://www.medvik.cz/link/pmid31691821

Grant support
Wellcome Trust - United Kingdom
104948 Wellcome Trust - United Kingdom
BB/G022577/1 Biotechnology and Biological Sciences Research Council - United Kingdom

The Protein Data Bank in Europe (PDBe), a founding member of the Worldwide Protein Data Bank (wwPDB), actively participates in the deposition, curation, validation, archiving and dissemination of macromolecular structure data. PDBe supports diverse research communities in their use of macromolecular structures by enriching the PDB data and by providing advanced tools and services for effective data access, visualization and analysis. This paper details the enrichment of data at PDBe, including mapping of RNA structures to Rfam, and identification of molecules that act as cofactors. PDBe has developed an advanced search facility with ∼100 data categories and sequence searches. New features have been included in the LiteMol viewer at PDBe, with updated visualization of carbohydrates and nucleic acids. Small molecules are now mapped more extensively to external databases and their visual representation has been enhanced. These advances help users to more easily find and interpret macromolecular structure data in order to solve scientific problems.

Comment in

10.1093/nar/gkz853 PubMed



Latest 20 citations...


R2DT: a comprehensive platform for visualizing RNA secondary structure

2025 Feb 08; 53 (4).

Genomics 2 Proteins portal: a resource and discovery tool for linking genetic screening outputs to protein sequences and structures

2024 Oct; 21 (10): 1947-1957. [epub] 20240918

R2DT: A COMPREHENSIVE PLATFORM FOR VISUALISING RNA SECONDARY STRUCTURE

2024 Sep 30. [epub] 20240930

Dataset from a human-in-the-loop approach to identify functionally important protein residues from literature

2024 Sep 27; 11 (1): 1032. [epub] 20240927

Analysis and Visualization of Protein Channels, Tunnels, and Pores with MOLEonline and ChannelsDB 2.0

ChannelsDB 2.0: a comprehensive database of protein tunnels and pores in AlphaFold era

2024 Jan 05; 52 (D1): D413-D418.

PDBImages: a command-line tool for automated macromolecular structure visualization

2023 Dec 01; 39 (12).

Miniature RNAs are embedded in an exceptionally protein-rich mitoribosome via an elaborate assembly pathway

2023 Jul 07; 51 (12): 6443-6460.

Mol* Volumes and Segmentations: visualization and interpretation of cell imaging data alongside macromolecular structure data and biological annotations

2023 Jul 05; 51 (W1): W326-W330.

The CCP4 suite: integrative software for macromolecular crystallography

2023 Jun 01; 79 (Pt 6): 449-461. [epub] 20230530

PDBe and PDBe-KB: Providing high-quality, up-to-date and integrated resources of macromolecular structures to support basic and applied research and education

2022 Oct; 31 (10): e4439.

2DProts: database of family-wide protein secondary structure diagrams

2021 Dec 07; 37 (23): 4599-4601.

R2DT is a framework for predicting and visualising RNA secondary structure using templates

2021 Jun 09; 12 (1): 3494. [epub] 20210609

MISCAST: MIssense variant to protein StruCture Analysis web SuiTe

2020 Jul 02; 48 (W1): W132-W139.
