The IDSM mass spectrometry extension: searching mass spectra using SPARQL

. 2024 Mar 29 ; 40 (4) : .

Status In-Process Jazyk angličtina Země Velká Británie, Anglie Médium print

Typ dokumentu časopisecké články

Perzistentní odkaz   https://www.medvik.cz/link/pmid38561173

Grantová podpora
Ministry of Education
101082304 European Union's Horizon Europe Programme

SUMMARY: The Integrated Database of Small Molecules (IDSM) integrates data from small-molecule datasets, making them accessible through the SPARQL query language. Its unique feature is the ability to search for compounds through SPARQL based on their molecular structure. We extended IDSM to enable mass spectra databases to be integrated and searched for based on mass spectrum similarity. As sources of mass spectra, we employed the MassBank of North America database and the In Silico Spectral Database of natural products. AVAILABILITY AND IMPLEMENTATION: The extension is an integral part of IDSM, which is available at https://idsm.elixir-czech.cz. The manual and usage examples are available at https://idsm.elixir-czech.cz/docs/ms. The source codes of all IDSM parts are available under open-source licences at https://github.com/idsm-src.

Zobrazit více v PubMed

Allard P-M, Bisson J, Rutz A.. ISDB. In Silico Spectral Databases of Natural Products. Zenodo, 2023. 10.5281/zenodo.8287341. DOI

Bansal P, Morgat A, Axelsen KB. et al. Rhea, the reaction knowledgebase in 2022. Nucleic Acids Res 2022;50:D693–700. PubMed PMC

Cote R, Reisinger F, Martens L. et al. The ontology lookup service: bigger and better. Nucleic Acids Res 2010;38:W155–60. PubMed PMC

Coudert E, Gehant S, de Castro E. et al.; UniProt Consortium. Annotation of biologically relevant ligands in UniProtKB using ChEBI. Bioinformatics 2023;39:btac793. PubMed PMC

Davies M, Nowotka M, Papadatos G. et al. ChEMBL web services: streamlining access to drug discovery data and utilities. Nucleic Acids Res 2015;43:W612–20. PubMed PMC

DCMI Usage Board. DCMI Metadata Terms. 2020. http://dublincore.org/specifications/dublin-core/dcmi-terms/2020-01-20/.

Djoumbou Feunang Y, Eisner R, Knox C. et al. ClassyFire: automated chemical classification with a comprehensive, computable taxonomy. J Cheminform 2016;8:61. PubMed PMC

Dumontier M, Baker CJ, Baran J. et al. The Semanticscience Integrated Ontology (SIO) for biomedical research and knowledge discovery. J Biomed Semantics 2014;5:14. PubMed PMC

Fu G, Batchelor C, Dumontier M. et al. PubChemRDF: towards the semantic annotation of PubChem compound and substance databases. J Cheminform 2015;7:34. PubMed PMC

Galgonek J, Vondrášek J.. IDSM ChemWebRDF: SPARQLing small-molecule datasets. J Cheminform 2021;13:38. PubMed PMC

Harris S, Seaborne A.. SPARQL 1.1 Query Language. World Wide Web Consortium, 2013. https://www.w3.org/TR/2013/REC-sparql11-query-20130321/.

Hastings J, Chepelev L, Willighagen E. et al. The chemical information ontology: provenance and disambiguation for chemical data on the biological semantic web. PLoS One 2011;6:e25513. PubMed PMC

Hastings J, Owen G, Dekker A. et al. ChEBI in 2016: improved services and an expanding collection of metabolites. Nucleic Acids Res 2016;44:D1214–9. PubMed PMC

Heller SR, McNaught A, Pletnev I. et al. InChI, the IUPAC international chemical identifier. J Cheminform 2015;7:23. PubMed PMC

Huber F, Verhoeven S, Meijer C. et al. matchms – processing and similarity evaluation of mass spectrometry data. JOSS 2020;5:2411.

Iannella R, McKinney J.. vCard Ontology – for describing People and Organizations. World Wide Web Consortium, 2014. https://www.w3.org/TR/2014/NOTE-vcard-rdf-20140522/.

Jackson R, Matentzoglu N, Overton JA. et al. OBO foundry in 2021: operationalizing open data principles to evaluate ontologies. Database (Oxford) 2021;2021:baab069. PubMed PMC

Jackson RC, Balhoff JP, Douglass E. et al. ROBOT: a tool for automating ontology workflows. BMC Bioinform 2019;20:407. PubMed PMC

Kratochvíl M, Vondrášek J, Galgonek J.. Sachem: a chemical cartridge for high-performance substructure search. J Cheminform 2018;10:27. PubMed PMC

Kratochvíl M, Vondrášek J, Galgonek J.. Interoperable chemical structure search service. J Cheminform 2019;11:45. PubMed PMC

Martens L, Chambers M, Sturm M. et al. mzML – a community standard for mass spectrometry data. Mol Cell Proteomics 2011;10:R110. PubMed PMC

Mayer G, Montecchi-Palazzi L, Ovelleiro D. et al.; HUPO-PSI Group. The HUPO proteomics standards initiative – mass spectrometry controlled vocabulary. Database (Oxford) 2013;2013:bat009. PubMed PMC

Miles A, Bechhofer S.. SKOS Simple Knowledge Organization System Reference. World Wide Web Consortium, 2009. https://www.w3.org/TR/2009/REC-skos-reference-20090818/.

Ong E, Xiang Z, Zhao B. et al. Ontobee: a linked ontology data server to support ontology term dereferencing, linkage, query and integration. Nucleic Acids Res 2017;45:D347–52. PubMed PMC

Rijgersberg H, Wigham M, Top JL.. How semantics can improve engineering processes: a case of units of measure and quantities. Adv Eng Inform 2011;25:276–87.

Rogers FB. Medical subject headings. Bull Med Libr Assoc 1963;51:114–6. PubMed PMC

Rutz A, Sorokina M, Galgonek J. et al. The LOTUS initiative for open knowledge management in natural products research. Elife 2022;11:e70780. PubMed PMC

Schreiber G, Raimond Y.. RDF 1.1 Primer. World Wide Web Consortium, 2014. https://www.w3.org/TR/2014/NOTE-rdf11-primer-20140624/.

SIB Swiss Institute of Bioinformatics RDF Group Members. The SIB Swiss Institute of Bioinformatics Semantic Web of data. Nucleic Acids Res 2024;52(D1):D44–D51. PubMed PMC

Whetzel PL, Noy NF, Shah NH. et al. BioPortal: enhanced functionality via new web services from the National Center for Biomedical Ontology to access and use ontologies in software applications. Nucleic Acids Res 2011;39:W541–5. PubMed PMC

Yamamoto Y, Yamaguchi A, Splendiani A.. YummyData: providing high-quality open life science data. Database (Oxford) 2018;2018:bay022. PubMed PMC

Najít záznam

Citační ukazatele

Nahrávání dat ...

    Možnosti archivace