A universal language for finding mass spectrometry data patterns
Language: English. Country: United States. Medium: print-electronic
Document type: journal articles
Grant support
R35 GM155026
NIGMS NIH HHS - United States
R01 GM107550
NIGMS NIH HHS - United States
R01 GM155383
NIGMS NIH HHS - United States
U24 DK141185
NIDDK NIH HHS - United States
R35 GM128690
NIGMS NIH HHS - United States
R21 AI156669
NIAID NIH HHS - United States
R35 GM146934
NIGMS NIH HHS - United States
R01 GM125943
NIGMS NIH HHS - United States
R03 OD034493
NIH HHS - United States
R15 AI137996
NIAID NIH HHS - United States
U2C DK119886
NIDDK NIH HHS - United States
U24 DK133658
NIDDK NIH HHS - United States
PubMed
40355727
PubMed Central
PMC12334354
DOI
10.1038/s41592-025-02660-z
PII: 10.1038/s41592-025-02660-z
- MeSH
- data mining * methods MeSH
- mass spectrometry * methods MeSH
- humans MeSH
- metabolomics * methods MeSH
- programming languages * MeSH
- software * MeSH
- Check Tag
- humans MeSH
- Publication type
- journal article MeSH
Despite being information rich, the vast majority of untargeted mass spectrometry data are underutilized; most analytes are not used for downstream interpretation or reanalysis after publication. The inability to dive into these rich raw mass spectrometry datasets is due to the limited flexibility and scalability of existing software tools. Here we introduce a new language, the Mass Spectrometry Query Language (MassQL), and an accompanying software ecosystem that addresses these issues by enabling the community to directly query mass spectrometry data with an expressive set of user-defined mass spectrometry patterns. Illustrated by real-world examples, MassQL provides a data-driven definition of chemical diversity by enabling the reanalysis of all public untargeted metabolomics data, empowering scientists across many disciplines to make new discoveries. MassQL has been widely implemented in multiple open-source and commercial mass spectrometry analysis tools, which enhances the ability, interoperability and reproducibility of mining of mass spectrometry data for the research community.
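To make the idea of a "user-defined mass spectrometry pattern" concrete, the sketch below shows the kind of filter such a query expresses: keep MS/MS scans that contain a product ion near a target m/z at or above a minimum relative intensity. It is an illustration only, not the MassQL implementation. The `Spectrum` class, the helper names and the toy peak lists are hypothetical, and the query string in the comment is written from memory of the public MassQL documentation and should be checked against it.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Illustrative query text only (syntax assumed from the MassQL documentation):
#   QUERY scaninfo(MS2DATA) WHERE MS2PROD=226.18:TOLERANCEMZ=0.01:INTENSITYPERCENT=5


@dataclass
class Spectrum:
    """Hypothetical minimal container for one scan."""
    scan: int
    ms_level: int
    peaks: List[Tuple[float, float]]  # (m/z, intensity) pairs


def has_product_ion(spectrum: Spectrum, target_mz: float,
                    tol_mz: float = 0.01,
                    min_intensity_percent: float = 0.0) -> bool:
    """Return True if an MS2 scan has a peak within tol_mz of target_mz
    whose intensity is at least min_intensity_percent of the base peak."""
    if spectrum.ms_level != 2 or not spectrum.peaks:
        return False
    base_peak = max(intensity for _, intensity in spectrum.peaks)
    if base_peak <= 0:
        return False
    for mz, intensity in spectrum.peaks:
        close_enough = abs(mz - target_mz) <= tol_mz
        intense_enough = 100.0 * intensity / base_peak >= min_intensity_percent
        if close_enough and intense_enough:
            return True
    return False


def query_scans(spectra: List[Spectrum], target_mz: float,
                tol_mz: float = 0.01,
                min_intensity_percent: float = 0.0) -> List[int]:
    """Return the scan numbers of all spectra that match the pattern."""
    return [s.scan for s in spectra
            if has_product_ion(s, target_mz, tol_mz, min_intensity_percent)]


# Toy data, invented for illustration.
spectra = [
    Spectrum(1, 1, [(300.10, 1000.0)]),                    # MS1 scan, ignored
    Spectrum(2, 2, [(226.182, 500.0), (150.05, 1000.0)]),  # matches
    Spectrum(3, 2, [(226.30, 800.0), (99.0, 200.0)]),      # outside m/z tolerance
    Spectrum(4, 2, [(226.178, 10.0), (180.0, 1000.0)]),    # below intensity cutoff
]

matching = query_scans(spectra, 226.18, tol_mz=0.01, min_intensity_percent=5.0)
print(matching)
```

A query language lets such conditions be stated declaratively and combined, rather than hand-coded per analysis as done here.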
Bioinformatics Group Wageningen University and Research Wageningen the Netherlands
Biologicals and Natural Products Crop Protection R and D Corteva Agrisciences Indianapolis IN USA
BioMolecular Sciences School of Pharmacy University of Mississippi Oxford MS USA
Center for Urban Waters University of Washington Tacoma WA USA
Chemistry and Chemical Biology Northeastern University Boston MA USA
Clinical Biomarkers Laboratory School of Medicine Emory University Atlanta GA USA
College of Pharmacy Sookmyung Women's University Seoul Republic of Korea
College of Pharmacy University of Rhode Island Kingston RI USA
Crop Protection R and D Corteva Agrisciences Indianapolis IN USA
Data Science and Bioinformatics Corteva Agrisciences Dublin OH USA
Department of Biochemistry University of California Riverside Riverside CA USA
Department of Biochemistry University of Johannesburg Johannesburg South Africa
Department of Bioengineering University of California San Diego La Jolla CA USA
Department of BioMolecular Sciences School of Pharmacy University of Mississippi Oxford MS USA
Department of Biotechnology and Biomedicine Technical University of Denmark Kongens Lyngby Denmark
Department of Chemistry and Biochemistry San Diego State University San Diego CA USA
Department of Chemistry and Biochemistry UC Santa Cruz Santa Cruz CA USA
Department of Chemistry and Biochemistry University of Arizona Tucson AZ USA
Department of Chemistry and Biochemistry University of Denver Denver CO USA
Department of Chemistry BMC Science for Life Laboratory Uppsala University Uppsala Sweden
Department of Chemistry Case Western Reserve University Cleveland OH USA
Department of Computer Science University of California Riverside Riverside CA USA
Department of Fundamental Chemistry Institute of Chemistry University of São Paulo São Paulo Brazil
Department of Medicinal Chemistry College of Pharmacy University of Michigan Ann Arbor MI USA
Department of Pharmacy University of Marburg Marburg Germany
Environmental Genomics and Systems Biology Division Lawrence Berkeley National Lab Berkeley CA USA
Faculty of Chemistry Institute of Exact and Natural Science Federal University of Para Belem Brazil
Functional Metabolomics Lab CMFI Cluster of Excellence University of Tuebingen Tuebingen Germany
Institute for Biomedicine Eurac Research Bolzano Italy
Institute of Inorganic and Analytical Chemistry University of Münster Münster Germany
Institute of Pharmaceutical Biology Goethe University Frankfurt Frankfurt Germany
Institute of Pharmaceutical Biology University of Bonn Bonn Germany
Institute of Pharmacy Freie Universität Berlin Berlin Germany
Natural Products Discovery Core Life Sciences Institute University of Michigan Ann Arbor MI USA
Pharmacognosy Department Faculty of Pharmacy Cairo University Cairo Egypt
Pharmacognosy Faculty of Pharmacy Al Azhar University Nasr City Egypt
RIKEN Center for Integrative Medical Sciences Tsurumi ku Japan
RIKEN Center for Sustainable Resource Science Tsurumi ku Japan
School of Chemistry and Biochemistry Georgia Institute of Technology Atlanta GA USA
The Joint Genome Institute Lawrence Berkeley National Lab Berkeley CA USA
West Coast Metabolomics Center University of California Davis Davis CA USA