Batch alignment via retention orders for preprocessing large-scale multi-batch LC-MS experiments
Language English Country Great Britain, England Media print
Document type Journal Article, Research Support, Non-U.S. Gov't
Grant support
RVO 68378050
Czech Academy of Sciences (CAS)
CZ.02.1.01/0.0/0.0/16_013/0001789
Czech Centre for Phenogenomics provided by the Ministry of Education, Youth and Sports of the Czech Republic (MEYS)
Upgrade of the Czech Centre for Phenogenomics: developing towards translation research by MEYS and European Structural Investment Funds (ESIF)
PubMed
35748696
DOI
10.1093/bioinformatics/btac407
PII: 6617343
Knihovny.cz E-resources
- MeSH
- Algorithms MeSH
- Chromatography, Liquid methods MeSH
- Metabolomics MeSH
- Proteomics * methods MeSH
- Tandem Mass Spectrometry * methods MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
MOTIVATION: Meticulous selection of chromatographic peak detection parameters and algorithms is a crucial step in preprocessing liquid chromatography-mass spectrometry (LC-MS) data. However, as mass-to-charge ratio and retention time shifts are larger between batches than within batches, finding apt parameters for all samples of a large-scale multi-batch experiment with the aim of minimizing information loss becomes a challenging task. Preprocessing independent batches individually can curtail said problems but requires a method for aligning and combining them for further downstream analysis. RESULTS: We present two methods for aligning and combining individually preprocessed batches in multi-batch LC-MS experiments. Our developed methods were tested on six sets of simulated and six sets of real datasets. Furthermore, by estimating the probabilities of peak insertion, deletion and swap between batches in authentic datasets, we demonstrate that retention order swaps are not rare in untargeted LC-MS data. AVAILABILITY AND IMPLEMENTATION: kmersAlignment and rtcorrectedAlignment algorithms are made available as an R package with raw data at https://metabocombiner.img.cas.cz. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
References provided by Crossref.org