Mass spectrometry-based lipidomics and metabolomics generate extensive data sets that, along with metadata such as clinical parameters, require specific data exploration skills to identify and visualize statistically significant trends and biologically relevant differences. Besides tailored methods developed by individual labs, a solid core of freely accessible tools exists for exploratory data analysis and visualization, which we have compiled here, including preparation of descriptive statistics, annotated box plots, hypothesis testing, volcano plots, lipid maps and fatty acyl chain plots, unsupervised and supervised dimensionality reduction, dendrograms, and heat maps. This review is intended for those who would like to develop their skills in data analysis and visualization using freely available R or Python solutions. Beginners are guided through a selection of R and Python libraries for producing publication-ready graphics without being overwhelmed by the code complexity. This manuscript, along with associated GitBook code repository containing step-by-step instructions, offers readers a comprehensive guide, encouraging the application of R and Python for robust and reproducible chemometric analysis of omics data.
- MeSH
- Mass Spectrometry MeSH
- Humans MeSH
- Lipidomics * methods MeSH
- Metabolomics * methods MeSH
- Programming Languages MeSH
- Software * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Review MeSH
In this community effort, we compare measurements between 34 laboratories from 19 countries, utilizing mixtures of labelled authentic synthetic standards, to quantify by mass spectrometry four clinically used ceramide species in the NIST (National Institute of Standards and Technology) human blood plasma Standard Reference Material (SRM) 1950, as well as a set of candidate plasma reference materials (RM 8231). Participants either utilized a provided validated method and/or their method of choice. Mean concentration values, and intra- and inter-laboratory coefficients of variation (CV) were calculated using single-point and multi-point calibrations, respectively. These results are the most precise (intra-laboratory CVs ≤ 4.2%) and concordant (inter-laboratory CVs < 14%) community-derived absolute concentration values reported to date for four clinically used ceramides in the commonly analyzed SRM 1950. We demonstrate that calibration using authentic labelled standards dramatically reduces data variability. Furthermore, we show how the use of shared RM can correct systematic quantitative biases and help in harmonizing lipidomics. Collectively, the results from the present study provide a significant knowledge base for translation of lipidomic technologies to future clinical applications that might require the determination of reference intervals (RIs) in various human populations or might need to estimate reference change values (RCV), when analytical variability is a key factor for recall during multiple testing of individuals.
- MeSH
- Ceramides * blood MeSH
- Mass Spectrometry methods MeSH
- Calibration MeSH
- Laboratories * standards MeSH
- Humans MeSH
- Lipidomics methods MeSH
- Reference Standards * MeSH
- Reproducibility of Results MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Ceramides * MeSH
Pancreatic cancer has the worst prognosis among all cancers. Cancer screening of body fluids may improve the survival time prognosis of patients, who are often diagnosed too late at an incurable stage. Several studies report the dysregulation of lipid metabolism in tumor cells, suggesting that changes in the blood lipidome may accompany tumor growth. Here we show that the comprehensive mass spectrometric determination of a wide range of serum lipids reveals statistically significant differences between pancreatic cancer patients and healthy controls, as visualized by multivariate data analysis. Three phases of biomarker discovery research (discovery, qualification, and verification) are applied for 830 samples in total, which shows the dysregulation of some very long chain sphingomyelins, ceramides, and (lyso)phosphatidylcholines. The sensitivity and specificity to diagnose pancreatic cancer are over 90%, which outperforms CA 19-9, especially at an early stage, and is comparable to established diagnostic imaging methods. Furthermore, selected lipid species indicate a potential as prognostic biomarkers.
- MeSH
- CA-19-9 Antigen blood MeSH
- Ceramides blood MeSH
- Humans MeSH
- Lipidomics methods MeSH
- Lysophosphatidylcholines blood MeSH
- Lipid Metabolism genetics MeSH
- Multivariate Analysis MeSH
- Biomarkers, Tumor blood genetics MeSH
- Pancreatic Neoplasms blood diagnosis mortality pathology MeSH
- Proportional Hazards Models MeSH
- Sensitivity and Specificity MeSH
- Sphingomyelins blood MeSH
- Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization MeSH
- Case-Control Studies MeSH
- Check Tag
- Humans MeSH
- Male MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- CA-19-9 Antigen MeSH
- Ceramides MeSH
- Lysophosphatidylcholines MeSH
- Biomarkers, Tumor MeSH
- Sphingomyelins MeSH