Current classifications (World Health Organization-HAEM5/ICC) define up to 26 molecular B-cell precursor acute lymphoblastic leukemia (BCP-ALL) disease subtypes by genomic driver aberrations and corresponding gene expression signatures. Identification of driver aberrations by transcriptome sequencing (RNA-Seq) is well established, while systematic approaches for gene expression analysis are less advanced. Therefore, we developed ALLCatchR, a machine learning-based classifier using RNA-Seq gene expression data to allocate BCP-ALL samples to all 21 gene expression-defined molecular subtypes. Trained on n = 1869 transcriptome profiles with established subtype definitions (4 cohorts; 55% pediatric / 45% adult), ALLCatchR allowed subtype allocation in 3 independent hold-out cohorts (n = 1018; 75% pediatric / 25% adult) with 95.7% accuracy (averaged sensitivity across subtypes: 91.1% / specificity: 99.8%). High-confidence predictions were achieved in 83.7% of samples with 98.9% accuracy. Only 1.2% of samples remained unclassified. ALLCatchR outperformed existing tools and identified novel driver candidates in previously unassigned samples. Additional modules provided predictions of sample blast counts, patient sex, and immunophenotype, allowing imputation where this information is missing. We established a novel RNA-Seq reference of human B-lymphopoiesis using 7 FACS-sorted progenitor stages from healthy bone marrow donors. Implementation in ALLCatchR enabled projection of BCP-ALL samples onto this trajectory. This identified shared proximity patterns of BCP-ALL subtypes to normal lymphopoiesis stages, extending immunophenotypic classifications with a novel framework for developmental comparisons of BCP-ALL. ALLCatchR enables routine application of RNA-Seq for BCP-ALL diagnostics, with systematic gene expression analysis for accurate subtype allocation and novel insights into underlying developmental trajectories.
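The abstract reports overall accuracy together with sensitivity and specificity averaged across subtypes. As a minimal sketch of how such figures can be derived from subtype calls, the Python below computes one-vs-rest counts per subtype and averages them with equal weight per subtype. This is an illustration only and not the ALLCatchR implementation: the function names, the toy labels, and the choice of unweighted (macro) averaging are assumptions, since the abstract does not state how the averaging was done.

```python
def one_vs_rest_counts(y_true, y_pred, label):
    """Count TP, FN, FP, TN for one subtype against all others."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == label and p == label for t, p in pairs)
    fn = sum(t == label and p != label for t, p in pairs)
    fp = sum(t != label and p == label for t, p in pairs)
    tn = len(pairs) - tp - fn - fp
    return tp, fn, fp, tn


def summarize(y_true, y_pred):
    """Overall accuracy plus unweighted mean sensitivity and specificity.

    Assumption: every subtype present in y_true contributes equally to
    the averages (macro averaging).
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equally long")

    sensitivities, specificities = [], []
    for label in sorted(set(y_true)):
        tp, fn, fp, tn = one_vs_rest_counts(y_true, y_pred, label)
        sensitivities.append(tp / (tp + fn))  # tp + fn > 0: label is in y_true
        if tn + fp:  # undefined when no sample belongs to another subtype
            specificities.append(tn / (tn + fp))

    accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    mean_spec = (
        sum(specificities) / len(specificities) if specificities else float("nan")
    )
    return {
        "accuracy": accuracy,
        "mean_sensitivity": sum(sensitivities) / len(sensitivities),
        "mean_specificity": mean_spec,
    }


if __name__ == "__main__":
    # Toy labels (hypothetical, not study data).
    truth = ["A", "A", "A", "B", "B", "C"]
    calls = ["A", "A", "B", "B", "B", "C"]
    print(summarize(truth, calls))
```

With many subtypes, most samples are true negatives for any single subtype, which is one reason averaged specificity tends to sit close to 100% even when averaged sensitivity is noticeably lower.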
- Publication type
- journal articles MeSH
INTRODUCTION: The malignant transformation leading to a maturation arrest in B-cell precursor acute lymphoblastic leukemia (BCP-ALL) occurs early in B-cell development, in a pro-B or pre-B cell, when somatic recombination of variable (V), diversity (D), and joining (J) segment immunoglobulin (IG) genes and the B-cell rescue mechanism of VH replacement might be ongoing or fully active, driving clonal evolution. In this study of newly diagnosed BCP-ALL, we sought to understand the mechanistic details of the oligoclonal composition of the leukemia at diagnosis, clonal evolution during follow-up, and clonal distribution in different hematopoietic compartments. METHODS: Utilizing high-throughput sequencing assays and bespoke bioinformatics, we identified BCP-ALL-derived clonally related IGH sequences by their shared 'DNJ-stem'. RESULTS: We introduce the concept of 'marker DNJ-stem' to cover all clonally related family members, including low-abundance ones. In a cohort of 280 adult patients with BCP-ALL, IGH clonal evolution at diagnosis was identified in one-third of patients. The phenomenon was linked to contemporaneous recombination and editing activity driven by aberrant ongoing DH/VH-DJH recombination and VH replacement, and we share insights and examples for both. Furthermore, in a subset of 167 patients with molecular subtype allocation, high prevalence and high degree of clonal evolution driven by ongoing DH/VH-DJH recombination were associated with the presence of KMT2A gene rearrangements, while VH replacements occurred more frequently in Ph-like and DUX4 BCP-ALL. Analysis of 46 matched diagnostic bone marrow and peripheral blood samples showed a comparable clonal and clonotypic distribution in both hematopoietic compartments, but the clonotypic composition changed markedly in longitudinal follow-up analysis in selected cases.
Finally, we present cases where the specific dynamics of clonal evolution have implications for both the initial marker identification and the MRD monitoring in follow-up samples. DISCUSSION: Consequently, we suggest following the marker DNJ-stem (capturing all family members) rather than specific clonotypes as the MRD target, and following both VDJH and DJH family members, since their respective kinetics are not always parallel. Our study further highlights the intricacy, importance, and present and future challenges of IGH clonal evolution in BCP-ALL.
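The recommendation to follow the marker DNJ-stem rather than individual clonotypes can be illustrated with a small aggregation sketch. The Python below is a hypothetical illustration and not the authors' pipeline: it assumes each clonotype has already been assigned a stem identifier upstream, and the `Clonotype` record, the `stem_abundance` function, the placeholder identifiers, and the use of read fractions as an abundance measure are all assumptions made for the example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Clonotype:
    clonotype_id: str  # hypothetical identifier of one IGH rearrangement
    dnj_stem: str      # hypothetical stem label assigned by an upstream step
    reads: int         # reads supporting this clonotype in the sample


def stem_abundance(clonotypes, total_reads):
    """Sum reads over all clonotypes sharing a stem, as a fraction of total reads."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    reads_by_stem = defaultdict(int)
    for clonotype in clonotypes:
        reads_by_stem[clonotype.dnj_stem] += clonotype.reads
    return {stem: reads / total_reads for stem, reads in reads_by_stem.items()}


if __name__ == "__main__":
    # Toy samples (hypothetical): two family members share STEM_1 at diagnosis.
    diagnosis = [
        Clonotype("VDJ_a", "STEM_1", 6000),
        Clonotype("VDJ_b", "STEM_1", 500),
        Clonotype("DJ_c", "STEM_2", 100),
    ]
    # At follow-up the dominant clonotype VDJ_a is gone, but a sibling persists.
    follow_up = [
        Clonotype("VDJ_b", "STEM_1", 30),
        Clonotype("DJ_d", "STEM_3", 5),
    ]
    print(stem_abundance(diagnosis, total_reads=10_000))
    print(stem_abundance(follow_up, total_reads=100_000))
```

In this toy follow-up sample, tracking only the dominant diagnostic clonotype `VDJ_a` would return nothing, whereas the stem-level sum still registers the family through `VDJ_b`. The read fractions are a simplification for illustration and are not a calibrated MRD quantification.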