Detail
Článek
Článek online
FT
Medvik - BMČ
  • Je něco špatně v tomto záznamu ?

Bioinformatic pipelines for whole transcriptome sequencing data exploitation in leukemia patients with complex structural variants

J. Hynst, K. Plevova, L. Radova, V. Bystry, K. Pal, S. Pospisilova,

. 2019 ; 7 (-) : e7071. [pub] 20190612

Jazyk angličtina Země Spojené státy americké

Typ dokumentu časopisecké články

Perzistentní odkaz   https://www.medvik.cz/link/bmc19028857

Grantová podpora
NV15-31834A MZ0 CEP - Centrální evidence projektů

Background: Extensive genome rearrangements, known as chromothripsis, have been recently identified in several cancer types. Chromothripsis leads to complex structural variants (cSVs) causing aberrant gene expression and the formation of de novo fusion genes, which can trigger cancer development, or worsen its clinical course. The functional impact of cSVs can be studied at the RNA level using whole transcriptome sequencing (total RNA-Seq). It represents a powerful tool for discovering, profiling, and quantifying changes of gene expression in the overall genomic context. However, bioinformatic analysis of transcriptomic data, especially in cases with cSVs, is a complex and challenging task, and the development of proper bioinformatic tools for transcriptome studies is necessary. Methods: We designed a bioinformatic workflow for the analysis of total RNA-Seq data consisting of two separate parts (pipelines): The first pipeline incorporates a statistical solution for differential gene expression analysis in a biologically heterogeneous sample set. We utilized results from transcriptomic arrays which were carried out in parallel to increase the precision of the analysis. The second pipeline is used for the identification of de novo fusion genes. Special attention was given to the filtering of false positives (FPs), which was achieved through consensus fusion calling with several fusion gene callers. We applied the workflow to the data obtained from ten patients with chronic lymphocytic leukemia (CLL) to describe the consequences of their cSVs in detail. The fusion genes identified by our pipeline were correlated with genomic break-points detected by genomic arrays. Results: We set up a novel solution for differential gene expression analysis of individual samples and de novo fusion gene detection from total RNA-Seq data. The results of the differential gene expression analysis were concordant with results obtained by transcriptomic arrays, which demonstrates the analytical capabilities of our method. We also showed that the consensus fusion gene detection approach was able to identify true positives (TPs) efficiently. Detected coordinates of fusion gene junctions were in concordance with genomic breakpoints assessed using genomic arrays. Discussion: Byapplying our methods to real clinical samples, we proved that our approach for total RNA-Seq data analysis generates results consistent with other genomic analytical techniques. The data obtained by our analyses provided clues for the study of the biological consequences of cSVs with far-reaching implications for clinical outcome and management of cancer patients. The bioinformatic workflow is also widely applicable for addressing other research questions in different contexts, for which transcriptomic data are generated.

Citace poskytuje Crossref.org

000      
00000naa a2200000 a 4500
001      
bmc19028857
003      
CZ-PrNML
005      
20210310131927.0
007      
ta
008      
190813s2019 xxu f 000 0|eng||
009      
AR
024    7_
$a 10.7717/peerj.7071 $2 doi
035    __
$a (PubMed)31223530
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a xxu
100    1_
$a Hynst, Jakub $u Central European Institute of Technology, Masaryk University, Brno, Czech Republic. Department of Internal Medicine-Hematology and Oncology, Faculty of Medicine, Masaryk University, Brno, Czech Republic.
245    10
$a Bioinformatic pipelines for whole transcriptome sequencing data exploitation in leukemia patients with complex structural variants / $c J. Hynst, K. Plevova, L. Radova, V. Bystry, K. Pal, S. Pospisilova,
520    9_
$a Background: Extensive genome rearrangements, known as chromothripsis, have been recently identified in several cancer types. Chromothripsis leads to complex structural variants (cSVs) causing aberrant gene expression and the formation of de novo fusion genes, which can trigger cancer development, or worsen its clinical course. The functional impact of cSVs can be studied at the RNA level using whole transcriptome sequencing (total RNA-Seq). It represents a powerful tool for discovering, profiling, and quantifying changes of gene expression in the overall genomic context. However, bioinformatic analysis of transcriptomic data, especially in cases with cSVs, is a complex and challenging task, and the development of proper bioinformatic tools for transcriptome studies is necessary. Methods: We designed a bioinformatic workflow for the analysis of total RNA-Seq data consisting of two separate parts (pipelines): The first pipeline incorporates a statistical solution for differential gene expression analysis in a biologically heterogeneous sample set. We utilized results from transcriptomic arrays which were carried out in parallel to increase the precision of the analysis. The second pipeline is used for the identification of de novo fusion genes. Special attention was given to the filtering of false positives (FPs), which was achieved through consensus fusion calling with several fusion gene callers. We applied the workflow to the data obtained from ten patients with chronic lymphocytic leukemia (CLL) to describe the consequences of their cSVs in detail. The fusion genes identified by our pipeline were correlated with genomic break-points detected by genomic arrays. Results: We set up a novel solution for differential gene expression analysis of individual samples and de novo fusion gene detection from total RNA-Seq data. The results of the differential gene expression analysis were concordant with results obtained by transcriptomic arrays, which demonstrates the analytical capabilities of our method. We also showed that the consensus fusion gene detection approach was able to identify true positives (TPs) efficiently. Detected coordinates of fusion gene junctions were in concordance with genomic breakpoints assessed using genomic arrays. Discussion: Byapplying our methods to real clinical samples, we proved that our approach for total RNA-Seq data analysis generates results consistent with other genomic analytical techniques. The data obtained by our analyses provided clues for the study of the biological consequences of cSVs with far-reaching implications for clinical outcome and management of cancer patients. The bioinformatic workflow is also widely applicable for addressing other research questions in different contexts, for which transcriptomic data are generated.
655    _2
$a časopisecké články $7 D016428
700    1_
$a Plevova, Karla $u Central European Institute of Technology, Masaryk University, Brno, Czech Republic. Department of Internal Medicine-Hematology and Oncology, Faculty of Medicine, Masaryk University, Brno, Czech Republic. Department of Internal Medicine-Hematology and Oncology, University Hospital Brno, Brno, Czech Republic.
700    1_
$a Radova, Lenka $u Central European Institute of Technology, Masaryk University, Brno, Czech Republic.
700    1_
$a Bystry, Vojtech $u Central European Institute of Technology, Masaryk University, Brno, Czech Republic.
700    1_
$a Pal, Karol $u Central European Institute of Technology, Masaryk University, Brno, Czech Republic. Department of Internal Medicine-Hematology and Oncology, Faculty of Medicine, Masaryk University, Brno, Czech Republic.
700    1_
$a Pospisilova, Sarka $u Central European Institute of Technology, Masaryk University, Brno, Czech Republic. Department of Internal Medicine-Hematology and Oncology, Faculty of Medicine, Masaryk University, Brno, Czech Republic. Department of Internal Medicine-Hematology and Oncology, University Hospital Brno, Brno, Czech Republic.
773    0_
$w MED00184567 $t PeerJ $x 2167-8359 $g Roč. 7, č. - (2019), s. e7071
856    41
$u https://pubmed.ncbi.nlm.nih.gov/31223530 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y a $z 0
990    __
$a 20190813 $b ABA008
991    __
$a 20210310131923 $b ABA008
999    __
$a ind $b bmc $g 1434006 $s 1067317
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2019 $b 7 $c - $d e7071 $e 20190612 $i 2167-8359 $m PeerJ $n PeerJ $x MED00184567
GRA    __
$a NV15-31834A $p MZ0
LZP    __
$a Pubmed-20190813

Najít záznam

Citační ukazatele

Nahrávání dat ...

Možnosti archivace

Nahrávání dat ...