
New telomere to telomere assembly of human chromosome 8 reveals a previous underestimation of G-quadruplex forming sequences and inverted repeats

V. Brázda, N. Bohálová, RP. Bowater

Gene. 2022 ; 810 (-) : 146058. [pub] 20211101

Language: English. Country: Netherlands

Document type: journal article

Persistent link: https://www.medvik.cz/link/bmc22011055

Taking advantage of evolving and improving sequencing methods, human chromosome 8 is now available as a gapless, end-to-end assembly. Thanks to advances in long-read sequencing technologies, its centromere, telomeres, duplicated gene families and repeat-rich regions are now fully sequenced. We were interested to assess if the new assembly altered our understanding of the potential impact of non-B DNA structures within this completed chromosome sequence. It has been shown that non-B secondary structures, such as G-quadruplexes, hairpins and cruciforms, have important regulatory functions and potential as targeted therapeutics. Therefore, we analysed the presence of putative G-quadruplex forming sequences and inverted repeats in the current human reference genome (GRCh38) and in the new end-to-end assembly of chromosome 8. The comparison revealed that the new assembly contains significantly more inverted repeats and G-quadruplex forming sequences compared to the current reference sequence. This observation can be explained by improved accuracy of the new sequencing methods, particularly in regions that contain extensive repeats of bases, as is preferred by many non-B DNA structures. These results show a significant underestimation of the prevalence of non-B DNA secondary structure in previous assembly versions of the human genome and point to their importance being not fully appreciated. We anticipate that similar observations will occur as the improved sequencing technologies fill in gaps across the genomes of humans and other organisms.
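The abstract does not name the tools or thresholds used for the analysis, so the following is only an illustrative sketch of how putative G-quadruplex forming sequences and inverted repeats can be located in a DNA string. The `G3+N1-7` consensus pattern, the exact-match arm comparison, the function names, and the default `arm` and `max_spacer` values are all assumptions made for this example, not the authors' method.

```python
import re

# Assumption: the widely used "G3+ N1-7" consensus for putative quadruplexes.
# The study itself may have used a different scoring approach.
G4_PATTERN = re.compile(r"(?:G{3,}[ACGT]{1,7}){3}G{3,}")

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_g4_motifs(seq: str) -> list:
    """Return (start, motif) for non-overlapping matches on the given strand."""
    return [(m.start(), m.group()) for m in G4_PATTERN.finditer(seq.upper())]


def find_inverted_repeats(seq: str, arm: int = 6, max_spacer: int = 4) -> list:
    """Return (start, left_arm, spacer_length) for exact inverted repeats.

    An inverted repeat here is a left arm followed, after at most
    `max_spacer` bases, by its exact reverse complement (hypothetical,
    simplified criterion: no mismatches are tolerated).
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2 * arm + 1):
        left = seq[i:i + arm]
        target = reverse_complement(left)
        for spacer in range(max_spacer + 1):
            j = i + arm + spacer
            if j + arm > len(seq):
                break
            if seq[j:j + arm] == target:
                hits.append((i, left, spacer))
    return hits


if __name__ == "__main__":
    print(find_g4_motifs("TTGGGAGGGTGGGAGGGTT"))
    print(find_inverted_repeats("TTACCGAAACGGTTT", arm=4))
```

Only the given strand is scanned for G-rich motifs; calling `find_g4_motifs(reverse_complement(seq))` would cover the opposite strand. Counting hits from such functions on two assemblies of the same chromosome is one simple way to compare them, in the spirit of the comparison described above.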


000      
00000naa a2200000 a 4500
001      
bmc22011055
003      
CZ-PrNML
005      
20220506125916.0
007      
ta
008      
220425s2022 ne f 000 0|eng||
009      
AR
024    7_
$a 10.1016/j.gene.2021.146058 $2 doi
035    __
$a (PubMed)34737002
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a ne
100    1_
$a Brázda, Václav $u Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, Brno 612 65, Czech Republic. Electronic address: vaclav@ibp.cz
245    10
$a New telomere to telomere assembly of human chromosome 8 reveals a previous underestimation of G-quadruplex forming sequences and inverted repeats / $c V. Brázda, N. Bohálová, RP. Bowater
520    9_
$a Taking advantage of evolving and improving sequencing methods, human chromosome 8 is now available as a gapless, end-to-end assembly. Thanks to advances in long-read sequencing technologies, its centromere, telomeres, duplicated gene families and repeat-rich regions are now fully sequenced. We were interested to assess if the new assembly altered our understanding of the potential impact of non-B DNA structures within this completed chromosome sequence. It has been shown that non-B secondary structures, such as G-quadruplexes, hairpins and cruciforms, have important regulatory functions and potential as targeted therapeutics. Therefore, we analysed the presence of putative G-quadruplex forming sequences and inverted repeats in the current human reference genome (GRCh38) and in the new end-to-end assembly of chromosome 8. The comparison revealed that the new assembly contains significantly more inverted repeats and G-quadruplex forming sequences compared to the current reference sequence. This observation can be explained by improved accuracy of the new sequencing methods, particularly in regions that contain extensive repeats of bases, as is preferred by many non-B DNA structures. These results show a significant underestimation of the prevalence of non-B DNA secondary structure in previous assembly versions of the human genome and point to their importance being not fully appreciated. We anticipate that similar observations will occur as the improved sequencing technologies fill in gaps across the genomes of humans and other organisms.
650    12
$a Chromosomes, Human, Pair 8 $7 D002898
650    12
$a G-Quadruplexes $7 D054856
650    _2
$a Genome, Human $7 D015894
650    _2
$a Humans $7 D006801
650    _2
$a Sequence Analysis, DNA $7 D017422
650    12
$a Sequence Inversion $7 D057345
650    12
$a Telomere $7 D016615
655    _2
$a Journal Article $7 D016428
700    1_
$a Bohálová, Natália $u Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, Brno 612 65, Czech Republic; Department of Experimental Biology, Faculty of Science, Masaryk University, Kamenice 5, Brno 62500, Czech Republic
700    1_
$a Bowater, Richard P $u School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, United Kingdom. Electronic address: R.Bowater@uea.ac.uk
773    0_
$w MED00001888 $t Gene $x 1879-0038 $g Vol. 810, No. - (2022), p. 146058
856    41
$u https://pubmed.ncbi.nlm.nih.gov/34737002 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y p $z 0
990    __
$a 20220425 $b ABA008
991    __
$a 20220506125908 $b ABA008
999    __
$a ok $b bmc $g 1788913 $s 1162253
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2022 $b 810 $c - $d 146058 $e 20211101 $i 1879-0038 $m Gene $n Gene $x MED00001888
LZP    __
$a Pubmed-20220425
