New telomere to telomere assembly of human chromosome 8 reveals a previous underestimation of G-quadruplex forming sequences and inverted repeats
V. Brázda, N. Bohálová, RP. Bowater
Language: English. Country: Netherlands
Document type: Journal Article
MeSH
- G-Quadruplexes *
- Genome, Human
- Sequence Inversion *
- Humans
- Chromosomes, Human, Pair 8 *
- Sequence Analysis, DNA
- Telomere *
Check Tag
- Humans
Publication type
- Journal Article
Taking advantage of evolving and improving sequencing methods, human chromosome 8 is now available as a gapless, end-to-end assembly. Thanks to advances in long-read sequencing technologies, its centromere, telomeres, duplicated gene families and repeat-rich regions are now fully sequenced. We were interested to assess if the new assembly altered our understanding of the potential impact of non-B DNA structures within this completed chromosome sequence. It has been shown that non-B secondary structures, such as G-quadruplexes, hairpins and cruciforms, have important regulatory functions and potential as targeted therapeutics. Therefore, we analysed the presence of putative G-quadruplex forming sequences and inverted repeats in the current human reference genome (GRCh38) and in the new end-to-end assembly of chromosome 8. The comparison revealed that the new assembly contains significantly more inverted repeats and G-quadruplex forming sequences compared to the current reference sequence. This observation can be explained by improved accuracy of the new sequencing methods, particularly in regions that contain extensive repeats of bases, as is preferred by many non-B DNA structures. These results show a significant underestimation of the prevalence of non-B DNA secondary structure in previous assembly versions of the human genome and point to their importance being not fully appreciated. We anticipate that similar observations will occur as the improved sequencing technologies fill in gaps across the genomes of humans and other organisms.
- 000
- 00000naa a2200000 a 4500
- 001
- bmc22011055
- 003
- CZ-PrNML
- 005
- 20220506125916.0
- 007
- ta
- 008
- 220425s2022 ne f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1016/j.gene.2021.146058 $2 doi
- 035 __
- $a (PubMed)34737002
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a ne
- 100 1_
- $a Brázda, Václav $u Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, Brno 612 65, Czech Republic. Electronic address: vaclav@ibp.cz
- 245 10
- $a New telomere to telomere assembly of human chromosome 8 reveals a previous underestimation of G-quadruplex forming sequences and inverted repeats / $c V. Brázda, N. Bohálová, RP. Bowater
- 520 9_
- $a Taking advantage of evolving and improving sequencing methods, human chromosome 8 is now available as a gapless, end-to-end assembly. Thanks to advances in long-read sequencing technologies, its centromere, telomeres, duplicated gene families and repeat-rich regions are now fully sequenced. We were interested to assess if the new assembly altered our understanding of the potential impact of non-B DNA structures within this completed chromosome sequence. It has been shown that non-B secondary structures, such as G-quadruplexes, hairpins and cruciforms, have important regulatory functions and potential as targeted therapeutics. Therefore, we analysed the presence of putative G-quadruplex forming sequences and inverted repeats in the current human reference genome (GRCh38) and in the new end-to-end assembly of chromosome 8. The comparison revealed that the new assembly contains significantly more inverted repeats and G-quadruplex forming sequences compared to the current reference sequence. This observation can be explained by improved accuracy of the new sequencing methods, particularly in regions that contain extensive repeats of bases, as is preferred by many non-B DNA structures. These results show a significant underestimation of the prevalence of non-B DNA secondary structure in previous assembly versions of the human genome and point to their importance being not fully appreciated. We anticipate that similar observations will occur as the improved sequencing technologies fill in gaps across the genomes of humans and other organisms.
- 650 12
- $a lidské chromozomy, pár 8 $7 D002898
- 650 12
- $a G-kvadruplexy $7 D054856
- 650 _2
- $a genom lidský $7 D015894
- 650 _2
- $a lidé $7 D006801
- 650 _2
- $a sekvenční analýza DNA $7 D017422
- 650 12
- $a inverze sekvence $7 D057345
- 650 12
- $a telomery $7 D016615
- 655 _2
- $a časopisecké články $7 D016428
- 700 1_
- $a Bohálová, Natália $u Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, Brno 612 65, Czech Republic; Department of Experimental Biology, Faculty of Science, Masaryk University, Kamenice 5, Brno 62500, Czech Republic
- 700 1_
- $a Bowater, Richard P $u School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, United Kingdom. Electronic address: R.Bowater@uea.ac.uk
- 773 0_
- $w MED00001888 $t Gene $x 1879-0038 $g Roč. 810, č. - (2022), s. 146058
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/34737002 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y p $z 0
- 990 __
- $a 20220425 $b ABA008
- 991 __
- $a 20220506125908 $b ABA008
- 999 __
- $a ok $b bmc $g 1788913 $s 1162253
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2022 $b 810 $c - $d 146058 $e 20211101 $i 1879-0038 $m Gene $n Gene $x MED00001888
- LZP __
- $a Pubmed-20220425