The Dark Matter of Large Cereal Genomes: Long Tandem Repeats

V. Kapustová, Z. Tulpová, H. Toegelová, P. Novák, J. Macas, M. Karafiátová, E. Hřibová, J. Doležel, H. Šimková

International journal of molecular sciences. 2019; 20 (10). [pub] 20190520

Language: English; Country: Switzerland

Document type: journal articles

Persistent link   https://www.medvik.cz/link/bmc19044796

Grant support
17-17564S Grantová Agentura České Republiky
CZ.02.1.01/0.0/0.0/16_019/0000827 European Regional Development Fund

Reference genomes of important cereals, including barley, emmer wheat and bread wheat, were released recently. Their comparison with genome size estimates obtained by flow cytometry indicated that the assemblies represent not more than 88-98% of the complete genome. This work is aimed at identifying the missing parts in two cereal genomes and proposing techniques to make the assemblies more complete. We focused on tandemly organised repetitive sequences, known to be underrepresented in genome assemblies generated from short-read sequence data. Our study found arrays of three tandem repeats with unit sizes of 1242 to 2726 bp present in the bread wheat reference genome generated from short reads. However, this and another wheat genome assembly employing long PacBio reads failed in integrating correctly the 2726-bp repeat in the pseudomolecule context. This suggests that tandem repeats of this size, frequently incorporated in unassigned scaffolds, may contribute to shrinking of pseudomolecules without reducing size of the entire assembly. We demonstrate how this missing information may be added to the pseudomolecules with the aid of nanopore sequencing of individual BAC clones and optical mapping. Using the latter technique, we identified and localised a 470-kb long array of 45S ribosomal DNA absent from the reference genome of barley.


000      
00000naa a2200000 a 4500
001      
bmc19044796
003      
CZ-PrNML
005      
20200113081323.0
007      
ta
008      
200109s2019 sz f 000 0|eng||
009      
AR
024    7_
$a 10.3390/ijms20102483 $2 doi
035    __
$a (PubMed)31137466
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a sz
100    1_
$a Kapustová, Veronika $u Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-78371 Olomouc, Czech Republic. kapustova@ueb.cas.cz.
245    14
$a The Dark Matter of Large Cereal Genomes: Long Tandem Repeats / $c V. Kapustová, Z. Tulpová, H. Toegelová, P. Novák, J. Macas, M. Karafiátová, E. Hřibová, J. Doležel, H. Šimková,
520    9_
$a Reference genomes of important cereals, including barley, emmer wheat and bread wheat, were released recently. Their comparison with genome size estimates obtained by flow cytometry indicated that the assemblies represent not more than 88-98% of the complete genome. This work is aimed at identifying the missing parts in two cereal genomes and proposing techniques to make the assemblies more complete. We focused on tandemly organised repetitive sequences, known to be underrepresented in genome assemblies generated from short-read sequence data. Our study found arrays of three tandem repeats with unit sizes of 1242 to 2726 bp present in the bread wheat reference genome generated from short reads. However, this and another wheat genome assembly employing long PacBio reads failed in integrating correctly the 2726-bp repeat in the pseudomolecule context. This suggests that tandem repeats of this size, frequently incorporated in unassigned scaffolds, may contribute to shrinking of pseudomolecules without reducing size of the entire assembly. We demonstrate how this missing information may be added to the pseudomolecules with the aid of nanopore sequencing of individual BAC clones and optical mapping. Using the latter technique, we identified and localised a 470-kb long array of 45S ribosomal DNA absent from the reference genome of barley.
650    _2
$a chromozomy rostlin $x genetika $7 D032461
650    12
$a genom rostlinný $7 D018745
650    _2
$a ječmen (rod) $x genetika $7 D001467
650    12
$a tandemové repetitivní sekvence $7 D020080
650    _2
$a pšenice $x genetika $7 D014908
655    _2
$a časopisecké články $7 D016428
700    1_
$a Tulpová, Zuzana $u Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-78371 Olomouc, Czech Republic. tulpova@ueb.cas.cz.
700    1_
$a Toegelová, Helena $u Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-78371 Olomouc, Czech Republic. toegelova@ueb.cas.cz.
700    1_
$a Novák, Petr $u Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, Branišovská 31, CZ-37005 České Budějovice, Czech Republic. petr@umbr.cas.cz.
700    1_
$a Macas, Jiří $u Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, Branišovská 31, CZ-37005 České Budějovice, Czech Republic. macas@umbr.cas.cz.
700    1_
$a Karafiátová, Miroslava $u Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-78371 Olomouc, Czech Republic. karafiatova@ueb.cas.cz.
700    1_
$a Hřibová, Eva $u Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-78371 Olomouc, Czech Republic. hribova@ueb.cas.cz.
700    1_
$a Doležel, Jaroslav $u Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-78371 Olomouc, Czech Republic. dolezel@ueb.cas.cz.
700    1_
$a Šimková, Hana $u Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-78371 Olomouc, Czech Republic. simkovah@ueb.cas.cz.
773    0_
$w MED00176142 $t International journal of molecular sciences $x 1422-0067 $g Roč. 20, č. 10 (2019)
856    41
$u https://pubmed.ncbi.nlm.nih.gov/31137466 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y a $z 0
990    __
$a 20200109 $b ABA008
991    __
$a 20200113081654 $b ABA008
999    __
$a ok $b bmc $g 1483065 $s 1083469
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2019 $b 20 $c 10 $e 20190520 $i 1422-0067 $m International journal of molecular sciences $n Int J Mol Sci $x MED00176142
GRA    __
$a 17-17564S $p Grantová Agentura České Republiky
GRA    __
$a CZ.02.1.01/0.0/0.0/16_019/0000827 $p European Regional Development Fund
LZP    __
$a Pubmed-20200109
