• Je něco špatně v tomto záznamu ?

Improving Illumina assemblies with Hi-C and long reads: An example with the North African dromedary

JP. Elbers, MF. Rogers, PL. Perelman, AA. Proskuryakova, NA. Serdyukova, WE. Johnson, P. Horin, J. Corander, D. Murphy, PA. Burger,

. 2019 ; 19 (4) : 1015-1026. [pub] 20190517

Jazyk angličtina Země Anglie, Velká Británie

Typ dokumentu časopisecké články

Perzistentní odkaz   https://www.medvik.cz/link/bmc19034641

Grantová podpora
16-14-10009 Russian Science Foundation
17-00-00146 Russian Foundation for Basic Research
RPG-2017-287 Leverhulme Trust
P29623-B25 Austrian Science Fund

Researchers have assembled thousands of eukaryotic genomes using Illumina reads, but traditional mate-pair libraries cannot span all repetitive elements, resulting in highly fragmented assemblies. However, both chromosome conformation capture techniques, such as Hi-C and Dovetail Genomics Chicago libraries and long-read sequencing, such as Pacific Biosciences and Oxford Nanopore, help span and resolve repetitive regions and therefore improve genome assemblies. One important livestock species of arid regions that does not have a high-quality contiguous reference genome is the dromedary (Camelus dromedarius). Draft genomes exist but are highly fragmented, and a high-quality reference genome is needed to understand adaptation to desert environments and artificial selection during domestication. Dromedaries are among the last livestock species to have been domesticated, and together with wild and domestic Bactrian camels, they are the only representatives of the Camelini tribe, which highlights their evolutionary significance. Here we describe our efforts to improve the North African dromedary genome. We used Chicago and Hi-C sequencing libraries from Dovetail Genomics to resolve the order of previously assembled contigs, producing almost chromosome-level scaffolds. Remaining gaps were filled with Pacific Biosciences long reads, and then scaffolds were comparatively mapped to chromosomes. Long reads added 99.32 Mbp to the total length of the new assembly. Dovetail Chicago and Hi-C libraries increased the longest scaffold over 12-fold, from 9.71 Mbp to 124.99 Mbp and the scaffold N50 over 50-fold, from 1.48 Mbp to 75.02 Mbp. We demonstrate that Illumina de novo assemblies can be substantially upgraded by combining chromosome conformation capture and long-read sequencing.

000      
00000naa a2200000 a 4500
001      
bmc19034641
003      
CZ-PrNML
005      
20191017091126.0
007      
ta
008      
191007s2019 enk f 000 0|eng||
009      
AR
024    7_
$a 10.1111/1755-0998.13020 $2 doi
035    __
$a (PubMed)30972949
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a enk
100    1_
$a Elbers, Jean P $u Department of Integrative Biology and Evolution, Research Institute of Wildlife Ecology, Vetmeduni Vienna, Vienna, Austria.
245    10
$a Improving Illumina assemblies with Hi-C and long reads: An example with the North African dromedary / $c JP. Elbers, MF. Rogers, PL. Perelman, AA. Proskuryakova, NA. Serdyukova, WE. Johnson, P. Horin, J. Corander, D. Murphy, PA. Burger,
520    9_
$a Researchers have assembled thousands of eukaryotic genomes using Illumina reads, but traditional mate-pair libraries cannot span all repetitive elements, resulting in highly fragmented assemblies. However, both chromosome conformation capture techniques, such as Hi-C and Dovetail Genomics Chicago libraries and long-read sequencing, such as Pacific Biosciences and Oxford Nanopore, help span and resolve repetitive regions and therefore improve genome assemblies. One important livestock species of arid regions that does not have a high-quality contiguous reference genome is the dromedary (Camelus dromedarius). Draft genomes exist but are highly fragmented, and a high-quality reference genome is needed to understand adaptation to desert environments and artificial selection during domestication. Dromedaries are among the last livestock species to have been domesticated, and together with wild and domestic Bactrian camels, they are the only representatives of the Camelini tribe, which highlights their evolutionary significance. Here we describe our efforts to improve the North African dromedary genome. We used Chicago and Hi-C sequencing libraries from Dovetail Genomics to resolve the order of previously assembled contigs, producing almost chromosome-level scaffolds. Remaining gaps were filled with Pacific Biosciences long reads, and then scaffolds were comparatively mapped to chromosomes. Long reads added 99.32 Mbp to the total length of the new assembly. Dovetail Chicago and Hi-C libraries increased the longest scaffold over 12-fold, from 9.71 Mbp to 124.99 Mbp and the scaffold N50 over 50-fold, from 1.48 Mbp to 75.02 Mbp. We demonstrate that Illumina de novo assemblies can be substantially upgraded by combining chromosome conformation capture and long-read sequencing.
650    _2
$a zvířata $7 D000818
650    _2
$a velbloudi $x genetika $7 D002162
650    _2
$a výpočetní biologie $x metody $7 D019295
650    _2
$a pouštní klima $7 D003889
650    12
$a genom $7 D016678
650    _2
$a genomika $x metody $7 D023281
650    _2
$a sekvenční analýza DNA $x metody $7 D017422
655    _2
$a časopisecké články $7 D016428
700    1_
$a Rogers, Mark F $u Intelligent Systems Laboratory, University of Bristol, Bristol, UK.
700    1_
$a Perelman, Polina L $u Institute of Molecular and Cellular Biology, SB RAS and Novosibirsk State University, Novosibirsk, Russia.
700    1_
$a Proskuryakova, Anastasia A $u Institute of Molecular and Cellular Biology, SB RAS and Novosibirsk State University, Novosibirsk, Russia.
700    1_
$a Serdyukova, Natalia A $u Institute of Molecular and Cellular Biology, SB RAS and Novosibirsk State University, Novosibirsk, Russia.
700    1_
$a Johnson, Warren E $u The Walter Reed Biosystematics Unit, Smithsonian Institution, Museum Support Center MRC-534, Suitland, Maryland.
700    1_
$a Horin, Petr $u Department of Animal Genetics, Faculty of Veterinary Medicine, Ceitec VFU, RG Animal Immunogenomics, University of Veterinary and Pharmaceutical Sciences, Brno, Czech Republic.
700    1_
$a Corander, Jukka $u Department of Biostatistics, University of Oslo, Oslo, Norway. Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland.
700    1_
$a Murphy, David $u Bristol Medical School: Translational Health Sciences, Molecular Neuroendocrinology Research Group, University of Bristol, Bristol, UK.
700    1_
$a Burger, Pamela A $u Department of Integrative Biology and Evolution, Research Institute of Wildlife Ecology, Vetmeduni Vienna, Vienna, Austria.
773    0_
$w MED00180393 $t Molecular ecology resources $x 1755-0998 $g Roč. 19, č. 4 (2019), s. 1015-1026
856    41
$u https://pubmed.ncbi.nlm.nih.gov/30972949 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y a $z 0
990    __
$a 20191007 $b ABA008
991    __
$a 20191017091554 $b ABA008
999    __
$a ok $b bmc $g 1451301 $s 1073191
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2019 $b 19 $c 4 $d 1015-1026 $e 20190517 $i 1755-0998 $m Molecular ecology resources $n Mol. ecol. resour. $x MED00180393
GRA    __
$a 16-14-10009 $p Russian Science Foundation
GRA    __
$a 17-00-00146 $p Russian Foundation for Basic Research
GRA    __
$a RPG-2017-287 $p Leverhulme Trust
GRA    __
$a P29623-B25 $p Austrian Science Fund
LZP    __
$a Pubmed-20191007

Najít záznam

Citační ukazatele

Nahrávání dat...

Možnosti archivace

Nahrávání dat...