-
Je něco špatně v tomto záznamu ?
Improving Illumina assemblies with Hi-C and long reads: An example with the North African dromedary
JP. Elbers, MF. Rogers, PL. Perelman, AA. Proskuryakova, NA. Serdyukova, WE. Johnson, P. Horin, J. Corander, D. Murphy, PA. Burger,
Jazyk angličtina Země Anglie, Velká Británie
Typ dokumentu časopisecké články
Grantová podpora
16-14-10009
Russian Science Foundation
17-00-00146
Russian Foundation for Basic Research
RPG-2017-287
Leverhulme Trust
P29623-B25
Austrian Science Fund
PubMed
30972949
DOI
10.1111/1755-0998.13020
Knihovny.cz E-zdroje
- MeSH
- genom * MeSH
- genomika metody MeSH
- pouštní klima MeSH
- sekvenční analýza DNA metody MeSH
- velbloudi genetika MeSH
- výpočetní biologie metody MeSH
- zvířata MeSH
- Check Tag
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
Researchers have assembled thousands of eukaryotic genomes using Illumina reads, but traditional mate-pair libraries cannot span all repetitive elements, resulting in highly fragmented assemblies. However, both chromosome conformation capture techniques, such as Hi-C and Dovetail Genomics Chicago libraries and long-read sequencing, such as Pacific Biosciences and Oxford Nanopore, help span and resolve repetitive regions and therefore improve genome assemblies. One important livestock species of arid regions that does not have a high-quality contiguous reference genome is the dromedary (Camelus dromedarius). Draft genomes exist but are highly fragmented, and a high-quality reference genome is needed to understand adaptation to desert environments and artificial selection during domestication. Dromedaries are among the last livestock species to have been domesticated, and together with wild and domestic Bactrian camels, they are the only representatives of the Camelini tribe, which highlights their evolutionary significance. Here we describe our efforts to improve the North African dromedary genome. We used Chicago and Hi-C sequencing libraries from Dovetail Genomics to resolve the order of previously assembled contigs, producing almost chromosome-level scaffolds. Remaining gaps were filled with Pacific Biosciences long reads, and then scaffolds were comparatively mapped to chromosomes. Long reads added 99.32 Mbp to the total length of the new assembly. Dovetail Chicago and Hi-C libraries increased the longest scaffold over 12-fold, from 9.71 Mbp to 124.99 Mbp and the scaffold N50 over 50-fold, from 1.48 Mbp to 75.02 Mbp. We demonstrate that Illumina de novo assemblies can be substantially upgraded by combining chromosome conformation capture and long-read sequencing.
Citace poskytuje Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc19034641
- 003
- CZ-PrNML
- 005
- 20191017091126.0
- 007
- ta
- 008
- 191007s2019 enk f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1111/1755-0998.13020 $2 doi
- 035 __
- $a (PubMed)30972949
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a enk
- 100 1_
- $a Elbers, Jean P $u Department of Integrative Biology and Evolution, Research Institute of Wildlife Ecology, Vetmeduni Vienna, Vienna, Austria.
- 245 10
- $a Improving Illumina assemblies with Hi-C and long reads: An example with the North African dromedary / $c JP. Elbers, MF. Rogers, PL. Perelman, AA. Proskuryakova, NA. Serdyukova, WE. Johnson, P. Horin, J. Corander, D. Murphy, PA. Burger,
- 520 9_
- $a Researchers have assembled thousands of eukaryotic genomes using Illumina reads, but traditional mate-pair libraries cannot span all repetitive elements, resulting in highly fragmented assemblies. However, both chromosome conformation capture techniques, such as Hi-C and Dovetail Genomics Chicago libraries and long-read sequencing, such as Pacific Biosciences and Oxford Nanopore, help span and resolve repetitive regions and therefore improve genome assemblies. One important livestock species of arid regions that does not have a high-quality contiguous reference genome is the dromedary (Camelus dromedarius). Draft genomes exist but are highly fragmented, and a high-quality reference genome is needed to understand adaptation to desert environments and artificial selection during domestication. Dromedaries are among the last livestock species to have been domesticated, and together with wild and domestic Bactrian camels, they are the only representatives of the Camelini tribe, which highlights their evolutionary significance. Here we describe our efforts to improve the North African dromedary genome. We used Chicago and Hi-C sequencing libraries from Dovetail Genomics to resolve the order of previously assembled contigs, producing almost chromosome-level scaffolds. Remaining gaps were filled with Pacific Biosciences long reads, and then scaffolds were comparatively mapped to chromosomes. Long reads added 99.32 Mbp to the total length of the new assembly. Dovetail Chicago and Hi-C libraries increased the longest scaffold over 12-fold, from 9.71 Mbp to 124.99 Mbp and the scaffold N50 over 50-fold, from 1.48 Mbp to 75.02 Mbp. We demonstrate that Illumina de novo assemblies can be substantially upgraded by combining chromosome conformation capture and long-read sequencing.
- 650 _2
- $a zvířata $7 D000818
- 650 _2
- $a velbloudi $x genetika $7 D002162
- 650 _2
- $a výpočetní biologie $x metody $7 D019295
- 650 _2
- $a pouštní klima $7 D003889
- 650 12
- $a genom $7 D016678
- 650 _2
- $a genomika $x metody $7 D023281
- 650 _2
- $a sekvenční analýza DNA $x metody $7 D017422
- 655 _2
- $a časopisecké články $7 D016428
- 700 1_
- $a Rogers, Mark F $u Intelligent Systems Laboratory, University of Bristol, Bristol, UK.
- 700 1_
- $a Perelman, Polina L $u Institute of Molecular and Cellular Biology, SB RAS and Novosibirsk State University, Novosibirsk, Russia.
- 700 1_
- $a Proskuryakova, Anastasia A $u Institute of Molecular and Cellular Biology, SB RAS and Novosibirsk State University, Novosibirsk, Russia.
- 700 1_
- $a Serdyukova, Natalia A $u Institute of Molecular and Cellular Biology, SB RAS and Novosibirsk State University, Novosibirsk, Russia.
- 700 1_
- $a Johnson, Warren E $u The Walter Reed Biosystematics Unit, Smithsonian Institution, Museum Support Center MRC-534, Suitland, Maryland.
- 700 1_
- $a Horin, Petr $u Department of Animal Genetics, Faculty of Veterinary Medicine, Ceitec VFU, RG Animal Immunogenomics, University of Veterinary and Pharmaceutical Sciences, Brno, Czech Republic.
- 700 1_
- $a Corander, Jukka $u Department of Biostatistics, University of Oslo, Oslo, Norway. Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland.
- 700 1_
- $a Murphy, David $u Bristol Medical School: Translational Health Sciences, Molecular Neuroendocrinology Research Group, University of Bristol, Bristol, UK.
- 700 1_
- $a Burger, Pamela A $u Department of Integrative Biology and Evolution, Research Institute of Wildlife Ecology, Vetmeduni Vienna, Vienna, Austria.
- 773 0_
- $w MED00180393 $t Molecular ecology resources $x 1755-0998 $g Roč. 19, č. 4 (2019), s. 1015-1026
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/30972949 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20191007 $b ABA008
- 991 __
- $a 20191017091554 $b ABA008
- 999 __
- $a ok $b bmc $g 1451301 $s 1073191
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2019 $b 19 $c 4 $d 1015-1026 $e 20190517 $i 1755-0998 $m Molecular ecology resources $n Mol. ecol. resour. $x MED00180393
- GRA __
- $a 16-14-10009 $p Russian Science Foundation
- GRA __
- $a 17-00-00146 $p Russian Foundation for Basic Research
- GRA __
- $a RPG-2017-287 $p Leverhulme Trust
- GRA __
- $a P29623-B25 $p Austrian Science Fund
- LZP __
- $a Pubmed-20191007