PacBio Dotaz Zobrazit nápovědu
There is no consensus barcoding region for determination of arbuscular mycorrhizal fungal (AMF) taxa. To overcome this obstacle, we have developed an approach to sequence an AMF marker within the ribosome-encoding operon (rDNA) that covers all three widely applied variable molecular markers. Using a nested PCR approach specific to AMF, we amplified a part (c. 2.5 kb) of the rDNA spanning the majority of the small subunit rRNA (SSU) gene, the complete internal transcribed spacer (ITS) region and a part of the large subunit (LSU) rRNA gene. The PCR products were sequenced on the PacBio platform utilizing Single Molecule Real Time (SMRT) sequencing. Employing this method for selected environmental DNA samples, we were able to describe complex AMF communities consisting of various glomeromycotan lineages. We demonstrate the applicability of this new 2.5 kb approach to provide robust phylogenetic assignment of AMF lineages without known sequences from pure cultures and to consolidate information about AMF taxon distributions coming from three widely used barcoding regions into one integrative dataset.
- Klíčová slova
- Archaeosporales, Glomeromycota, PacBio, long-read metabarcoding, mycorrhizal fungi distribution, third-generation sequencing,
- MeSH
- DNA fungální genetika MeSH
- fylogeneze MeSH
- Glomeromycota * genetika MeSH
- houby genetika MeSH
- mykorhiza * genetika MeSH
- ribozomální DNA genetika MeSH
- sekvenční analýza DNA MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- DNA fungální MeSH
- ribozomální DNA MeSH
Duckweeds are small, free-floating, morphologically highly reduced organisms belonging to the monocot order Alismatales. They display the most rapid growth among flowering plants, vary ~ 14-fold in genome size and comprise five genera. Spirodela is the phylogenetically oldest genus with only two mainly asexually propagating species: S. polyrhiza (2n = 40; 160 Mbp/1C) and S. intermedia (2n = 36; 160 Mbp/1C). This study combined comparative cytogenetics and de novo genome assembly based on PacBio, Illumina and Oxford Nanopore (ON) reads to obtain the first genome reference for S. intermedia and to compare its genomic features with those of the sister species S. polyrhiza. Both species' genomes revealed little more than 20,000 putative protein-coding genes, very low rDNA copy numbers and a low amount of repetitive sequences, mainly Ty3/gypsy retroelements. The detection of a few new small chromosome rearrangements between both Spirodela species refined the karyotype and the chromosomal sequence assignment for S. intermedia.
Aethionema arabicum is an important model plant for Brassicaceae trait evolution, particularly of seed (development, regulation, germination, dormancy) and fruit (development, dehiscence mechanisms) characters. Its genome assembly was recently improved but the gene annotation was not updated. Here, we improved the Ae. arabicum gene annotation using 294 RNA-seq libraries and 136 307 full-length PacBio Iso-seq transcripts, increasing BUSCO completeness by 11.6% and featuring 5606 additional genes. Analysis of orthologs showed a lower number of genes in Ae. arabicum than in other Brassicaceae, which could be partially explained by loss of homeologs derived from the At-α polyploidization event and by a lower occurrence of tandem duplications after divergence of Aethionema from the other Brassicaceae. Benchmarking of MADS-box genes identified orthologs of FUL and AGL79 not found in previous versions. Analysis of full-length transcripts related to ABA-mediated seed dormancy discovered a conserved isoform of PIF6-β and antisense transcripts in ABI3, ABI4 and DOG1, among other cases found of different alternative splicing between Turkey and Cyprus ecotypes. The presented data allow alternative splicing mining and proposition of numerous hypotheses to research evolution and functional genomics. Annotation data and sequences are available at the Ae. arabicum DB (https://plantcode.online.uni-marburg.de/aetar_db).
- Klíčová slova
- Aethionema arabicum, Brassicaceae evolution, Iso-seq, alternative splicing, genome annotation, seed germination, transcription factors,
- MeSH
- Brassicaceae genetika metabolismus fyziologie MeSH
- genom rostlinný genetika MeSH
- klíčení genetika fyziologie MeSH
- regulace genové exprese u rostlin genetika fyziologie MeSH
- semena rostlinná genetika metabolismus fyziologie MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Pythium oligandrum is a soil born free living oomycete able to parasitize fungi and oomycetes prey, including important plant and animals pathogens. Pythium oligandrum can colonize endophytically the root tissues of diverse plants where it induces plant defenses. Here we report the first long-read genome sequencing of a P. oligandrum strain sequenced by PacBio technology. Sequencing of genomic DNA loaded onto six SMRT cells permitted the acquisition of 913,728 total reads resulting in 112X genome coverage. The assembly and polishing of the genome sequence yielded180 contigs (N50 = 1.3 Mb; L50 = 12). The size of the genome assembly is 41.9 Mb with a longest contig of 2.7 Mb and 15,007 predicted protein-coding genes among which 95.25% were supported by RNAseq data, thus constituting a new Pythium genome reference. This data will facilitate genomic comparisons of Pythium species that are commensal, beneficial or pathogenic on plant, or parasitic on fungi and oomycete to identify key genetic determinants underpinning their diverse lifestyles. In addition comparison with plant pathogenic or zoopathogenic species will illuminate genomic adaptations for pathogenesis toward widely diverse hosts.
- Klíčová slova
- Mycoparasitism, PacBio, Pythium oligandrum, RNAseq, genome,
- MeSH
- Beta vulgaris parazitologie MeSH
- genom MeSH
- proteom MeSH
- Pythium genetika metabolismus MeSH
- rhizosféra MeSH
- sekvenování transkriptomu MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- proteom MeSH
The cheetah (Acinonyx jubatus, SCHREBER 1775) is a large felid and is considered the fastest land animal. Historically, it inhabited open grassland across Africa, the Arabian Peninsula, and southwestern Asia; however, only small and fragmented populations remain today. Here, we present a de novo genome assembly of the cheetah based on PacBio continuous long reads and Hi-C proximity ligation data. The final assembly (VMU_Ajub_asm_v1.0) has a total length of 2.38 Gb, of which 99.7% are anchored into the expected 19 chromosome-scale scaffolds. The contig and scaffold N50 values of 96.8 Mb and 144.4 Mb, respectively, a BUSCO completeness of 95.4% and a k-mer completeness of 98.4%, emphasize the high quality of the assembly. Furthermore, annotation of the assembly identified 23,622 genes and a repeat content of 40.4%. This new highly contiguous and chromosome-scale assembly will greatly benefit conservation and evolutionary genomic analyses and will be a valuable resource, e.g., to gain a detailed understanding of the function and diversity of immune response genes in felids.
- Klíčová slova
- Felidae, Hi-C, PacBio, conservation genomics, proximity-ligation,
- MeSH
- Acinonyx * genetika MeSH
- anotace sekvence MeSH
- chromozomy genetika MeSH
- fylogeneze MeSH
- genom MeSH
- genomika MeSH
- zvířata MeSH
- Check Tag
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
BACKGROUND: Pikes represent an important genus (Esox) harbouring a pre-duplication karyotype (2n = 2x = 50) of economically important salmonid pseudopolyploids. Here, we have characterized the 5S ribosomal RNA genes (rDNA) in Esox lucius and its closely related E. cisalpinus using cytogenetic, molecular and genomic approaches. Intragenomic homogeneity and copy number estimation was carried out using Illumina reads. The higher-order structure of rDNA arrays was investigated by the analysis of long PacBio reads. Position of loci on chromosomes was determined by FISH. DNA methylation was analysed by methylation-sensitive restriction enzymes. RESULTS: The 5S rDNA loci occupy exclusively (peri)centromeric regions on 30-38 acrocentric chromosomes in both E. lucius and E. cisalpinus. The large number of loci is accompanied by extreme amplification of genes (>20,000 copies), which is to the best of our knowledge one of the highest copy number of rRNA genes in animals ever reported. Conserved secondary structures of predicted 5S rRNAs indicate that most of the amplified genes are potentially functional. Only few SNPs were found in genic regions indicating their high homogeneity while intergenic spacers were more heterogeneous and several families were identified. Analysis of 10-30 kb-long molecules sequenced by the PacBio technology (containing about 40% of total 5S rDNA) revealed that the vast majority (96%) of genes are organised in large several kilobase-long blocks. Dispersed genes or short tandems were less common (4%). The adjacent 5S blocks were directly linked, separated by intervening DNA and even inverted. The 5S units differing in the intergenic spacers formed both homogeneous and heterogeneous (mixed) blocks indicating variable degree of homogenisation between the loci. Both E. lucius and E. cisalpinus 5S rDNA was heavily methylated at CG dinucleotides. CONCLUSIONS: Extreme amplification of 5S rRNA genes in the Esox genome occurred in the absence of significant pseudogenisation suggesting its recent origin and/or intensive homogenisation processes. The dense methylation of units indicates that powerful epigenetic mechanisms have evolved in this group of fish to silence amplified genes. We discuss how the higher-order repeat structures impact on homogenisation of 5S rDNA in the genome.
- Klíčová slova
- Chromosome, Esox, Evolution, Fish, Single cell PacBio sequencing, rDNA,
- MeSH
- Esocidae genetika MeSH
- fylogeneze MeSH
- genetické lokusy genetika MeSH
- genomika * MeSH
- genová dávka MeSH
- heterochromatin metabolismus MeSH
- konzervovaná sekvence MeSH
- metylace DNA * MeSH
- ribozomální DNA genetika MeSH
- sekvence nukleotidů MeSH
- sekvenční analýza hybridizací s uspořádaným souborem oligonukleotidů MeSH
- zvířata MeSH
- Check Tag
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- heterochromatin MeSH
- ribozomální DNA MeSH
The first gapless, telomere-to-telomere (T2T) sequence assemblies of plant chromosomes were reported recently. However, sequence assemblies of most plant genomes remain fragmented. Only recent breakthroughs in accurate long-read sequencing have made it possible to achieve highly contiguous sequence assemblies with a few tens of contigs per chromosome, that is a number small enough to allow for a systematic inquiry into the causes of the remaining sequence gaps and the approaches and resources needed to close them. Here, we analyse sequence gaps in the current reference genome sequence of barley cv. Morex (MorexV3). Optical map and sequence raw data, complemented by ChIP-seq data for centromeric histone variant CENH3, were used to estimate the abundance of centromeric, ribosomal DNA, and subtelomeric repeats in the barley genome. These estimates were compared with copy numbers in the MorexV3 pseudomolecule sequence. We found that almost all centromeric sequences and 45S ribosomal DNA repeat arrays were absent from the MorexV3 pseudomolecules and that the majority of sequence gaps can be attributed to assembly breakdown in long stretches of satellite repeats. However, missing sequences cannot fully account for the difference between assembly size and flow cytometric genome size estimates. We discuss the prospects of gap closure with ultra-long sequence reads.
- Klíčová slova
- CenH3, Cereba, ChIP-seq, PacBio HiFi reads, flow cytometry, nanopore, ribosomal DNA, satellite, telomeric repeats,
- MeSH
- chromozomy rostlin genetika MeSH
- genom rostlinný genetika MeSH
- ječmen (rod) * genetika MeSH
- ribozomální DNA genetika MeSH
- sekvenční analýza DNA MeSH
- telomery genetika MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- ribozomální DNA MeSH
BACKGROUND: The invasive benthic round goby (Neogobius melanostomus) is the most successful temperate invasive fish and has spread in aquatic ecosystems on both sides of the Atlantic. Invasive species constitute powerful in situ experimental systems to study fast adaptation and directional selection on short ecological timescales and present promising case studies to understand factors involved the impressive ability of some species to colonize novel environments. We seize the unique opportunity presented by the round goby invasion to study genomic substrates potentially involved in colonization success. RESULTS: We report a highly contiguous long-read-based genome and analyze gene families that we hypothesize to relate to the ability of these fish to deal with novel environments. The analyses provide novel insights from the large evolutionary scale to the small species-specific scale. We describe expansions in specific cytochrome P450 enzymes, a remarkably diverse innate immune system, an ancient duplication in red light vision accompanied by red skin fluorescence, evolutionary patterns of epigenetic regulators, and the presence of osmoregulatory genes that may have contributed to the round goby's capacity to invade cold and salty waters. A recurring theme across all analyzed gene families is gene expansions. CONCLUSIONS: The expanded innate immune system of round goby may potentially contribute to its ability to colonize novel areas. Since other gene families also feature copy number expansions in the round goby, and since other Gobiidae also feature fascinating environmental adaptations and are excellent colonizers, further long-read genome approaches across the goby family may reveal whether gene copy number expansions are more generally related to the ability to conquer new habitats in Gobiidae or in fish.
- Klíčová slova
- Adaptation, Detoxification, Epigenetics, Evolution, Fish, Gene duplication, Genomics, Innate immunity, Invasive species, Neogobius melanostomus, Olfaction, Osmoregulation, PacBio, Vision,
- MeSH
- genom * MeSH
- ryby genetika fyziologie MeSH
- zavlečené druhy * MeSH
- zvířata MeSH
- zvláštnosti životní historie * MeSH
- Check Tag
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
Spiders are a hyperdiverse taxon and among the most abundant predators in nearly all terrestrial habitats. Their success is often attributed to key developments in their evolution such as silk and venom production and major apomorphies such as a whole-genome duplication. Resolving deep relationships within the spider tree of life has been historically challenging, making it difficult to measure the relative importance of these novelties for spider evolution. Whole-genome data offer an essential resource in these efforts, but also for functional genomic studies. Here, we present de novo assemblies for three spider species: Ryuthela nishihirai (Liphistiidae), a representative of the ancient Mesothelae, the suborder that is sister to all other extant spiders; Uloborus plumipes (Uloboridae), a cribellate orbweaver whose phylogenetic placement is especially challenging; and Cheiracanthium punctorium (Cheiracanthiidae), which represents only the second family to be sequenced in the hyperdiverse Dionycha clade. These genomes fill critical gaps in the spider tree of life. Using these novel genomes along with 25 previously published ones, we examine the evolutionary history of spidroin gene and structural hox cluster diversity. Our assemblies provide critical genomic resources to facilitate deeper investigations into spider evolution. The near chromosome-level genome of the 'living fossil' R. nishihirai represents an especially important step forward, offering new insights into the origins of spider traits.
- Klíčová slova
- Hi‐C, Mesothelae, assembly, chromosome, karyotype, spider silk,
- MeSH
- fylogeneze * MeSH
- genom genetika MeSH
- hedvábí genetika MeSH
- jedovatá zvířata MeSH
- pavouci * genetika klasifikace MeSH
- zvířata MeSH
- Check Tag
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- hedvábí MeSH
BACKGROUND: Freshwater ecosystems are inhabited by members of cosmopolitan bacterioplankton lineages despite the disconnected nature of these habitats. The lineages are delineated based on > 97% 16S rRNA gene sequence similarity, but their intra-lineage microdiversity and phylogeography, which are key to understanding the eco-evolutional processes behind their ubiquity, remain unresolved. Here, we applied long-read amplicon sequencing targeting nearly full-length 16S rRNA genes and the adjacent ribosomal internal transcribed spacer sequences to reveal the intra-lineage diversities of pelagic bacterioplankton assemblages in 11 deep freshwater lakes in Japan and Europe. RESULTS: Our single nucleotide-resolved analysis, which was validated using shotgun metagenomic sequencing, uncovered 7-101 amplicon sequence variants for each of the 11 predominant bacterial lineages and demonstrated sympatric, allopatric, and temporal microdiversities that could not be resolved through conventional approaches. Clusters of samples with similar intra-lineage population compositions were identified, which consistently supported genetic isolation between Japan and Europe. At a regional scale (up to hundreds of kilometers), dispersal between lakes was unlikely to be a limiting factor, and environmental factors or genetic drift were potential determinants of population composition. The extent of microdiversification varied among lineages, suggesting that highly diversified lineages (e.g., Iluma-A2 and acI-A1) achieve their ubiquity by containing a consortium of genotypes specific to each habitat, while less diversified lineages (e.g., CL500-11) may be ubiquitous due to a small number of widespread genotypes. The lowest extent of intra-lineage diversification was observed among the dominant hypolimnion-specific lineage (CL500-11), suggesting that their dispersal among lakes is not limited despite the hypolimnion being a more isolated habitat than the epilimnion. CONCLUSIONS: Our novel approach complemented the limited resolution of short-read amplicon sequencing and limited sensitivity of the metagenome assembly-based approach, and highlighted the complex ecological processes underlying the ubiquity of freshwater bacterioplankton lineages. To fully exploit the performance of the method, its relatively low read throughput is the major bottleneck to be overcome in the future. Video abstract.
- Klíčová slova
- Freshwater bacterioplankton, Long-read amplicon sequencing, Microdiversity, PacBio, Phylogeography, Ribosomal internal transcribed spacers,
- MeSH
- biodiverzita * MeSH
- fylogeneze MeSH
- fylogeografie * MeSH
- plankton klasifikace genetika izolace a purifikace MeSH
- RNA ribozomální 16S genetika MeSH
- sekvenční analýza DNA metody MeSH
- sladká voda * MeSH
- vodní organismy klasifikace genetika izolace a purifikace MeSH
- Publikační typ
- audiovizuální média MeSH
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Geografické názvy
- Evropa MeSH
- Japonsko MeSH
- Názvy látek
- RNA ribozomální 16S MeSH