Across the tree of life, DNA damage response (DDR) proteins play a pivotal, yet dichotomous role in organismal development and evolution. Here, we present a comprehensive analysis of 432 DDR proteins encoded by 68 genomes, including that of Nucleospora cyclopteri, an intranuclear microsporidian sequenced in this study. We compared the DDR proteins encoded by these genomes to those of humans to uncover the DNA repair-ome across phylogenetically distant eukaryotes. We also performed further analyses to understand if organismal complexity and lifestyle play a role in the evolution of DDR protein length and conserved domain architecture. We observed that the genomes of extreme parasites such as Paramicrocytos, Giardia, Spironucleus, and certain microsporidian lineages encode the smallest eukaryotic repertoire of DDR proteins and that pathways involved in modulation of nucleotide pools and nucleotide excision repair are the most preserved DDR pathways in the eukaryotic genomes analysed here. We found that DDR and DNA repair proteins are consistently longer than housekeeping and metabolic proteins. This is likely due to the higher number of physical protein-protein interactions in which DDR proteins are involved. We also found that although DNA repair proteins are generally longer than housekeeping proteins, their functional domains occupy a relatively smaller footprint. Notably, this pattern holds true across diverse organisms and shows no dependence on either lifestyle or mitochondrial status. Finally, we observed that unicellular organisms harbour proteins that are tenfold longer than their human homologues, with the extra amino acids forming interdomain regions with a clearly novel albeit undetermined function.
- MeSH
- Eukaryota * genetics MeSH
- Phylogeny MeSH
- Humans MeSH
- Microsporidia genetics MeSH
- Evolution, Molecular * MeSH
- DNA Repair * MeSH
- DNA Damage * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
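The "domain footprint" discussed in the DDR abstract above can be illustrated with a small calculation: the fraction of a protein's residues covered by at least one annotated domain. The following is a minimal sketch, not the study's method, which the abstract does not describe. The function name `domain_footprint`, the 1-based inclusive coordinate convention, and the example coordinates are all assumptions made for illustration.

```python
def domain_footprint(protein_length, domains):
    """Return the fraction of residues covered by at least one domain.

    protein_length: number of amino acids in the protein.
    domains: iterable of (start, end) pairs, 1-based and inclusive
             (an assumed convention; annotation tools differ).
    Overlapping domains are merged so no residue is counted twice.
    """
    if protein_length <= 0:
        raise ValueError("protein_length must be positive")

    covered = 0
    cur_start = cur_end = None
    for start, end in sorted(domains):
        # Clip each domain to the protein boundaries.
        start, end = max(start, 1), min(end, protein_length)
        if start > end:
            continue
        if cur_end is None or start > cur_end:
            # Close the previous merged interval and open a new one.
            if cur_end is not None:
                covered += cur_end - cur_start + 1
            cur_start, cur_end = start, end
        else:
            # Overlap: extend the current merged interval.
            cur_end = max(cur_end, end)
    if cur_end is not None:
        covered += cur_end - cur_start + 1
    return covered / protein_length


# Hypothetical coordinates, invented purely for illustration.
long_repair_protein = domain_footprint(1000, [(50, 149), (400, 499)])
short_housekeeping_protein = domain_footprint(300, [(10, 159)])
print(long_repair_protein, short_housekeeping_protein)
```

With these invented numbers, the longer protein has the smaller footprint, which is the kind of pattern the abstract reports; real values would come from domain annotations of the actual proteins.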
Diplonemids are among the most abundant and species-rich protists in the oceans. Marine heterotrophic flagellates, including diplonemids, have been suggested to play important roles in global biogeochemical cycles. Diplonemids are also the sister taxon of kinetoplastids, home to trypanosomatid parasites of global health importance, and thus are informative about the evolution of kinetoplastid biology. However, the genomic and cellular complement that underpins diplonemids' highly successful lifestyle is underexplored. At the same time, our framework describing cellular processes may not be as broadly applicable as presumed, as it is largely derived from animal and fungal model organisms, a small subset of extant eukaryotic diversity. In addition to uniquely evolved machinery in animals and fungi, there exist components with sporadic (i.e., "patchy") distributions across other eukaryotes. A most intriguing subset are components ("jötnarlogs") stochastically present in a wide range of eukaryotes but lost in animal and/or fungal models. Such components are considered exotic curiosities but may be relevant to inferences about the complexity of the last eukaryotic common ancestor (LECA) and frameworks of modern cell biology. Here, we use comparative genomics and phylogenetics to comprehensively assess the membrane-trafficking system of diplonemids. They possess several proteins thought of as kinetoplastid specific, as well as an extensive set of patchy proteins, including jötnarlogs. Diplonemids apparently function with endomembrane machinery distinct from existing cell biological models but comparable with other free-living heterotrophic protists, highlighting the importance of including such exotic components when considering different models of ancient eukaryotic genomic complexity and the cell biology of non-opisthokont organisms.
- MeSH
- Biological Evolution MeSH
- Phylogeny MeSH
- Kinetoplastida * physiology genetics MeSH
- Publication type
- Journal Article MeSH
Telomeres, essential for maintaining genomic stability, are typically preserved through the action of telomerase, a ribonucleoprotein complex that synthesizes telomeric DNA. One of its two core components, telomerase RNA (TR), serves as the template for this synthesis, and its evolution across different species is both complex and diverse. This review discusses recent advancements in understanding TR evolution, with a focus on plants (Viridiplantae). Utilizing novel bioinformatic tools and accumulating genomic and transcriptomic data, combined with corresponding experimental validation, researchers have begun to unravel the intricate pathways of TR evolution and telomere maintenance mechanisms. Contrary to previous beliefs, a monophyletic origin of TR has been demonstrated first in land plants and subsequently across the broader phylogenetic megagroup Diaphoretickes. Conversely, the discovery of plant-type TRs in insects challenges assumptions about the monophyletic origin of TRs in animals, suggesting evolutionary innovations coinciding with arthropod divergence. The review also highlights key challenges in TR identification and provides examples of how these have been addressed. Overall, this work underscores the importance of expanding beyond model organisms to comprehend the full complexity of telomerase evolution, with potential applications in agriculture and biotechnology.
- MeSH
- Phylogeny MeSH
- Evolution, Molecular * MeSH
- RNA * genetics metabolism MeSH
- Plants genetics MeSH
- Telomerase * genetics metabolism MeSH
- Telomere * metabolism genetics MeSH
- Viridiplantae genetics metabolism MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Review MeSH
Satellite DNAs (satDNAs) are abundant components of eukaryotic genomes, playing pivotal roles in chromosomal organization, genome stability, and evolution. Here, we combined cytogenetic and genomic methods to characterize the satDNAs in the genomes of Leptidea butterflies. Leptidea is characterized by a high heterochromatin content, large genomes, and extensive chromosomal reshuffling as well as the occurrence of cryptic species. We show that, in contrast to other Lepidoptera, satDNAs constitute a considerable proportion of Leptidea genomes, ranging between 4.11% and 11.05%. This amplification of satDNAs, together with the hyperactivity of transposable elements, contributes to the substantial genome expansion in Leptidea. Using chromosomal mapping, we show that satDNAs, particularly LepSat01-100 and LepSat03-167, are preferentially localized in heterochromatin and exhibit a variable distribution that may have contributed to the highly diverse karyotypes within the genus. The satDNAs also exhibit W-chromosome accumulation, suggesting their involvement in sex chromosome evolution. Our results provide insights into the dynamics of satDNAs in Lepidoptera genomes and highlight their role in genome expansion and chromosomal organization, which could influence the speciation process. The high proportion of repetitive DNAs in the genomes of Leptidea underscores the complex evolutionary dynamics and reveals the interplay between repetitive DNAs and genomic architecture in the genus.
- MeSH
- Phylogeny MeSH
- Genome, Insect * MeSH
- Heterochromatin genetics MeSH
- Karyotype * MeSH
- Chromosome Mapping MeSH
- Evolution, Molecular * MeSH
- Butterflies * genetics MeSH
- DNA, Satellite * genetics MeSH
- DNA Transposable Elements MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
Transmission of genetic material from one generation to the next is a fundamental feature of all living cells. In eukaryotes, a macromolecular complex called the kinetochore plays crucial roles during chromosome segregation by linking chromosomes to spindle microtubules. Little is known about this process in evolutionarily diverse protists. Within the supergroup Discoba, Euglenozoa forms a speciose group of unicellular flagellates: kinetoplastids, euglenids, and diplonemids. Kinetoplastids have an unconventional kinetochore system, while euglenids have subunits that are conserved among most eukaryotes. For diplonemids, a group of extremely diverse and abundant marine flagellates, it remains unclear what kind of kinetochores are present. Here, we employed deep homology detection protocols using profile-versus-profile Hidden Markov Model searches and AlphaFold-based structural comparisons to detect homologies that might have been previously missed. Interestingly, we still could not detect orthologs for most of the kinetoplastid or canonical kinetochore subunits, with few exceptions including a putative centromere-specific histone H3 variant (cenH3/CENP-A), the spindle checkpoint protein Mad2, the chromosomal passenger complex members Aurora and INCENP, and broadly conserved proteins like CLK kinase and the meiotic synaptonemal complex proteins SYCP2/3 that also function at kinetoplastid kinetochores. We examined the localization of five candidate kinetochore-associated proteins in the model diplonemid, Paradiplonema papillatum. PpCENP-A shows discrete dots in the nucleus, implying that it is likely a kinetochore component. PpMad2, PpCLK(KKT10/19), PpSYCP2L1(KKT17/18), and PpINCENP reside in the nucleus, but no clear kinetochore localization was observed. Altogether, these results point to the possibility that diplonemids evolved a hitherto unknown type of kinetochore system.
IMPORTANCE: A macromolecular assembly called the kinetochore is essential for the segregation of genetic material during eukaryotic cell division. Therefore, characterization of kinetochores across species is essential for understanding the mechanisms involved in this key process across the eukaryotic tree of life. In particular, little is known about kinetochores in divergent protists such as Euglenozoa, a group of unicellular flagellates that includes kinetoplastids, euglenids, and diplonemids, the latter being a highly diverse and abundant component of marine plankton. While kinetoplastids have an unconventional kinetochore system and euglenids have a canonical one similar to traditional model eukaryotes, preliminary searches detected neither unconventional nor canonical kinetochore components in diplonemids. Here, we employed state-of-the-art deep homology detection protocols but still could not detect orthologs for the bulk of kinetoplastid-specific or canonical kinetochore proteins in diplonemids, except for a putative centromere-specific histone H3 variant. Our results suggest that diplonemids evolved kinetochores that do not resemble previously known ones.
The genomic signature of an organism captures the characteristics of repeated oligonucleotide patterns in its genome [1], such as oligomer frequencies, GC content, and differences in codon usage. Viruses, however, are obligate intracellular parasites that are dependent on their host cells for replication, and information about genomic signatures in viruses has hitherto been sparse. Here, we investigate the presence and specificity of genomic signatures in 2,768 eukaryotic viral species from 105 viral families, aiming to illuminate dependencies and selective pressures in viral genome evolution. We demonstrate that most viruses have highly specific genomic signatures that often also differ significantly between species within the same family. The species-specificity is most prominent among dsDNA viruses and viruses with large genomes. We also reveal consistent dissimilarities between viral genomic signatures and those of their host cells, although some viruses present slight similarities, which may be explained by genetic adaptation to their native hosts. Our results suggest that significant evolutionary selection pressures act upon viral genomes to shape and preserve their genomic signatures, which may have implications for the field of synthetic biology in the construction of live attenuated vaccines and viral vectors.
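Two of the signature components named in the abstract above, GC content and oligomer frequencies, are simple to compute. The sketch below shows one way to do so and to compare two sequences; it is illustrative only. The abstract does not state how signatures were compared, so the Euclidean distance used here, the default k of 2, and all function names are assumptions, and the example sequences are invented.

```python
from collections import Counter
from itertools import product
import math


def gc_content(seq):
    """Fraction of G and C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / acgt


def kmer_signature(seq, k=2):
    """Relative frequency of every possible k-mer, in fixed lexicographic order.

    Windows containing ambiguous bases (for example N) are ignored.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[kmer] for kmer in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[kmer] / total for kmer in kmers]


def signature_distance(seq_a, seq_b, k=2):
    """Euclidean distance between two k-mer signatures (an assumed metric)."""
    sig_a = kmer_signature(seq_a, k)
    sig_b = kmer_signature(seq_b, k)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(sig_a, sig_b)))


# Invented toy sequences; real genomes would be read from FASTA files.
virus = "ATGCGCGCTAGCGCGGCCGCTA"
host = "ATATTTAAATATTAGATATAAT"
print(gc_content(virus), gc_content(host))
print(signature_distance(virus, host))
```

A distance of zero means identical k-mer profiles; larger values indicate more dissimilar signatures. Longer k-mers give finer resolution but need longer sequences to estimate frequencies reliably.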
The major organelles of the endomembrane system were in place by the time of the last eukaryotic common ancestor (LECA) (~1.5 billion years ago). Their acquisitions were defining milestones during eukaryogenesis. Comparative cell biology and evolutionary analyses show multiple instances of homology in the protein machinery controlling distinct interorganelle trafficking routes. Resolving these homologous relationships allows us to explore processes underlying the emergence of additional, distinct cellular compartments, infer ancestral states predating LECA, and explore the process of eukaryogenesis itself. Here, we undertake a molecular evolutionary analysis (including providing a transcriptome of the jakobid flagellate Reclinomonas americana), exploring the origins of the machinery responsible for the biogenesis of lysosome-related organelles (LROs), the Biogenesis of LRO Complexes (BLOCs 1, 2, and 3). This pathway has been studied only in animals and is not considered a feature of the basic eukaryotic cell plan. We show that this machinery is present across the eukaryotic tree of life and was likely in place prior to LECA, making it an underappreciated facet of eukaryotic cellular organisation. Moreover, we resolve multiple points of ancient homology between all three BLOCs and other post-endosomal retrograde trafficking machinery (BORC, CCZ1 and MON1 proteins, and an unexpected relationship with the "homotypic fusion and vacuole protein sorting" (HOPS) and "Class C core vacuole/endosomal tethering" (CORVET) complexes), offering a mechanistic and evolutionary unification of these trafficking pathways. Overall, this study provides a comprehensive account of the rise of the LRO biogenesis machinery from before the LECA to current eukaryotic diversity, integrating it into the larger mechanistic framework describing endomembrane evolution.
The human intestine is a habitat for microorganisms and, recently, the composition of the intestinal microbiota has been correlated with the etiology of diseases such as inflammations, sores, and tumors. Although many studies have been conducted to understand the composition of that microbiota, expanding these studies to more samples and different backgrounds will improve our knowledge. In this work, we characterized the composition and diversity of the colon microbiota of healthy subjects, patients with inflammatory bowel disease (IBD), and patients with colon cancer by metagenomic sequencing. Our results indicated that the relative abundance of prokaryotic and eukaryotic microbes differs between healthy and tumor biopsies, between tumor and IBD biopsies, and between fresh and paraffin-embedded tumor biopsies. Fusobacterium, Escherichia-Shigella, and Streptococcus genera were relatively abundant in fresh tumor biopsies, while Pseudomonas was significantly elevated in IBD biopsies. Additionally, Malasseziales, another opportunistic pathogen, was the most abundant fungal clade in IBD biopsies, especially in ulcerative colitis. We also found that, while the Basidiomycota:Ascomycota ratio was slightly lower in tumor biopsies than in biopsies from healthy subjects, it was significantly higher in IBD biopsies. Our work contributes to knowledge of the diversity of prokaryotic and eukaryotic microbes in colon biopsies from patients with IBD and colon cancer.
- MeSH
- Basidiomycota * MeSH
- Crohn Disease * microbiology MeSH
- Inflammatory Bowel Diseases * microbiology MeSH
- Humans MeSH
- Microbiota * MeSH
- Colonic Neoplasms * MeSH
- Intestinal Mucosa microbiology MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
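The colon microbiota abstract above reports relative abundances and a Basidiomycota:Ascomycota ratio. As a minimal sketch of how such quantities are typically derived from per-taxon read counts, the code below uses invented counts; the function names and the handling of a zero denominator are assumptions, and nothing here reproduces the study's pipeline or data.

```python
def relative_abundance(counts):
    """Convert raw per-taxon read counts into fractions that sum to 1."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads to normalise")
    return {taxon: n / total for taxon, n in counts.items()}


def phylum_ratio(counts, numerator="Basidiomycota", denominator="Ascomycota"):
    """Ratio of reads assigned to two phyla; missing taxa count as zero."""
    num = counts.get(numerator, 0)
    den = counts.get(denominator, 0)
    if den == 0:
        # Undefined when both are absent, unbounded when only the
        # denominator is absent (an assumed convention).
        return float("nan") if num == 0 else float("inf")
    return num / den


# Hypothetical fungal read counts for one biopsy, for illustration only.
sample = {"Basidiomycota": 300, "Ascomycota": 600, "Mucoromycota": 100}
print(relative_abundance(sample))
print(phylum_ratio(sample))
```

Because the ratio uses only two taxa, it is unaffected by how many reads fall into other phyla, whereas relative abundances shift whenever any taxon's count changes.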
Molecular identification of micro- and macroorganisms based on nuclear markers has revolutionized our understanding of their taxonomy, phylogeny and ecology. Today, research on the diversity of eukaryotes in global ecosystems heavily relies on nuclear ribosomal RNA (rRNA) markers. Here, we present the research community-curated reference database EUKARYOME for nuclear ribosomal 18S rRNA, internal transcribed spacer (ITS) and 28S rRNA markers for all eukaryotes, including metazoans (animals), protists, fungi and plants. It is particularly useful for the identification of arbuscular mycorrhizal fungi as it bridges the four commonly used molecular markers: ITS1, ITS2, 18S V4-V5 and 28S D1-D2 subregions. The key benefits of this database over other annotated reference sequence databases are that it is not restricted to certain taxonomic groups and it includes all rRNA markers. EUKARYOME also offers a number of reference long-read sequences that are derived from (meta)genomic and (meta)barcoding, a unique feature that can be used for taxonomic identification and chimera control of third-generation, long-read, high-throughput sequencing data. Taxonomic assignments of rRNA genes in the database are verified based on phylogenetic approaches. The reference datasets are available in multiple formats from the project homepage, http://www.eukaryome.org.
- MeSH
- Databases, Genetic MeSH
- Databases, Nucleic Acid MeSH
- Eukaryota * genetics MeSH
- Phylogeny MeSH
- Genes, rRNA genetics MeSH
- RNA, Ribosomal, 18S genetics MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
Understanding the relation between terrestrial microorganisms and edaphic factors in the Antarctic can provide insights into their potential response to environmental changes. Here we examined the composition of bacterial and micro-eukaryotic communities using amplicon sequencing of rRNA genes in 105 soil samples from the Sør Rondane Mountains (East Antarctica), differing in bedrock or substrate type and associated physicochemical conditions. Although the two most widespread taxa (Acidobacteriota and Chlorophyta) were relatively abundant in each sample, multivariate analysis and co-occurrence networks revealed pronounced differences in community structure depending on substrate type. In moraine substrates, Actinomycetota and Cercozoa were the most abundant bacterial and eukaryotic phyla, whereas on gneiss, granite and marble substrates, Cyanobacteriota and Metazoa were the dominant bacterial and eukaryotic taxa. However, at a lower taxonomic level, a distinct differentiation was observed within the Cyanobacteriota phylum depending on substrate type, with granite being dominated by the Nostocaceae family and marble by the Chroococcidiopsaceae family. Surprisingly, metazoans were relatively abundant according to the 18S rRNA dataset, even in samples from the most arid sites, such as moraines in Austkampane and Widerøefjellet ("Dry Valley"). Overall, our study shows that different substrate types support distinct microbial communities, and that mineral soil diversity is a major determinant of terrestrial microbial diversity in inland Antarctic nunataks and valleys.
- Publication type
- Journal Article MeSH