Most cited article - PubMed ID 32884150
Releasing uncurated datasets is essential for reproducible phylogenomics
Understanding the origin of eukaryotic cells is one of the most difficult problems in all of biology. A key challenge relevant to the question of eukaryogenesis is reconstructing the gene repertoire of the last eukaryotic common ancestor (LECA). As data sets grow, sketching an accurate genomics-informed picture of early eukaryotic cellular complexity requires provision of analytical resources and a commitment to data sharing. Here, we summarise progress towards understanding the biology of LECA and outline a community approach to inferring its wider gene repertoire. Once assembled, a robust LECA gene set will be a useful tool for evaluating alternative hypotheses about the origin of eukaryotes and understanding the evolution of traits in all descendant lineages, with relevance in diverse fields such as cell biology, microbial ecology, biotechnology, agriculture, and medicine. In this Consensus View, we put forth the status quo and an agreed path forward to reconstruct LECA's gene content.
Freshwater crayfish are amongst the largest macroinvertebrates and play a keystone role in the ecosystems they occupy. Understanding the global distribution of these animals is often hindered due to a paucity of distributional data. Additionally, non-native crayfish introductions are becoming more frequent, which can cause severe environmental and economic impacts. Management decisions related to crayfish and their habitats require accurate, up-to-date distribution data and mapping tools. Such data are currently patchily distributed with limited accessibility and are rarely up-to-date. To address these challenges, we developed a versatile e-portal to host distributional data of freshwater crayfish and their pathogens (using Aphanomyces astaci, the causative agent of the crayfish plague, as the most prominent example). Populated with expert data and operating in near real-time, World of Crayfish™ is a living, publicly available database providing worldwide distributional data sourced by experts in the field. The database offers open access to the data through specialized standard geospatial services (Web Map Service, Web Feature Service) enabling users to view, embed, and download customizable outputs for various applications. The platform is designed to support technical enhancements in the future, with the potential to eventually incorporate various additional features. This tool serves as a step forward towards a modern era of conservation planning and management of freshwater biodiversity.
- Keywords
- Aphanomyces astaci, Astacidae, Cambaridae, Endangered species, Invasive species, Open data, Parastacidae, Species distribution,
- MeSH
- Aphanomyces MeSH
- Databases, Factual MeSH
- Ecosystem MeSH
- Internet MeSH
- Astacoidea * microbiology MeSH
- Fresh Water * MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
PhyloFisher is a software package written primarily in Python3 that can be used for the creation, analysis, and visualization of phylogenomic datasets that consist of protein sequences from eukaryotic organisms. Unlike many existing phylogenomic pipelines, PhyloFisher comes with a manually curated database of 240 protein-coding genes, a subset of a previous phylogenetic dataset sampled from 304 eukaryotic taxa. The software package can also utilize a user-created database of eukaryotic proteins, which may be more appropriate for shallow evolutionary questions. PhyloFisher is also equipped with a set of utilities to aid in running routine analyses, such as the prediction of alternative genetic codes, removal of genes and/or taxa based on occupancy/completeness of the dataset, testing for amino acid compositional heterogeneity among sequences, removal of heterotachious and/or fast-evolving sites, removal of fast-evolving taxa, supermatrix creation from randomly resampled genes, and supermatrix creation from nucleotide sequences. © 2024 Wiley Periodicals LLC. Basic Protocol 1: Constructing a phylogenomic dataset Basic Protocol 2: Performing phylogenomic analyses Support Protocol 1: Installing PhyloFisher Support Protocol 2: Creating a custom phylogenomic database.
- Keywords
- evolution, genomics, systematics, transcriptomics,
- MeSH
- Amino Acids * MeSH
- Biological Evolution * MeSH
- Phylogeny MeSH
- Culture MeSH
- Amino Acid Sequence MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Amino Acids * MeSH
Apicomplexans and related lineages comprise many obligate symbionts of animals; some of which cause notorious diseases such as malaria. They evolved from photosynthetic ancestors and transitioned into a symbiotic lifestyle several times, giving rise to species with diverse non-photosynthetic plastids. Here, we sought to reconstruct the evolution of the cryptic plastids in the apicomplexans, chrompodellids, and squirmids (ACS clade) by generating five new single-cell transcriptomes from understudied gregarine lineages, constructing a robust phylogenomic tree incorporating all ACS clade sequencing datasets available, and using these to examine in detail, the evolutionary distribution of all 162 proteins recently shown to be in the apicoplast by spatial proteomics in Toxoplasma. This expanded homology-based reconstruction of plastid proteins found in the ACS clade confirms earlier work showing convergence in the overall metabolic pathways retained once photosynthesis is lost, but also reveals differences in the degrees of plastid reduction in specific lineages. We show that the loss of the plastid genome is common and unexpectedly find many lineage- and species-specific plastid proteins, suggesting the presence of evolutionary innovations and neofunctionalizations that may confer new functional and metabolic capabilities that are yet to be discovered in these enigmatic organelles.
- Keywords
- apicomplexans, organelle evolution, parasites, plastids, reductive evolution,
- MeSH
- Photosynthesis genetics MeSH
- Phylogeny MeSH
- Metabolic Networks and Pathways MeSH
- Plastids * genetics MeSH
- Proteome * genetics MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Proteome * MeSH
BACKGROUND: Apicomplexa is a diverse phylum comprising unicellular endobiotic animal parasites and contains some of the most well-studied microbial eukaryotes including the devastating human pathogens Plasmodium falciparum and Cryptosporidium hominis. In contrast, data on the invertebrate-infecting gregarines remains sparse and their evolutionary relationship to other apicomplexans remains obscure. Most apicomplexans retain a highly modified plastid, while their mitochondria remain metabolically conserved. Cryptosporidium spp. inhabit an anaerobic host-gut environment and represent the known exception, having completely lost their plastid while retaining an extremely reduced mitochondrion that has lost its genome. Recent advances in single-cell sequencing have enabled the first broad genome-scale explorations of gregarines, providing evidence of differential plastid retention throughout the group. However, little is known about the retention and metabolic capacity of gregarine mitochondria. RESULTS: Here, we sequenced transcriptomes from five species of gregarines isolated from cockroaches. We combined these data with those from other apicomplexans, performed detailed phylogenomic analyses, and characterized their mitochondrial metabolism. Our results support the placement of Cryptosporidium as the earliest diverging lineage of apicomplexans, which impacts our interpretation of evolutionary events within the phylum. By mapping in silico predictions of core mitochondrial pathways onto our phylogeny, we identified convergently reduced mitochondria. These data show that the electron transport chain has been independently lost three times across the phylum, twice within gregarines. CONCLUSIONS: Apicomplexan lineages show variable functional restructuring of mitochondrial metabolism that appears to have been driven by adaptations to parasitism and anaerobiosis. Our findings indicate that apicomplexans are rife with convergent adaptations, with shared features including morphology, energy metabolism, and intracellularity.
- Keywords
- Anaerobic metabolism, Apicomplexa, Eugregarines, Evolution, Mitochondria, Mitosome, Parasitism, Phylogenomics,
- MeSH
- Single-Cell Analysis MeSH
- Apicomplexa * genetics MeSH
- Phylogeny MeSH
- Humans MeSH
- Mitochondria * genetics MeSH
- Transcriptome MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH