Structure inference
BACKGROUND: Despite the excellent fossil record of cephalopods, their early evolution is poorly understood. Different, partly incompatible phylogenetic hypotheses have been proposed in the past, which reflected individual authors' opinions on the importance of certain characters but were not based on thorough cladistic analyses. At the same time, methods of phylogenetic inference have undergone substantial improvements. For fossil datasets, which typically only include morphological data, Bayesian inference and in particular the introduction of the fossilized birth-death model have opened new possibilities. Nevertheless, many tree topologies recovered from these new methods reflect large uncertainties, which have led to discussions on how to best summarize the information contained in the posterior set of trees. RESULTS: We present a large, newly compiled morphological character matrix of Cambrian and Ordovician cephalopods to conduct a comprehensive phylogenetic analysis and resolve existing controversies. Our results recover three major monophyletic groups, which correspond to the previously recognized Endoceratoidea, Multiceratoidea, and Orthoceratoidea, though comprising slightly different taxa. In addition, many Cambrian and Early Ordovician representatives of the Ellesmerocerida and Plectronocerida were recovered near the root. The Ellesmerocerida is para- and polyphyletic, with some of its members recovered among the Multiceratoidea and early Endoceratoidea. These relationships are robust against modifications of the dataset. While our trees initially seem to reflect large uncertainties, these are mainly a consequence of the way clade support is measured. We show that clade posterior probabilities and tree similarity metrics often underestimate congruence between trees, especially if wildcard taxa are involved. CONCLUSIONS: Our results provide important insights into the earliest evolution of cephalopods and clarify evolutionary pathways. We provide a classification scheme that is based on a robust phylogenetic analysis. Moreover, we provide some general insights on the application of Bayesian phylogenetic inference to morphological datasets. We support earlier findings that quartet similarity metrics should be preferred over the Robinson-Foulds distance when higher-level phylogenetic relationships are of interest and propose that using a posteriori pruned maximum clade credibility trees helps in assessing support for phylogenetic relationships among a set of relevant taxa, because they provide clade support values that better reflect the phylogenetic signal.
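The claim that clade posterior probabilities understate agreement when wildcard taxa are present can be illustrated with a toy calculation. The sketch below is not the authors' pipeline: the nested-tuple tree encoding, the helper names `clades` and `clade_support`, and the four-tree "posterior" with a wildcard taxon `W` are all assumptions made for illustration. It counts how often each clade appears across the trees, first with all tips and then after pruning `W` from every tree.

```python
from collections import Counter


def clades(tree, drop=frozenset()):
    """Collect the clades (frozensets of tip names) of a rooted tree.

    `tree` is a nested tuple of tip-name strings (a hypothetical encoding).
    Tips listed in `drop` are pruned before clades are collected.
    """
    found = set()

    def walk(node):
        if isinstance(node, str):
            return frozenset() if node in drop else frozenset([node])
        tips = frozenset().union(*(walk(child) for child in node))
        if len(tips) > 1:  # single tips are not informative clades
            found.add(tips)
        return tips

    walk(tree)
    return found


def clade_support(trees, drop=frozenset()):
    """Fraction of trees in which each clade occurs (after optional pruning)."""
    counts = Counter()
    for tree in trees:
        counts.update(clades(tree, drop))
    return {clade: n / len(trees) for clade, n in counts.items()}


# Toy posterior sample: A+B and C+D always group together,
# but the wildcard W attaches in a different place in every tree.
posterior = [
    ((("A", "B"), "W"), ("C", "D")),
    (("A", ("B", "W")), ("C", "D")),
    (("A", "B"), (("C", "W"), "D")),
    (("A", "B"), ("C", ("D", "W"))),
]

full = clade_support(posterior)
pruned = clade_support(posterior, drop=frozenset({"W"}))

for name in ("AB", "CD"):
    clade = frozenset(name)
    print(name, "all tips:", full.get(clade, 0.0), "W pruned:", pruned.get(clade, 0.0))
```

In this toy sample the underlying A+B and C+D groupings are identical in every tree, yet their support with all tips included is well below 1 because `W` keeps breaking one of them up; pruning `W` after the fact recovers full support.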
- Keywords
- Bayesian phylogenetics, Cephalopoda, Endoceratoidea, Fossilized birth-death process, Multiceratoidea, Nautiloidea, Orthoceratoidea, Phylogeny, Posterior clade probabilities, Tree similarities
- MeSH
- Bayes Theorem MeSH
- Phylogeny MeSH
- Cephalopoda * genetics MeSH
- Probability MeSH
- Fossils MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Nonequilibrium dynamics and non-neutral processes, such as trait-dependent dispersal, are often missing from quantitative island biogeography models despite their potential explanatory value. One of the most influential nonequilibrium models is the taxon cycle, but it has been difficult to test its validity as a general biogeographical framework. Here, we test predictions of the taxon cycle model using six expected phylogenetic patterns and a time-calibrated phylogeny of Indo-Pacific Odontomachus (Hymenoptera: Formicidae: Ponerinae), one of the ant genera that E.O. Wilson used when first proposing the hypothesis. We used model-based inference and a newly developed trait-dependent dispersal model to jointly estimate ancestral biogeography, ecology (habitat preferences for forest interiors, vs. "marginal" habitats, such as savannahs, shorelines, disturbed areas) and the linkage between ecology and dispersal rates. We found strong evidence that habitat shifts from forest interior to open and disturbed habitats increased macroevolutionary dispersal rate. In addition, lineages occupying open and disturbed habitats can give rise to both island endemics re-occupying only forest interiors and taxa that re-expand geographical ranges. The phylogenetic predictions outlined in this study can be used in future work to evaluate the relative weights of neutral (e.g., geographical distance and area) and non-neutral (e.g., trait-dependent dispersal) processes in historical biogeography and community ecology.
- Keywords
- Formicidae, Melanesia, biogeography, diversification, insect, taxon cycle
- MeSH
- Ecosystem MeSH
- Formicidae classification genetics MeSH
- Phylogeny * MeSH
- Phylogeography MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
BACKGROUND: All currently available methods of network/association inference from microarray gene expression measurements implicitly assume that such measurements represent the actual expression levels of different genes within each cell included in the biological sample under study. Contrary to this common belief, modern microarray technology produces signals aggregated over a random number of individual cells, a "nitty-gritty" aspect of such arrays, thereby causing a random effect that distorts the correlation structure of intra-cellular gene expression levels. RESULTS: This paper provides a theoretical consideration of the random effect of signal aggregation and its implications for correlation analysis and network inference. An attempt is made to quantitatively assess the magnitude of this effect from real data. Some preliminary ideas are offered to mitigate the consequences of random signal aggregation in the analysis of gene expression data. CONCLUSION: Resulting from the summation of expression intensities over a random number of individual cells, the observed signals may not adequately reflect the true dependence structure of intra-cellular gene expression levels needed as a source of information for network reconstruction. Whether the reported effect is extreme or not, the important point is to recognize and incorporate such a signal source for proper inference. The usefulness of inference on genetic regulatory structures from microarray data depends critically on the ability of investigators to overcome this obstacle in a scientifically sound way. REVIEWERS: This article was reviewed by Byung Soo KIM, Jeanne Kowalski and Geoff McLachlan.
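The aggregation effect described in this abstract can be reproduced in a small simulation. The sketch below is an illustration under assumed settings, not the paper's model: two genes are drawn independently in every cell (Gaussian with mean 10 and standard deviation 1), and each "array" sums both genes over a random number of cells between 50 and 150. The function names `pearson` and `simulate` and all numeric settings are hypothetical.

```python
import random


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


def simulate(n_samples=1000, seed=0):
    """Return (single-cell correlation, aggregated-signal correlation)."""
    rng = random.Random(seed)
    cell_a, cell_b, spot_a, spot_b = [], [], [], []
    for _ in range(n_samples):
        n_cells = rng.randint(50, 150)  # random number of cells per sample
        a = [rng.gauss(10.0, 1.0) for _ in range(n_cells)]  # gene A per cell
        b = [rng.gauss(10.0, 1.0) for _ in range(n_cells)]  # gene B, independent of A
        cell_a.append(a[0])  # one representative cell per sample
        cell_b.append(b[0])
        spot_a.append(sum(a))  # aggregated signal, as measured on the array
        spot_b.append(sum(b))
    return pearson(cell_a, cell_b), pearson(spot_a, spot_b)


cell_r, spot_r = simulate()
print(f"single-cell correlation: {cell_r:+.3f}")
print(f"aggregated correlation:  {spot_r:+.3f}")
```

Although the two genes are independent within each cell, the summed signals are strongly correlated because both sums share the same random cell count. Under these assumed settings, that shared count dominates the variance of each aggregated signal.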
- MeSH
- Humans MeSH
- Models, Genetic * MeSH
- Statistics, Nonparametric MeSH
- Oligonucleotide Array Sequence Analysis methods statistics & numerical data MeSH
- Gene Expression Profiling methods statistics & numerical data MeSH
- Computational Biology methods statistics & numerical data MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Review MeSH
- Research Support, N.I.H., Extramural MeSH
Ancient origins, profound ecological divergence, and extensive hybridization make the fire-bellied toads Bombina bombina and B. variegata (Anura: Bombinatoridae) an intriguing test case of ecological speciation. Previous modeling has proposed that the narrow Bombina hybrid zones represent strong barriers to neutral introgression. We test this prediction by inferring the rate of gene exchange between pure populations on either side of the intensively studied Kraków transect. We developed a method to extract high confidence sets of orthologous genes from de novo transcriptome assemblies, fitted a range of divergence models to these data and assessed their relative support with analytic likelihood calculations. There was clear evidence for postdivergence gene flow, but, as expected, no perceptible signal of recent introgression via the nearby hybrid zone. The analysis of two additional Bombina taxa (B. v. scabra and B. orientalis) validated our parameter estimates against a larger set of prior expectations. Despite substantial cumulative introgression over millions of years, adaptive divergence of the hybridizing taxa is essentially unaffected by their lack of reproductive isolation. Extended distribution ranges also buffer them against small-scale environmental perturbations that have been shown to reverse the speciation process in other, more recent ecotypes.
- Keywords
- Ecological speciation, RNA-seq, genome-wide coalescence, hybrid zone, introgression
- MeSH
- Phylogeny MeSH
- Hybridization, Genetic * MeSH
- Animal Distribution MeSH
- Gene Flow * MeSH
- Transcriptome MeSH
- Anura genetics MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Uncovering causal interdependencies from observational data is one of the great challenges of nonlinear time series analysis. In this paper, we discuss this topic with the help of an information-theoretic concept known as Rényi's information measure. In particular, we tackle the directional information flow between bivariate time series in terms of Rényi's transfer entropy. We show that by choosing Rényi's parameter α, we can appropriately control information that is transferred only between selected parts of the underlying distributions. This, in turn, is a particularly potent tool for quantifying causal interdependencies in time series, where the knowledge of "black swan" events, such as spikes or sudden jumps, is of key importance. In this connection, we first prove that for Gaussian variables, Granger causality and Rényi transfer entropy are entirely equivalent. Moreover, we also partially extend these results to heavy-tailed α-Gaussian variables. These results allow establishing a connection between autoregressive and Rényi entropy-based information-theoretic approaches to data-driven causal inference. To aid our intuition, we employed the Leonenko et al. entropy estimator and analyzed Rényi's information flow between bivariate time series generated from two unidirectionally coupled Rössler systems. Notably, we find that Rényi's transfer entropy not only allows us to detect a threshold of synchronization but also provides non-trivial insight into the structure of a transient regime that exists between the region of chaotic correlations and the synchronization threshold. In addition, from Rényi's transfer entropy, we could reliably infer the direction of coupling and, hence, causality, only for coupling strengths smaller than the onset value of the transient regime, i.e., when two Rössler systems are coupled but have not yet entered synchronization.
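How the parameter α shifts the emphasis between parts of a distribution can be seen from the standard discrete Rényi entropy, H_α(p) = log(Σ p_i^α) / (1 − α), which tends to the Shannon entropy as α → 1. The sketch below implements only this textbook quantity for a finite distribution. It is not the transfer entropy or the Leonenko et al. estimator used in the paper, and the function name `renyi_entropy` and the example distribution are assumptions.

```python
import math


def renyi_entropy(probs, alpha):
    """Rényi entropy (in nats) of a discrete distribution.

    Uses H_alpha = log(sum(p_i ** alpha)) / (1 - alpha) and falls back to
    the Shannon entropy in the limit alpha -> 1.
    """
    if alpha < 0:
        raise ValueError("alpha must be non-negative")
    p = [x for x in probs if x > 0]  # zero-probability states contribute nothing
    if abs(alpha - 1.0) < 1e-12:
        return -sum(x * math.log(x) for x in p)
    return math.log(sum(x ** alpha for x in p)) / (1.0 - alpha)


# One dominant state plus three rare "tail" states.
peaked = [0.97, 0.01, 0.01, 0.01]

for alpha in (0.5, 1.0, 2.0):
    print(f"alpha = {alpha}: H = {renyi_entropy(peaked, alpha):.4f}")
```

For α < 1 the rare states are weighted up, so the entropy is larger. For α > 1 the dominant state is emphasized and the entropy shrinks. This is the sense in which α selects which part of a distribution contributes.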
- Keywords
- Rényi entropy, Rényi transfer entropy, Rössler system, multivariate time series
- Publication type
- Journal Article MeSH
INTRODUCTION: Intracranial EEG (iEEG) is a powerful tool for mapping brain function, characterized by high temporal and spatial resolution, allowing the study of interactions among neuronal populations that orchestrate cognitive processing. However, the statistical inference and analysis of brain networks using iEEG data face many challenges related to its sparse brain coverage and its inhomogeneity across patients. METHODS: We review these challenges and develop a methodological pipeline for estimation of network structure not obtainable from any single patient, illustrated on the inference of the interaction among visual streams using a dataset of 27 human iEEG recordings from a visual experiment employing visual scene stimuli. A 100 ms sliding window and multiple band-pass filtered signals are used to provide temporal and spectral resolution. For the connectivity analysis we showcase two connectivity measures reflecting different types of interaction between regions of interest (ROI): Phase Locking Value as a symmetric measure of synchrony, and Directed Transfer Function as an asymmetric measure describing causal interaction. For each pair of channels, initial uncorrected significance testing at p < 0.05 for every time-frequency point is carried out by comparison of the data-derived connectivity to a baseline surrogate-based null distribution, providing a binary time-frequency connectivity map. For each ROI pair, a connectivity density map is obtained by averaging across all pairs of channels spanning them, effectively agglomerating data across relevant channels and subjects. Finally, the difference of the mean map value after and before the stimulation is compared to the same statistic in surrogate data to assess link significance. RESULTS: The analysis confirmed the function of the parieto-medial temporal pathway, mediating visuospatial information between dorsal and ventral visual streams during visual scene analysis. Moreover, we observed anterior hippocampal connectivity with more posterior areas in the medial temporal lobe, and found reciprocal information flow between early processing areas and the medial place area. DISCUSSION: To summarize, we developed an approach for estimating network connectivity, dealing with the challenge of sparse individual coverage of intracranial EEG electrodes. Its application provided new insights into the interaction between the dorsal and ventral visual streams, one of the iconic dualities in human cognition.
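The Phase Locking Value named in this abstract is commonly defined as the magnitude of the mean unit phasor of the phase difference between two channels. The sketch below computes that standard quantity from two lists of instantaneous phases. It is a minimal illustration, not the authors' pipeline: the function name `phase_locking_value` and the synthetic phases are assumptions, and extracting phases from band-pass filtered recordings (for example with a Hilbert transform) is outside its scope.

```python
import cmath
import math


def phase_locking_value(phases_a, phases_b):
    """PLV = | mean( exp(i * (phase_a - phase_b)) ) |, a value in [0, 1]."""
    if len(phases_a) != len(phases_b) or not phases_a:
        raise ValueError("phase lists must be non-empty and of equal length")
    total = sum(cmath.exp(1j * (a - b)) for a, b in zip(phases_a, phases_b))
    return abs(total) / len(phases_a)


n = 200
phases_a = [0.3 * k for k in range(n)]

# Channel locked to A with a constant lag of 0.7 rad.
locked = [p - 0.7 for p in phases_a]

# Channel whose lag relative to A sweeps evenly around the full circle.
drifting = [p - 2 * math.pi * k / n for k, p in enumerate(phases_a)]

plv_locked = phase_locking_value(phases_a, locked)
plv_drifting = phase_locking_value(phases_a, drifting)
print(f"locked:   {plv_locked:.3f}")
print(f"drifting: {plv_drifting:.3f}")
```

The measure is symmetric in the two channels, in line with the abstract's description of PLV as a symmetric measure of synchrony. It is insensitive to the size of a constant lag and responds only to how consistent the lag is.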
- Keywords
- Directed Transfer Function, Phase Locking Value, connectivity analysis, dorsal visual stream, information flow, intracranial EEG, ventral visual stream, visual pathways
- Publication type
- Journal Article MeSH
Inferring the dependence structure of complex networks from the observation of the non-linear dynamics of their components is among the common, yet far from resolved, challenges faced when studying real-world complex systems. While a range of methods using the ordinal patterns framework has been proposed specifically to tackle the problem of dependence inference in the presence of non-linearity, they come with important restrictions in the scope of their application. Here, we introduce the sign patterns as an extension of the ordinal patterns, arising from a more flexible symbolization which is able to encode longer sequences with a lower number of symbols. After transforming time series into sequences of sign patterns, we derive improved estimates for statistical quantities by considering necessary constraints on the probabilities of occurrence of combinations of symbols in a symbolic process with prohibited transitions. We utilize these to design an asymptotic chi-squared test to evaluate dependence between two time series and then apply it to the construction of climate networks, illustrating that the developed method can capture both linear and non-linear dependences, while avoiding bias present in the naive application of the often used Pearson correlation coefficient or mutual information.
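For orientation, the classical ordinal-pattern symbolization that this abstract takes as its starting point can be sketched in a few lines. The code below shows only that baseline encoding, in which each window of consecutive values is replaced by the permutation that sorts it. It does not implement the sign patterns or the chi-squared test proposed in the paper, whose exact definitions are not given here, and the function name `ordinal_patterns` and the example series are assumptions.

```python
import math
from collections import Counter


def ordinal_patterns(series, order=3):
    """Encode each length-`order` window by the index order that sorts it."""
    patterns = []
    for i in range(len(series) - order + 1):
        window = series[i:i + order]
        # Indices of the window arranged from smallest to largest value.
        patterns.append(tuple(sorted(range(order), key=lambda k: window[k])))
    return patterns


series = [1, 3, 2, 5, 4]
symbols = ordinal_patterns(series)
print(symbols)
print(Counter(symbols))

# The encoding depends only on the ranking of values, so a strictly
# increasing transform of the data leaves the symbols unchanged.
warped = ordinal_patterns([math.exp(x) for x in series])
print(warped == symbols)
```

Because only rankings matter, statistics built on such symbols are unaffected by monotone distortions of the signal. That invariance is one reason symbolic approaches are used for dependence inference under non-linearity.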
- Publication type
- Journal Article MeSH
Circumscribing major eukaryote groups and resolving higher order relationships between them are among the most challenging tasks facing molecular evolutionists. Recently, evidence suggesting a new supergroup (the Excavata) comprising a wide array of flagellates has been collected. This group consists of diplomonads, retortamonads, Carpediemonas, heteroloboseans, Trimastix, jakobids, and Malawimonas, all of which possess a particular type of ventral feeding groove that is proposed to be homologous. Euglenozoans, parabasalids, and oxymonads have also been associated with Excavata as their relationships to one or more core excavate taxa were demonstrated. However, the main barrier to the general acceptance of Excavata is that its existence is founded primarily on cytoskeletal similarities, without consistent support from molecular phylogenetics. In gene trees, Excavata are typically not recovered together. In this paper, we present an analysis of the phylogenetic position of oxymonads (genus Monocercomonoides) based on concatenation of eight protein sequences (alpha-tubulin, beta-tubulin, gamma-tubulin, EF-1alpha, EF-2, cytosolic (cyt) HSP70, HSP90, and ubiquitin) and 18S rRNA. We demonstrate that the genes are in conflict regarding the position of oxymonads. Concatenation of alpha- and beta-tubulin placed oxymonads in the plant-chromist part of the tree, while the concatenation of other genes recovered a well-supported group of Metamonada (oxymonads, diplomonads, and parabasalids) that branched weakly with euglenozoans--connecting all four excavates included in the analyses and thus providing conditional support for the existence of Excavata.
- MeSH
- Peptide Elongation Factor 1 genetics MeSH
- Peptide Elongation Factor 2 genetics MeSH
- Eukaryotic Cells * MeSH
- Flagella MeSH
- Phylogeny * MeSH
- Genes, rRNA MeSH
- Classification MeSH
- Evolution, Molecular * MeSH
- Polymerase Chain Reaction MeSH
- Base Sequence MeSH
- Sequence Analysis, DNA MeSH
- Tubulin genetics MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substances
- Peptide Elongation Factor 1 MeSH
- Peptide Elongation Factor 2 MeSH
- Tubulin MeSH
Viral RNA-dependent polymerases (vRdPs) are present in all RNA viruses; unfortunately, their sequence similarity is too low for phylogenetic studies. Nevertheless, vRdP protein structures are remarkably conserved. In this study, we used the structural similarity of vRdPs to reconstruct their evolutionary history. The major strength of this work is in unifying sequence and structural data into a single quantitative phylogenetic analysis, using a powerful Bayesian approach. The resulting phylogram of vRdPs demonstrates that RNA-dependent DNA polymerases (RdDPs) of viruses within the Retroviridae family cluster in a clearly separated group of vRdPs, while RNA-dependent RNA polymerases (RdRPs) of dsRNA and +ssRNA viruses are mixed together. This evidence supports the hypothesis that RdRPs replicating +ssRNA viruses evolved multiple times from RdRPs replicating +dsRNA viruses, and vice versa. Moreover, our phylogram may be presented as a scheme for RNA virus evolution. The results are in concordance with the current concept of RNA virus evolution. Finally, the methods used in our work provide a new direction for studying ancient virus evolution.
- MeSH
- Species Specificity MeSH
- Phylogeny MeSH
- Evolution, Molecular * MeSH
- Models, Molecular MeSH
- Molecular Sequence Data MeSH
- RNA-Dependent RNA Polymerase chemistry genetics MeSH
- RNA Viruses classification enzymology genetics MeSH
- Protein Structure, Secondary MeSH
- Amino Acid Sequence MeSH
- Sequence Homology, Amino Acid MeSH
- Protein Structure, Tertiary * MeSH
- Binding Sites genetics MeSH
- Viral Proteins chemistry genetics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substances
- RNA-Dependent RNA Polymerase MeSH
- Viral Proteins MeSH
European wildlife has been subjected to intensifying levels of anthropogenic impact throughout the Holocene, yet the main genetic partitioning of many species is thought to still reflect the late-Pleistocene glacial refugia. We analyzed 26,342 nuclear SNPs of 464 wild boar (Sus scrofa) across the European continent to infer demographic history and reassess the genetic consequences of natural and anthropogenic forces. We found that population fragmentation, inbreeding and recent hybridization with domestic pigs have caused the spatial genetic structure to be heterogeneous at the local scale. Underlying local anthropogenic signatures, we found a deep genetic structure in the form of an arch-shaped cline extending from the Dinaric Alps, via Southeastern Europe and the Baltic states, to Western Europe and, finally, to the genetically diverged Iberian peninsula. These findings indicate that, despite considerable anthropogenic influence, the deeper, natural continental structure is still intact. Regarding the glacial refugia, our findings show a weaker signal than generally assumed, but are nevertheless suggestive of two main recolonization routes, with important roles for Southern France and the Balkans. Our results highlight the importance of applying genomic resources and framing genetic results within a species' demographic history and geographic distribution for a better understanding of the complex mixture of underlying processes.
- MeSH
- Demography MeSH
- Phylogeny MeSH
- Genetic Variation * MeSH
- Genome * MeSH
- DNA, Mitochondrial genetics MeSH
- Swine MeSH
- Sus scrofa genetics MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Geographic names
- Europe MeSH
- Substances
- DNA, Mitochondrial MeSH