Nejvíce citovaný článek - PubMed ID 35437400
Pladias platform: Technical description of the database structure
The unifying element of all biodiversity data is the issue of taxon hierarchy modeling. We compared 25 existing databases in terms of handling taxa hierarchy and presentation of this data. We used documentation or demo installations of databases as a source of information and next in line was the analysis of structures using R packages provided by inspected platforms. If neither of these was available, we used the public interface of individual databases. For almost half (12) of the databases analyzed, we did not find any formalized taxa hierarchy data structure, providing only biological information about taxon membership in higher ranks, which is not fully formalizable and thus not generally usable. The least effective Adjacency List model (storing parentId of a taxon) dominates among the remaining providers. This study demonstrates the lack of attention paid by current biodiversity databases to modeling taxon hierarchy, particularly to making it available to researchers in the form of a hierarchical data structure within the data provided. For biodiversity relational databases, the Closure Table type is the most suitable of the known data models, which also corresponds to the ontology concept. However, its use is rather sporadic within the biodiversity databases ecosystem.
- MeSH
- biodiverzita * MeSH
- databáze faktografické * MeSH
- Publikační typ
- časopisecké články MeSH
- technické zprávy MeSH
BACKGROUND: The timely and geographical resolutions, as well as the quantity and taxon concepts of records on the occurrence of plants near national borders is often ambiguous. This is due to the regional focus and different approaches of the contributing national and regional databases and networks of the neighbouring countries. Careful data transformation between national data providers is essential for understanding distribution patterns and its dynamics for organisms in areas along the national borders. Sharing occurrence data through the international data aggregator Global Biodiversity Information Facility (GBIF) is also complicated and has to consider that the underlying taxonomic concept and geographic information system of each single GBIF dataset might be different. In addition, some regional data providers have a restrictive (non-cc) licensing policy which does not allow data publication via the GBIF network. Therefore, it is necessary to investigate new ways to make data fit for use for a better and comprehensive understanding of the Flora of the Bohemian Forest. NEW INFORMATION: In this paper, we present a bilateral technical interoperability solution for vascular plant occurrence data for the area between the Czech Republic and Bavaria. We describe the initial state of data providers in both countries and the factual and technical challenges in finding a sustainable concept to establish mutual data sharing. The resulting solution for a functional infrastructure and an agreed data pipeline is described in a step-by-step approach. The new distributed infrastructure allows botanists and other stakeholders from both countries to work within the cross-border context of historical and current plants' distribution.
- Klíčová slova
- ABCD occurrence data sources, Bohemian Forest, Diversity Workbench repositories., Flora Silvae Gabretae, Pladias, occurrence data portal, taxon names services, web portal,
- Publikační typ
- časopisecké články MeSH