The European Reference Genome Atlas: piloting a decentralised approach to equitable biodiversity genomics
Status PubMed-not-MEDLINE Jazyk angličtina Země Anglie, Velká Británie Médium electronic
Typ dokumentu časopisecké články
Grantová podpora
Wellcome Trust - United Kingdom
P 32691
Austrian Science Fund FWF - Austria
PubMed
39289538
PubMed Central
PMC11408602
DOI
10.1038/s44185-024-00054-6
PII: 10.1038/s44185-024-00054-6
Knihovny.cz E-zdroje
- Publikační typ
- časopisecké články MeSH
A genomic database of all Earth's eukaryotic species could contribute to many scientific discoveries; however, only a tiny fraction of species have genomic information available. In 2018, scientists across the world united under the Earth BioGenome Project (EBP), aiming to produce a database of high-quality reference genomes containing all ~1.5 million recognized eukaryotic species. As the European node of the EBP, the European Reference Genome Atlas (ERGA) sought to implement a new decentralised, equitable and inclusive model for producing reference genomes. For this, ERGA launched a Pilot Project establishing the first distributed reference genome production infrastructure and testing it on 98 eukaryotic species from 33 European countries. Here we outline the infrastructure and explore its effectiveness for scaling high-quality reference genome production, whilst considering equity and inclusion. The outcomes and lessons learned provide a solid foundation for ERGA while offering key learnings to other transnational, national genomic resource projects and the EBP.
Algal Genetics Group UMR 8227 CNRS Sorbonne Universite UPMC University Paris 06 Paris France
Andorra Research and Innovation Sant Julià de Lòria Andorra
Animal Breeding and Genomics Wageningen University and Research Wageningen The Netherlands
Aquatic Ecology and Evolution Institute of Ecology and Evolution University of Bern Bern Switzerland
Barcelona Supercomputing Center; Spanish National Bioinformatics Institute ELIXIR Spain Getafe Spain
Barcelona Supercomputing Centre Barcelona Spain
Berlin Center for Genomics in Biodiversity Research Berlin Germany
Biodiversity Research Center Academia Sinica Taipei Taiwan
BIOPOLIS Program in Genomics Biodiversity and Land Planning CIBIO Campus de Vairao Vairao Portugal
Catalan Institution for Research and Advanced Studies Barcelona Spain
Centre for Ecological and Evolutionary Synthesis University of Oslo Oslo Norway
Centre for Palaeogenetics Stockholm Sweden
Centro Nacional de Análisis Genómico Barcelona Spain
CIBER of Epidemiology and Public Health Granada Spain
CIBERINFEC Instituto Carlos 3 Barcelona Spain
CoNISMa Consorzio Nazionale Interuniversitario per le Scienze del Mare Roma Italy
Conservation Biology Research Group Department of Biology University of Malta Msida Malta
Departamento de Biologia Animal Faculdade de Ciências da Universidade de Lisboa Lisboa Portugal
Departamento de Biologia Faculdade de Ciencias da Universidade do Porto Porto Portugal
Departamento de Biologia Faculdade de Ciencias Universidade do Porto Porto Portugal
Departamento de Biologia Vegetal Faculdade de Ciências Universidade de Lisboa Lisboa Portugal
Department of Agricultural Sciences University of Naples Federico 2 Portici Italy
Department of Animal Ecology Netherlands Institute of Ecology Wageningen The Netherlands
Department of Bioinformatics and Genetics Swedish Museum of Natural History Stockholm Sweden
Department of Biological and Environmental Science University of Jyvaskyla Jyvaskyla Finland
Department of Biology and Biotechnologies Sapienza University of Rome Rome Italy
Department of Biology and Biotechnology University of Pavia Pavia Italy
Department of Biology and Ecology University of Novi Sad Novi Sad Serbia
Department of Biology University of Antwerp Antwerp Belgium
Department of Biology University of Florence Sesto Fiorentino Italy
Department of Biology University of Graz Graz Austria
Department of Biosciences Biotechnology and Environment University of Bari Aldo Moro Bari Italy
Department of Biosciences Università degli Studi di Milano Milan Italy
Department of Ecology and Evolution University of Lausanne Lausanne Switzerland
Department of Ecology and Genetics Uppsala University Uppsala Sweden
Department of Fish Ecology and Evolution Eawag Kastanienbaum Switzerland
Department of Genetics University of Cambridge Cambridge UK
Department of Organismal Biology Universite libre de Bruxelles Brussels Belgium
Department of Zoology Faculty of Science Charles University Prague Czech Republic
Department of Zoology Faculty of Science Stockholm University Stockholm Sweden
Department of Zoology Hungarian Natural History Museum Budapest Hungary
Department of Zoology Institute of Ecology and Earth Sciences University of Tartu Tartu Estonia
Department of Zoology Stockholm University Stockholm Sweden
Department of Zoology Swedish Museum of Natural History Stockholm Sweden
DRESDEN concept Genome Center Dresden Germany
Ecology and Genetics Research Unit University of Oulu Oulu Finland
Ecology Evolution and Conservation Biology Department of Biology KU Leuven Leuven Belgium
ERISA Escola Superior de Saúde Ribeiro Sanches IPLUSO Lisboa Portugal
Evolutionary Biology Program Department of Ecology and Genetics Uppsala University Uppsala Sweden
Faculdade de Psicologia Universidade de Lisboa Lisboa Portugal
Faculty of Biology University of Warsaw Warsaw Poland
Faculty of Environmental Protection Velenje Slovenia
France Integrative Biology of Marine Models Station Biologique de Roscoff Roscoff France
Genomics Institute University of California Santa Cruz CA USA
Genoscope Institut François Jacob CEA Université Paris Saclay Evry France
Groupe de Recherche et d Etude pour la Gestion de l Environnement Villandraut France
Hellenic Centre for Marine Research Heraklion Crete Greece
HUN REN ELTE MTM Integrative Ecology Research Group Budapest Hungary
InBios Conservation Genetics Laboratory University of Liege Liege Belgium
Institut Botànic de Barcelona IBB Passeig del Migdia s n Parc de Montjüic Barcelona Spain
Institute for Bioinformatics and Medical Informatics University of Tubingen Tubingen Germany
Institute for Medical Genetics and Applied Genomics University of Tubingen Tubingen Germany
Institute for Nuclear Research of the NAS of Ukraine Kyiv Ukraine
Institute for Research in Biomedicine Barcelona Spain
Institute for Sustainable Plant Protection National Research Council Sesto Fiorentino Italy
Institute of Evolutionary Biology Barcelona Spain
Institute of Life and Environmental Sciences University of Iceland Reykjavik Iceland
Institute of Medical Genetics and Applied Genomics University of Tubingen Tubingen Germany
Institute of Microbiology of the Czech Academy of Sciences Praha Czech Republic
Institute of Zoology University of Cologne Cologne Germany
Laboratory of Biodiversity and Evolutionary Genomics KU Leuven Leuven Belgium
Leibniz Institut für Zoo und Wildtierforschung Berlin Germany
Leibniz Institute for the Analysis of Biodiversity Change Museum Koenig Bonn Bonn Germany
LOEWE Centre for Translational Biodiversity Genomics Frankfurt Germany
MARE Marine and Environmental Sciences Centre ARNET Aquatic Research Network Lisboa Portugal
Marine Animal Ecology Group Wageningen University and Research Wageningen The Netherlands
Max Planck Institute of Molecular Cell Biology and Genetics Dresden Germany
MHNC UP Natural History and Science Museum of the University of Porto Porto Portugal
MME BirdLife Hungary Budapest Hungary
Museu Nacional de História Natural e da Ciência Lisboa Portugal
Museum and Institute of Zoology Polish Academy of Sciences Warsaw Poland
National Biodiversity Future Center Palermo Italy
National Bioinformatics Infrastructure Sweden Uppsala Sweden
Natural History Museum University of Oslo Blindern Oslo Norway
Naturalis Biodiversity Center Leiden The Netherlands
Nature Research Centre Debrecen Hungary
Nature Research Centre Vilnius Lithuania
Naturhistorisches Museum Bern Bern Switzerland
Neuromics Support Facility Department of Biomedical Sciences University of Antwerp Antwerp Belgium
Neuromics Support Facility VIB Center for Molecular Neurology VIB Antwerp Belgium
Next Generation Sequencing Platform University of Bern Bern Switzerland
NGS Competence Center Tubingen Tubingen Germany
NGS Competence Center Tubingen University of Tubingen Tubingen Germany
NNF Center for Biosustainability Technical University of Denmark Kongens Lyngby Denmark
Portugal Centre for Ecology Evolution and Environmental Changes Lisbon Portugal
Ruder Boskovic Institute Zagreb Croatia
School of Biology and Environmental Science University College Dublin Belfield Ireland
Science and Research Centre Koper Koper Slovenia
Section for Ecology and Evolution Department of Biology University of Copenhagen Copenhagen Denmark
Senckenberg Research Institute Frankfurt Germany
Slovenian Museum of Natural History Ljubljana Slovenia
Sociedade Portuguesa de Botânica Lisbon Portugal
Sorbonne Université CNRS Biologie Intégrative des Organismes Marins Banyuls sur Mer France
Swiss Institute of Bioinformatics Lausanne Switzerland
The Earlham Institute Norwich Research Park Norwich UK
The Vertebrate Genome Laboratory The Rockefeller University New York NY USA
Tree of Life Wellcome Sanger Institute Hinxton Cambridge UK
UCD Conway Institute University College Dublin Belfield Ireland
UNESCO Chair Land Within Sea Biodiversity and Sustainability in Atlantic Islands Portugal
Universidade dos Acores Departamento de Biologia Ponta Delgada Portugal
Universitat de Barcelona Barcelona Spain
Universite Paris Saclay INRAE URGI Versailles France
University of Cologne Cologne Germany
University of Debrecen Centre for Agricultural Genomics and Biotechnology Debrecen Hungary
University of Eastern Finland Kuopio Finland
University of Ljubljana Biotechnical Faculty Department of Biology Ljubljana Slovenia
University of Maribor Faculty of Natural Sciences and Mathematics Maribor Slovenia
University of Namur Department of Biology URBE ILEE Namur Belgium
University of New Brunswick Saint John Saint John New Brunswick Canada
Uppsala University Uppsala Sweden
VU University Amsterdam Amsterdam The Netherlands
Wageningen University and Research Wageningen The Netherlands
Wellcome CRUK Gurdon Institute University of Cambridge Cambridge UK
Zobrazit více v PubMed
UNEP. Facts about the nature crisis. UNEP—UN Environment Programmehttps://www.unep.org/facts-about-nature-crisis (2022).
Zhang, Y., Wang, Z., Lu, Y. & Zuo, L. Editorial: biodiversity, ecosystem functions and services: Interrelationship with environmental and human health. Front. Ecol. Evol. 10, 10.3389/fevo.2022.1086408 (2022).
Urban, L. et al. Real-time genomics for One Health. Mol. Syst. Biol. 19, e11686 (2023). PubMed PMC
Kumar, S. et al. Changes in land use enhance the sensitivity of tropical ecosystems to fire-climate extremes. Sci. Rep.12, 964 (2022). PubMed PMC
IUCN. The IUCN Red List of Threatened Species Version 2022-2. The IUCN Red List of Threatened Specieshttps://www.iucnredlist.org.
IPBES. Summary for policymakers of the global assessment report on biodiversity and ecosystem services. 10.5281/zenodo.3553579 (2019).
Boehm, M. M. A. & Cronk, Q. C. B. Dark extinction: the problem of unknown historical extinctions. Biol. Lett.17, 2021 (2021). PubMed PMC
Supple, M. A. & Shapiro, B. Conservation of biodiversity in the genomics era. Genome Biol.19, 131 (2018). PubMed PMC
Formenti, G. et al. The era of reference genomes in conservation genomics. Trends Ecol. Evol.37, 197–202 (2022). PubMed
Theissinger, K. et al. How genomics can help biodiversity conservation. Trends Genet. 39, 545–559(2023). PubMed
Lewin, H. A. et al. Earth BioGenome Project: Sequencing life for the future of life. Proc. Natl Acad. Sci. Usa.115, 4325–4333 (2018). PubMed PMC
Crandall, E. D. et al. Importance of timely metadata curation to the global surveillance of genetic diversity. Conserv. Biol. 37, e14061 (2023). PubMed PMC
Samuel, S. & König-Ries, B. Understanding experiments and research practices for reproducibility: an exploratory study. PeerJ9, e11140 (2021). PubMed PMC
Buckner, J. C., Sanders, R. C., Faircloth, B. C. & Chakrabarty, P. The critical importance of vouchers in genomics. Elife10, e68264 (2021). PubMed PMC
Sabot, F. On the importance of metadata when sharing and opening data. BMC Genom. Data23, 79 (2022). PubMed PMC
Challis, R., Kumar, S., Sotero-Caio, C., Brown, M. & Blaxter, M. Genomes on a Tree (GoaT): A versatile, scalable search engine for genomic and sequencing project metadata across the eukaryotic tree of life. Wellcome Open Res.8, 24 (2023). PubMed PMC
Null, N. et al. Sequence locally, think globally: The Darwin Tree of Life Project. Proc. Natl Acad. Sci.119, e2115642118 (2022). PubMed PMC
Boytchev, H. Diversity in German science: researchers push for missing ethnicity data. Nature616, 22–24 (2023). PubMed
Stöck, M. et al. A brief review of vertebrate sex evolution with a pledge for integrative research: towards ‘sexomics’. Philos. Trans. R. Soc. Lond. B Biol. Sci.376, 20200426 (2021). PubMed PMC
Böhne, A. et al. Contextualising samples: Supporting reference genomes for European biodiversity through sample and associated metadata collection. npjbiodiversity10.1038/s44185-024-00053-7 (2024). PubMed PMC
Mc Cartney, A. M. et al. ERGA pilot project data sharing policy. 10.5281/ZENODO.8091290 (2021).
Martin, F. J. et al. Ensembl 2023. Nucleic Acids Res. 51, D933–D941 (2023). PubMed PMC
Larivière, D. et al. Scalable, accessible and reproducible reference genome assembly and evaluation in Galaxy. Nat. Biotechnol.42, 367–370 (2024). PubMed PMC
Mousseau, T. A. The biology of Chernobyl. Annu. Rev. Ecol. Evol. Syst.52, 87–109 (2021).
Mc Cartney, A. M. et al. Guidelines on the implementation of the Traditional Knowledge and Biocultural Labels and Notices in the European Reference Genome Atlas for biodiversity researchers. 10.5281/ZENODO.8088227 (2022).
Lawniczak, M. K. N. et al. Specimen and sample metadata standards for biodiversity genomics: a proposal from the Darwin Tree of Life project. Wellcome Open Res.7, 187 (2022). PubMed PMC
Leonard, J. A. et al. ERGA Sample Manifest Standard of Practice. https://github.com/ERGA-consortium/ERGA-sample-manifest.
Riginos, C. et al. Building a global genomics observatory: Using GEOME (the Genomic Observatories Metadatabase) to expedite and improve deposition and retrieval of genetic data and metadata for biodiversity research. Mol. Ecol. Resour.20, 1458–1469 (2020). PubMed
Liggins, L., Hudson, M. & Anderson, J. Creating space for Indigenous perspectives on access and benefit-sharing: encouraging researcher use of the Local Contexts Notices. Mol. Ecol.30, 2477–2482 (2021). PubMed
Mc Cartney, A. M. et al. Indigenous peoples and local communities as partners in the sequencing of global eukaryotic biodiversity. NPJ Biodivers.2, 1–12 (2023). PubMed PMC
Mc Cartney, A. M. et al. Balancing openness with Indigenous data sovereignty: an opportunity to leave no one behind in the journey to sequence all of life. Proc. Natl. Acad. Sci. USA. 119, e2115860119 (2022). PubMed PMC
Shaw, F. et al. COPO: a metadata platform for brokering FAIR data in the life sciences. F1000Res.9, 495 (2020).
Formenti, G., Fernandéz, J. M. & McCartney, A. M. Data download from the ERGA Pilot repository. 10.5281/ZENODO.8091687 (2021).
Mc Cartney, A. M., Formenti, G. & Mouton, A. ERGA Pilot Project Official Guidelines. 10.5281/zenodo.8319754 (2023).
Lawniczak, M. K. N. et al. Standards recommendations for the Earth BioGenome Project. Proc. Natl. Acad. Sci. USA. 119, e2115639118 (2022). PubMed PMC
Mc Cartney, A. M. et al. ERGA Pilot Project assembly recommendations. 10.5281/ZENODO.8088368 (2023).
Mc Cartney, A. M., Wood, J., Howe, K. & Formenti, G. ERGA Pilot Project post assembly quality control standards. 10.5281/ZENODO.8088393 (2022).
Howe, K. et al. Significantly improving the quality of genome assemblies through curation. Gigascience10, giaa153 (2021). PubMed PMC
Cunha, T. J., de Medeiros, B. A. S., Lord, A., Sørensen, M. V. & Giribet, G. Rampant loss of universal metazoan genes revealed by a chromosome-level genome assembly of the parasitic Nematomorpha. Curr. Biol.33, 3514–3521.e4 (2023). PubMed
Eleftheriadi, K. et al. The genome sequence of the Montseny horsehair worm, Gordionus montsenyensis sp. nov., a key resource to investigate Ecdysozoa evolution. Peer Community Journal, Volume 4, article no. e32. 10.24072/pcjournal.381 (2024).
Cunningham, F. et al. Ensembl 2022. Nucleic Acids Res.50, D988–D995 (2022). PubMed PMC
Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A. & Zdobnov, E. M. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol.38, 4647–4654 (2021). PubMed PMC
Gabriel, L. et al. BRAKER3: fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS and TSEBRA. bioRxiv 2023.06.10.544449 10.1101/2023.06.10.544449 (2023). PubMed PMC
United Nations Environment Programme. Convention on Biological Diversity. (Environmental Law and Institutions Programme Activity Centre, 1992).
CITES, Text of the Convention on International Trade in Endangered Species of Wild Fauna and Flora: signed March 3, 1973, entered into force July 1, 1975. (U.S. Fish and Wildlife Service, Office of Management Authority, 1993).
International treaty on plant genetic resources for food and agriculture. Food and Agriculture Organisation (2004).
Bassiouni, M. C. Convention on the Law of the Sea, UN Doc. A/Conf. 62-122 & Corr. 1--8; 1833 UNTS 397 (10 Dec. 1982). in International Terrorism: Multilateral Conventions (1937–2001) 101–103 (Brill Nijhoff, 2001).
Scholz, A. H. et al. Multilateral benefit-sharing from digital sequence information will support both science and biodiversity conservation. Nat. Commun.13, 1086 (2022). PubMed PMC
Tseng, M. et al. Strategies and support for Black, Indigenous, and people of colour in ecology and evolutionary biology. Nat. Ecol. Evol.4, 1288–1290 (2020). PubMed
Hickel, J., Dorninger, C., Wieland, H. & Suwandi, I. Imperialist appropriation in the world economy: Drain from the global South through unequal exchange, 1990–2015. Glob. Environ. Change73, 102467 (2022).
Holt, B. G. et al. An update of Wallace’s zoogeographic regions of the world. Science339, 74–78 (2013). PubMed
Ebenezer, T. E. et al. Africa: sequence 100,000 species to safeguard biodiversity. Nature603, 388–392 (2022). PubMed
Marques, J. P. et al. Building a Portuguese Coalition for Biodiversity Genomics. (2023). PubMed PMC
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data3, 160018 (2016). PubMed PMC
Carroll, S. R. et al. The CARE principles for indigenous data governance. Data Sci. J. 19, (2020). PubMed
Clarke, J. et al. Continuous base identification for single-molecule nanopore DNA sequencing. Nat. Nanotechnol.4, 265–270 (2009). PubMed
Loman, N. J., Quick, J. & Simpson, J. T. A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat. Methods12, 733–735 (2015). PubMed
Wenger, A. M. et al. Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome. Nat. Biotechnol.37, 1155–1162 (2019). PubMed PMC
Wang, Z., Gerstein, M. & Snyder, M. RNA-Seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet.10, 57–63 (2009). PubMed PMC
Mazzoni, C. J., Ciofi, C. & Waterhouse, R. M. Biodiversity: an atlas of European reference genomes. Nature619, 252 (2023). PubMed
Capella-Gutierrez, S. et al. ECCB2022: the 21st European Conference on Computational Biology. Bioinformatics38, ii1–ii4 (2022). PubMed
Boekhout, T. et al. Trends in yeast diversity discovery. Fungal Divers114, 491–537 (2022).
Medina-Córdova, N. et al. Biocontrol activity of the marine yeast Debaryomyces hansenii against phytopathogenic fungi and its ability to inhibit mycotoxins production in maize grain (Zea mays L.). Biol. Control97, 70–79 (2016).
Lourenço, J., Mendo, S. & Pereira, R. Radioactively contaminated areas: Bioindicator species and biomarkers of effect in an early warning scheme for a preliminary risk assessment. J. Hazard. Mater.317, 503–542 (2016). PubMed
Kesäniemi, J. et al. Exposure to environmental radionuclides associates with tissue-specific impacts on telomerase expression and telomere length. Sci. Rep.9, 850 (2019). PubMed PMC
Hardoim, P. R. et al. The hidden world within plants: ecological and evolutionary considerations for defining functioning of microbial endophytes. Microbiol. Mol. Biol. Rev.79, 293–320 (2015). PubMed PMC
Kolmogorov, M., Yuan, J., Lin, Y. & Pevzner, P. A. Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol.37, 540–546 (2019). PubMed
Hoff, K. J., Lomsadze, A., Borodovsky, M. & Stanke, M. Whole-genome annotation with BRAKER. Methods Mol. Biol.1962, 65–95 (2019). PubMed PMC