Nejvíce citovaný článek - PubMed ID 37953324
The Human Phenotype Ontology in 2024: phenotypes around the world
BACKGROUND: Use of artificial intelligence (AI) in rare diseases has grown rapidly in recent years. In this review we have outlined the most common machine-learning and deep-learning methods currently being used to classify and analyse large amounts of data, such as standardized images or specific text in electronic health records. To illustrate how these methods have been adapted or developed for use with rare diseases, we have focused on Fabry disease, an X-linked genetic disorder caused by lysosomal α-galactosidase. A deficiency that can result in multiple organ damage. METHODS: We searched PubMed for articles focusing on AI, rare diseases, and Fabry disease published anytime up to 08 January 2025. Further searches, limited to articles published between 01 January 2021 and 31 December 2023, were also performed using double combinations of keywords related to AI and each organ affected in Fabry disease, and AI and rare diseases. RESULTS: In total, 20 articles on AI and Fabry disease were included. In the rare disease field, AI methods may be applied prospectively to large populations to identify specific patients, or retrospectively to large data sets to diagnose a previously overlooked rare disease. Different AI methods may facilitate Fabry disease diagnosis, help monitor progression in affected organs, and potentially contribute to personalized therapy development. The implementation of AI methods in general healthcare and medical imaging centres may help raise awareness of rare diseases and prompt general practitioners to consider these conditions earlier in the diagnostic pathway, while chatbots and telemedicine may accelerate patient referral to rare disease experts. The use of AI technologies in healthcare may generate specific ethical risks, prompting new AI regulatory frameworks aimed at addressing these issues to be established in Europe and the United States. CONCLUSION: AI-based methods will lead to substantial improvements in the diagnosis and management of rare diseases. The need for a human guarantee of AI is a key issue in pursuing innovation while ensuring that human involvement remains at the centre of patient care during this technological revolution.
- Klíčová slova
- Artificial intelligence, Deep learning, Fabry disease, Machine learning, Rare diseases,
- MeSH
- Fabryho nemoc * diagnóza MeSH
- lidé MeSH
- umělá inteligence * MeSH
- vzácné nemoci * diagnóza MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- přehledy MeSH
BACKGROUND: Large language models (LLMs) are increasingly used in the medical field for diverse applications including differential diagnostic support. The estimated training data used to create LLMs such as the Generative Pretrained Transformer (GPT) predominantly consist of English-language texts, but LLMs could be used across the globe to support diagnostics if language barriers could be overcome. Initial pilot studies on the utility of LLMs for differential diagnosis in languages other than English have shown promise, but a large-scale assessment on the relative performance of these models in a variety of European and non-European languages on a comprehensive corpus of challenging rare-disease cases is lacking. METHODS: We created 4967 clinical vignettes using structured data captured with Human Phenotype Ontology (HPO) terms with the Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema. These clinical vignettes span a total of 378 distinct genetic diseases with 2618 associated phenotypic features. We used translations of the Human Phenotype Ontology together with language-specific templates to generate prompts in English, Chinese, Czech, Dutch, German, Italian, Japanese, Spanish, and Turkish. We applied GPT-4o, version gpt-4o-2024-08-06, to the task of delivering a ranked differential diagnosis using a zero-shot prompt. An ontology-based approach with the Mondo disease ontology was used to map synonyms and to map disease subtypes to clinical diagnoses in order to automate evaluation of LLM responses. FINDINGS: For English, GPT-4o placed the correct diagnosis at the first rank 19·8% and within the top-3 ranks 27·0% of the time. In comparison, for the eight non-English languages tested here the correct diagnosis was placed at rank 1 between 16·9% and 20·5%, within top-3 between 25·3% and 27·7% of cases. INTERPRETATION: The differential diagnostic performance of GPT-4o across a comprehensive corpus of rare-disease cases was consistent across the nine languages tested. This suggests that LLMs such as GPT-4o may have utility in non-English clinical settings. FUNDING: NHGRI 5U24HG011449 and 5RM1HG010860. P.N.R. was supported by a Professorship of the Alexander von Humboldt Foundation; P.L. was supported by a National Grant (PMP21/00063 ONTOPRECISC-III, Fondos FEDER).
- Publikační typ
- časopisecké články MeSH
- preprinty MeSH
The Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema was released in 2022 and approved by ISO as a standard for sharing clinical and genomic information about an individual, including phenotypic descriptions, numerical measurements, genetic information, diagnoses, and treatments. A phenopacket can be used as an input file for software that supports phenotype-driven genomic diagnostics and for algorithms that facilitate patient classification and stratification for identifying new diseases and treatments. There has been a great need for a collection of phenopackets to test software pipelines and algorithms. Here, we present Phenopacket Store. Phenopacket Store v.0.1.19 includes 6,668 phenopackets representing 475 Mendelian and chromosomal diseases associated with 423 genes and 3,834 unique pathogenic alleles curated from 959 different publications. This represents the first large-scale collection of case-level, standardized phenotypic information derived from case reports in the literature with detailed descriptions of the clinical data and will be useful for many purposes, including the development and testing of software for prioritizing genes and diseases in diagnostic genomics, machine learning analysis of clinical phenotype data, patient stratification, and genotype-phenotype correlations. This corpus also provides best-practice examples for curating literature-derived data using the GA4GH Phenopacket Schema.
- Klíčová slova
- global alliance for genomics and health, human phenotype ontology, phenopacket schema,
- MeSH
- databáze genetické MeSH
- fenotyp * MeSH
- genomika * metody MeSH
- lidé MeSH
- software MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
The Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema was released in 2022 and approved by ISO as a standard for sharing clinical and genomic information about an individual, including phenotypic descriptions, numerical measurements, genetic information, diagnoses, and treatments. A phenopacket can be used as an input file for software that supports phenotype-driven genomic diagnostics and for algorithms that facilitate patient classification and stratification for identifying new diseases and treatments. There has been a great need for a collection of phenopackets to test software pipelines and algorithms. Here, we present phenopacket-store. Version 0.1.12 of phenopacket-store includes 4916 phenopackets representing 277 Mendelian and chromosomal diseases associated with 236 genes, and 2872 unique pathogenic alleles curated from 605 different publications. This represents the first large-scale collection of case-level, standardized phenotypic information derived from case reports in the literature with detailed descriptions of the clinical data and will be useful for many purposes, including the development and testing of software for prioritizing genes and diseases in diagnostic genomics, machine learning analysis of clinical phenotype data, patient stratification, and genotype-phenotype correlations. This corpus also provides best-practice examples for curating literature-derived data using the GA4GH Phenopacket Schema.
- Publikační typ
- časopisecké články MeSH
- preprinty MeSH