-
Je něco špatně v tomto záznamu ?
Understanding Health Records in West Slavic Languages: Available Resources, Case Study in Oncology
K. Anetta
Jazyk angličtina Země Nizozemsko
Typ dokumentu časopisecké články
PubMed
37386967
DOI
10.3233/shti230433
Knihovny.cz E-zdroje
- MeSH
- farmaceutické databáze * MeSH
- jazyk (prostředek komunikace) * MeSH
- lékařská onkologie MeSH
- lidé MeSH
- mezinárodní klasifikace nemocí MeSH
- znalosti MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
Currently, there is very little research aimed at developing medical knowledge extraction tools for major West Slavic languages (Czech, Polish, and Slovak). This project lays the groundwork for a general medical knowledge extraction pipeline, introducing the resource vocabularies available for the respective languages (UMLS resources, ICD-10 translations and national drug databases). It demonstrates the utility of this approach on a case study using a large proprietary corpus of Czech oncology records consisting of more than 40 million words written about more than 4,000 patients. After correlating MedDRA terms found in patients' records with drugs prescribed to them, significant non-obvious associations were found between selected medical conditions being mentioned and the probability of certain drugs being prescribed over the course of the patient's treatment, in some cases increasing the probability of prescriptions by over 250%. This direction of research, producing large amounts of annotated data, is a prerequisite for training deep learning models and predictive systems.
Citace poskytuje Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc23017045
- 003
- CZ-PrNML
- 005
- 20231026105414.0
- 007
- ta
- 008
- 231013s2023 ne f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.3233/SHTI230433 $2 doi
- 035 __
- $a (PubMed)37386967
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a ne
- 100 1_
- $a Anetta, Kristof $u NLP Centre, Faculty of Informatics, Masaryk University Brno, Czech Republic
- 245 10
- $a Understanding Health Records in West Slavic Languages: Available Resources, Case Study in Oncology / $c K. Anetta
- 520 9_
- $a Currently, there is very little research aimed at developing medical knowledge extraction tools for major West Slavic languages (Czech, Polish, and Slovak). This project lays the groundwork for a general medical knowledge extraction pipeline, introducing the resource vocabularies available for the respective languages (UMLS resources, ICD-10 translations and national drug databases). It demonstrates the utility of this approach on a case study using a large proprietary corpus of Czech oncology records consisting of more than 40 million words written about more than 4,000 patients. After correlating MedDRA terms found in patients' records with drugs prescribed to them, significant non-obvious associations were found between selected medical conditions being mentioned and the probability of certain drugs being prescribed over the course of the patient's treatment, in some cases increasing the probability of prescriptions by over 250%. This direction of research, producing large amounts of annotated data, is a prerequisite for training deep learning models and predictive systems.
- 650 _2
- $a lidé $7 D006801
- 650 12
- $a jazyk (prostředek komunikace) $7 D007802
- 650 12
- $a farmaceutické databáze $7 D062313
- 650 _2
- $a mezinárodní klasifikace nemocí $7 D038801
- 650 _2
- $a znalosti $7 D019359
- 650 _2
- $a lékařská onkologie $7 D008495
- 655 _2
- $a časopisecké články $7 D016428
- 773 0_
- $w MED00180836 $t Studies in health technology and informatics $x 1879-8365 $g Roč. 305, č. - (2023), s. 97-101
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/37386967 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y - $z 0
- 990 __
- $a 20231013 $b ABA008
- 991 __
- $a 20231026105409 $b ABA008
- 999 __
- $a ok $b bmc $g 2000526 $s 1203407
- BAS __
- $a 3
- BAS __
- $a PreBMC-MEDLINE
- BMC __
- $a 2023 $b 305 $c - $d 97-101 $e 2023Jun29 $i 1879-8365 $m Studies in health technology and informatics $n Stud Health Technol Inform $x MED00180836
- LZP __
- $a Pubmed-20231013