The aim of this study was to test whether the availability of internal imagery elicited by words is related to ratings of word imageability. Participants are presented with target words and, after a delay allowing for processing of the word, answer questions regarding the size or weight of the word referents. Target words differ with respect to imageability. Results show faster responses to questions for high imageability words than for low imageability words. The type of question (size/weight) modulates reaction times suggesting a dominance of the visual domain over the physical-experience domain in concept representation. Results hold across two different languages (Czech/German). These findings provide further insights into the representations underlying word meaning and the role of word imageability in language acquisition and processing.
- MeSH
- Adult MeSH
- Imagination * physiology MeSH
- Humans MeSH
- Young Adult MeSH
- Reaction Time * physiology MeSH
- Semantics MeSH
- Vocabulary MeSH
- Photic Stimulation MeSH
- Check Tag
- Adult MeSH
- Humans MeSH
- Young Adult MeSH
- Male MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
BACKGROUND: The hippocampal representation of space, formed by the collective activity of populations of place cells, is considered as a substrate of spatial memory. Alzheimer's disease (AD), a widespread severe neurodegenerative condition of multifactorial origin, typically exhibits spatial memory deficits among its early clinical signs before more severe cognitive impacts develop. OBJECTIVE: To investigate mechanisms of spatial memory impairment in a double transgenic rat model of AD. METHODS: In this study, we utilized 9-12-month-old double-transgenic TgF344-AD rats and age-matched controls to analyze the spatial coding properties of CA1 place cells. We characterized the spatial memory representation, assessed cells' spatial information content and direction-specific activity, and compared their population coding in familiar and novel conditions. RESULTS: Our findings revealed that TgF344-AD animals exhibited lower precision in coding, as evidenced by reduced spatial information and larger receptive zones. This impairment was evident in maps representing novel environments. While controls instantly encoded directional context during their initial exposure to a novel environment, transgenics struggled to incorporate this information into the newly developed hippocampal spatial representation. This resulted in impairment in orthogonalization of stored activity patterns, an important feature directly related to episodic memory encoding capacity. CONCLUSIONS: Overall, the results shed light on the nature of impairment at both the single-cell and population levels in the transgenic AD model. In addition to the observed spatial coding inaccuracy, the findings reveal a significantly impaired ability to adaptively modify and refine newly stored hippocampal memory patterns.
- MeSH
- Alzheimer Disease * physiopathology MeSH
- Amyloid beta-Protein Precursor genetics MeSH
- CA1 Region, Hippocampal physiopathology MeSH
- Hippocampus physiopathology MeSH
- Rats MeSH
- Humans MeSH
- Disease Models, Animal * MeSH
- Memory Disorders etiology physiopathology MeSH
- Rats, Inbred F344 MeSH
- Rats, Transgenic * MeSH
- Spatial Memory physiology MeSH
- Animals MeSH
- Check Tag
- Rats MeSH
- Humans MeSH
- Male MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. In biomedical image analysis, chosen performance metrics often do not reflect the domain interest, and thus fail to adequately measure scientific progress and hinder translation of ML techniques into practice. To overcome this, we created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Developed by a large international consortium in a multistage Delphi process, it is based on the novel concept of a problem fingerprint-a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), dataset and algorithm output. On the basis of the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as classification tasks at image, object or pixel level, namely image-level classification, object detection, semantic segmentation and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. Its applicability is demonstrated for various biomedical use cases.
- MeSH
- Algorithms * MeSH
- Image Processing, Computer-Assisted * MeSH
- Semantics MeSH
- Machine Learning MeSH
- Publication type
- Journal Article MeSH
- Review MeSH
BACKGROUND: Artificial intelligence (AI) has advanced substantially in recent years, transforming many industries and improving the way people live and work. In scientific research, AI can enhance the quality and efficiency of data analysis and publication. However, AI has also opened up the possibility of generating high-quality fraudulent papers that are difficult to detect, raising important questions about the integrity of scientific research and the trustworthiness of published papers. OBJECTIVE: The aim of this study was to investigate the capabilities of current AI language models in generating high-quality fraudulent medical articles. We hypothesized that modern AI models can create highly convincing fraudulent papers that can easily deceive readers and even experienced researchers. METHODS: This proof-of-concept study used ChatGPT (Chat Generative Pre-trained Transformer) powered by the GPT-3 (Generative Pre-trained Transformer 3) language model to generate a fraudulent scientific article related to neurosurgery. GPT-3 is a large language model developed by OpenAI that uses deep learning algorithms to generate human-like text in response to prompts given by users. The model was trained on a massive corpus of text from the internet and is capable of generating high-quality text in a variety of languages and on various topics. The authors posed questions and prompts to the model and refined them iteratively as the model generated the responses. The goal was to create a completely fabricated article including the abstract, introduction, material and methods, discussion, references, charts, etc. Once the article was generated, it was reviewed for accuracy and coherence by experts in the fields of neurosurgery, psychiatry, and statistics and compared to existing similar articles. RESULTS: The study found that the AI language model can create a highly convincing fraudulent article that resembled a genuine scientific paper in terms of word usage, sentence structure, and overall composition. The AI-generated article included standard sections such as introduction, material and methods, results, and discussion, as well a data sheet. It consisted of 1992 words and 17 citations, and the whole process of article creation took approximately 1 hour without any special training of the human user. However, there were some concerns and specific mistakes identified in the generated article, specifically in the references. CONCLUSIONS: The study demonstrates the potential of current AI language models to generate completely fabricated scientific articles. Although the papers look sophisticated and seemingly flawless, expert readers may identify semantic inaccuracies and errors upon closer inspection. We highlight the need for increased vigilance and better detection methods to combat the potential misuse of AI in scientific research. At the same time, it is important to recognize the potential benefits of using AI language models in genuine scientific writing and research, such as manuscript preparation and language editing.
- MeSH
- Algorithms * MeSH
- Data Analysis MeSH
- Language MeSH
- Humans MeSH
- Semantics MeSH
- Artificial Intelligence * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
This paper provides an overview of current linguistic and ontological challenges which have to be met in order to provide full support to the transformation of health ecosystems in order to meet precision medicine (5 PM) standards. It highlights both standardization and interoperability aspects regarding formal, controlled representations of clinical and research data, requirements for smart support to produce and encode content in a way that humans and machines can understand and process it. Starting from the current text-centered communication practices in healthcare and biomedical research, it addresses the state of the art in information extraction using natural language processing (NLP). An important aspect of the language-centered perspective of managing health data is the integration of heterogeneous data sources, employing different natural languages and different terminologies. This is where biomedical ontologies, in the sense of formal, interchangeable representations of types of domain entities come into play. The paper discusses the state of the art of biomedical ontologies, addresses their importance for standardization and interoperability and sheds light to current misconceptions and shortcomings. Finally, the paper points out next steps and possible synergies of both the field of NLP and the area of Applied Ontology and Semantic Web to foster data interoperability for 5 PM.
- Publication type
- Journal Article MeSH
To improve patient outcomes after trauma, the need to decrypt the post-traumatic immune response has been identified. One prerequisite to drive advancement in understanding that domain is the implementation of surgical biobanks. This paper focuses on the outcomes of patients with one of two diagnoses: post-traumatic arthritis and osteomyelitis. In creating surgical biobanks, currently, many obstacles must be overcome. Roadblocks exist around scoping of data that is to be collected, and the semantic integration of these data. In this paper, the generic component model and the Semantic Web technology stack are used to solve issues related to data integration. The results are twofold: (a) a scoping analysis of data and the ontologies required to harmonize and integrate it, and (b) resolution of common data integration issues in integrating data relevant to trauma surgery.
- Publication type
- Journal Article MeSH
Multiple studies have investigated bibliometric factors predictive of the citation count a research article will receive. In this article, we go beyond bibliometric data by using a range of machine learning techniques to find patterns predictive of citation count using both article content and available metadata. As the input collection, we use the CORD-19 corpus containing research articles-mostly from biology and medicine-applicable to the COVID-19 crisis. Our study employs a combination of state-of-the-art machine learning techniques for text understanding, including embeddings-based language model BERT, several systems for detection and semantic expansion of entities: ConceptNet, Pubtator and ScispaCy. To interpret the resulting models, we use several explanation algorithms: random forest feature importance, LIME, and Shapley values. We compare the performance and comprehensibility of models obtained by "black-box" machine learning algorithms (neural networks and random forests) with models built with rule learning (CORELS, CBA), which are intrinsically explainable. Multiple rules were discovered, which referred to biomedical entities of potential interest. Of the rules with the highest lift measure, several rules pointed to dipeptidyl peptidase4 (DPP4), a known MERS-CoV receptor and a critical determinant of camel to human transmission of the camel coronavirus (MERS-CoV). Some other interesting patterns related to the type of animal investigated were found. Articles referring to bats and camels tend to draw citations, while articles referring to most other animal species related to coronavirus are lowly cited. Bat coronavirus is the only other virus from a non-human species in the betaB clade along with the SARS-CoV and SARS-CoV-2 viruses. MERS-CoV is in a sister betaC clade, also close to human SARS coronaviruses. Thus both species linked to high citation counts harbor coronaviruses which are more phylogenetically similar to human SARS viruses. On the other hand, feline (FIPV, FCOV) and canine coronaviruses (CCOV) are in the alpha coronavirus clade and more distant from the betaB clade with human SARS viruses. Other results include detection of apparent citation bias favouring authors with western sounding names. Equal performance of TF-IDF weights and binary word incidence matrix was observed, with the latter resulting in better interpretability. The best predictive performance was obtained with a "black-box" method-neural network. The rule-based models led to most insights, especially when coupled with text representation using semantic entity detection methods. Follow-up work should focus on the analysis of citation patterns in the context of phylogenetic trees, as well on patterns referring to DPP4, which is currently considered as a SARS-Cov-2 therapeutic target.
- Publication type
- Journal Article MeSH
Humans can memorize and later recognize many objects and complex scenes. In this study, we prepared large photographs and presented participants with only partial views to test the fidelity of their memories. The unpresented parts of the photographs were used as a source of distractors with similar semantic and perceptual information. Additionally, we presented overlapping views to determine whether the second presentation provided a memory advantage for later recognition tests. Experiment 1 (N = 28) showed that while people were good at recognizing presented content and identifying new foils, they showed a remarkable level of uncertainty about foils selected from the unseen parts of presented photographs (false alarm, 59%). The recognition accuracy was higher for the parts that were shown twice, irrespective of whether the same identical photograph was viewed twice or whether two photographs with overlapping content were observed. In Experiment 2 (N = 28), the memorability of the large image was estimated by a pre-trained deep neural network. Neither the recognition accuracy for an image part nor the tendency for false alarms correlated with the memorability. Finally, in Experiment 3 (N = 21), we repeated the experiment while measuring eye movements. Fixations were biased toward the center of the original large photograph in the first presentation, and this bias was repeated during the second presentation in both identical and overlapping views. Altogether, our experiments show that people recognize parts of remembered photographs, but they find it difficult to reject foils from unseen parts, suggesting that their memory representation is not sufficiently detailed to rule them out as distractors.
- Publication type
- Journal Article MeSH
INTRODUCTION: According to the strong version of the orthographic depth hypothesis, in languages with transparent letter-sound mappings (shallow orthographies) the reading of both familiar words and unfamiliar nonwords may be accomplished by a sublexical pathway that relies on serial grapheme-to-phoneme conversion. However, in languages such as English characterized by inconsistent letter-sound relationships (deep orthographies), word reading is mediated by a lexical-semantic pathway that relies on mappings between word-specific orthographic, semantic, and phonological representations, whereas the sublexical pathway is used primarily to read nonwords. METHODS: In this study, we used functional magnetic resonance imaging to elucidate neural substrates of reading in Czech, a language characterized by a shallo worthography. Specifically, we contrasted patterns of brain activation and connectivity during word and nonword reading to determine whether similar or different neural mechanisms are involved. Neural correlates were measured as differences in simple whole-brain voxel-wise activation, and differences in visual word form area (VWFA) task-related connectivity were computed on the group level from data of 24 young subject. Trial-to-trial reading reaction times were used as a measure of task difficulty, and these effects were subtracted from the activation and connectivity effects in order to eliminate difference in cognitive effort which is naturally higher for nonwords and may mask the true lexicality effects. RESULTS: We observed pattern of activity well described in the literature mostly derived from data of English speakers - nonword reading (as compared to word reading) activated the sublexical pathway to a greater extent whereas word reading was associated with greater activation of semantic networks. VWFA connectivity analysis also revealed stronger connectivity to a component of the sublexical pathway - left inferior frontal gyrus (IFG), for nonword compared to word reading. DISCUSSION: These converging results suggest that the brain mechanism of skilled reading in shallow orthography languages are similar to those engaged when reading in languages with a deep orthography and are supported by a universal dual-pathway neural architecture.
- Publication type
- Journal Article MeSH
Automated sentiment analysis is becoming increasingly recognized due to the growing importance of social media and e-commerce platform review websites. Deep neural networks outperform traditional lexicon-based and machine learning methods by effectively exploiting contextual word embeddings to generate dense document representation. However, this representation model is not fully adequate to capture topical semantics and the sentiment polarity of words. To overcome these problems, a novel sentiment analysis model is proposed that utilizes richer document representations of word-emotion associations and topic models, which is the main computational novelty of this study. The sentiment analysis model integrates word embeddings with lexicon-based sentiment and emotion indicators, including negations and emoticons, and to further improve its performance, a topic modeling component is utilized together with a bag-of-words model based on a supervised term weighting scheme. The effectiveness of the proposed model is evaluated using large datasets of Amazon product reviews and hotel reviews. Experimental results prove that the proposed document representation is valid for the sentiment analysis of product and hotel reviews, irrespective of their class imbalance. The results also show that the proposed model improves on existing machine learning methods.
- MeSH
- Algorithms * MeSH
- Emotions MeSH
- Humans MeSH
- Neural Networks, Computer * MeSH
- Semantics MeSH
- Machine Learning MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH