One of the most common statistical analyses in experimental psychology concerns the comparison of two means using the frequentist t test. However, frequentist t tests do not quantify evidence and require various assumption tests. Recently, popularized Bayesian t tests do quantify evidence, but these were developed for scenarios where the two populations are assumed to have the same variance. As an alternative to both methods, we outline a comprehensive t test framework based on Bayesian model averaging. This new t test framework simultaneously takes into account models that assume equal and unequal variances, and models that use t-likelihoods to improve robustness to outliers. The resulting inference is based on a weighted average across the entire model ensemble, with higher weights assigned to models that predicted the observed data well. This new t test framework provides an integrated approach to assumption checks and inference by applying a series of pertinent models to the data simultaneously rather than sequentially. The integrated Bayesian model-averaged t tests achieve robustness without having to commit to a single model following a series of assumption checks. To facilitate practical applications, we provide user-friendly implementations in JASP and via the RoBTT package in R . A tutorial video is available at https://www.youtube.com/watch?v=EcuzGTIcorQ.
- MeSH
- Bayes Theorem MeSH
- Psychology, Experimental * methods MeSH
- Data Interpretation, Statistical MeSH
- Humans MeSH
- Models, Statistical * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Review MeSH
INTRODUCTION: The objective of this study was to assess the impact of the Enhanced Recovery After Surgery (ERAS) programme implementation on treatment costs at a university-type centre, using the DRG scheme. MATERIALS AND METHODS: Retrospective analysis of patients' data in a group of 604 individuals enroled in the study. We evaluated three groups of patients according to the ERAS clinical protocol (CP): (1) CP oncogynaecology, (2) CP simple hysterectomy, (3) CP laparoscopy. The study aimed to evaluate the impact on the length of stay (LOS), savings in bed-days, and the reduction in direct treatment costs. Three parameters-antibiotic consumption, blood derivative consumption and laboratory test costs-were chosen to compare direct treatment costs. The statistical significance of the difference in the observed parameters was tested by a two-sample unpaired t test with unequal variances at the 0.05 significance level. RESULTS: We analysed data from 604 patients. In all three groups, the length of stay (LOS) was significantly reduced. The most significant reduction was observed in the CP oncogynaecology group, where the LOS was reduced from 11.1 days to 6.8 days (2022) and 7.6 days (2023) compared to 2019 (p < 0.05). Furthermore, there was a notable reduction in inpatient bed-days, which resulted in the capacity being made available to admit additional patients. A statistically significant reduction in direct costs was observed in the group of CP hysterectomy (antibiotic use) and in the CP laparoscopy (laboratory test costs). CONCLUSIONS: The implementation of the ERAS principles resulted in a number of significant positive economic impacts-reduction in the LOS and a corresponding increase in bed capacity for new patients. Additionally, direct treatment costs, including those related to antibiotic use or laboratory testing were reduced. The Czech Republic's acute healthcare system, like the majority of European healthcare systems, is financed by the DRG system. This flat-rate payment per patient encourages hospital management to seek cost-reduction strategies. The results of our study indicate that fast-track protocols represent a potential viable approach to reducing the cost of treatment while simultaneously meeting the recommendations of evidence-based medicine.
- MeSH
- Length of Stay statistics & numerical data economics MeSH
- Adult MeSH
- Gynecologic Surgical Procedures * economics MeSH
- Hysterectomy economics MeSH
- Diagnosis-Related Groups MeSH
- Laparoscopy economics MeSH
- Middle Aged MeSH
- Humans MeSH
- Health Care Costs * statistics & numerical data MeSH
- Retrospective Studies MeSH
- Aged MeSH
- Enhanced Recovery After Surgery * MeSH
- Check Tag
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
OBJECTIVE: This scoping review aims to identify, catalogue, and characterize previously reported tools, techniques, methods, and processes that have been recommended or used by evidence synthesizers to detect fraudulent or erroneous data and mitigate its impact. INTRODUCTION: Decision-making for policy and practice should always be underpinned by the best available evidence-typically peer-reviewed scientific literature. Evidence synthesis literature should be collated and organized using the appropriate evidence synthesis methodology, best exemplified by the role systematic reviews play in evidence-based health care. However, with the rise of "predatory journals," fraudulent or erroneous data may be invading this literature, which may negatively affect evidence syntheses that use this data. This, in turn, may compromise decision-making processes. INCLUSION CRITERIA: This review will include peer-reviewed articles, commentaries, books, and editorials that describe at least 1 tool, technique, method, or process with the explicit purpose of identifying or mitigating the impact of fraudulent or erroneous data for any evidence synthesis, in any topic area. Manuals, handbooks, and guidance from major organizations, universities, and libraries will also be considered. METHODS: This review will be conducted using the JBI methodology for scoping reviews and reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR). Databases and relevant organizational websites will be searched for eligible studies. Title and abstract, and, subsequently, full-text screening will be conducted in duplicate. Data from identified full texts will be extracted using a pre-determined checklist, while the findings will be summarized descriptively and presented in tables. REVIEW REGISTRATION: Open Science Framework https://osf.io/u8yrn.
- MeSH
- Humans MeSH
- Fraud prevention & control MeSH
- Systematic Reviews as Topic MeSH
- Scientific Misconduct * MeSH
- Research Design standards MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Review MeSH
BACKGROUND AND OBJECTIVE: While prostate cancer (PCa) incidence and mortality rates continue to rise, early detection of PCa remains highly controversial, and the research landscape is rapidly evolving. Existing systematic reviews (SRs) and meta-analyses (MAs) provide valuable insights, but often focus on single aspects of early detection, hindering a comprehensive understanding of the topic. We aim to fill this gap by providing a comprehensive SR of contemporary SRs covering different aspects of early detection of PCa in the European Union (EU) and the UK. METHODS: On June 1, 2023, we searched four databases (Medline ALL via Ovid, Embase, Web of Science, and Cochrane Central Register of Controlled Trials) and Google Scholar. To avoid repetition of previous studies, only SRs (qualitative, quantitative, and/or MAs) were considered eligible. In the data, common themes were identified to present the evidence systematically. KEY FINDINGS AND LIMITATIONS: We identified 1358 citations, resulting in 26 SRs eligible for inclusion. Six themes were identified: (1) invitation: men at general risk should be invited at >50 yr of age, and testing should be discontinued at >70 yr or with <10 yr of life expectancy; (2) decision-making: most health authorities discourage population-based screening and instead recommend a shared decision-making (SDM) approach, but implementation of SDM in clinical practice varies widely; decision aids help men make more informed and value-consistent screening decisions and decrease men's intention to attempt screening, but these do not affect screening uptake; (3) acceptance: facilitators for men considering screening include social prompting by partners and clinician recommendations, while barriers include a lack of knowledge, low-risk perception, and masculinity attributes; (4) screening test and algorithm: prostate-specific antigen-based screening reduces PCa-specific mortality and metastatic disease in men aged 55-69 yr at randomisation if screened at least twice; (5) harms and benefits: these benefits come at the cost of unnecessary biopsies, overdiagnosis, and subsequent overtreatment; and (6) future of screening: risk-adapted screening including (prebiopsy) risk calculators, magnetic resonance imaging, and blood- and urine-based biomarkers could reduce these harms. To enable a comprehensive overview, we focused on SRs. These do not include the most recent prospective studies, which were therefore incorporated in the discussion. CONCLUSIONS AND CLINICAL IMPLICATIONS: By identifying consistent and conflicting evidence, this review highlights the evidence-based foundations that can be built upon, as well as areas requiring further research and improvement to reduce the burden of PCa in the EU and UK. PATIENT SUMMARY: This review of 26 reviews covers various aspects of prostate cancer screening such as invitation, decision-making, screening tests, harms, and benefits. This review provides insights into existing evidence, highlighting the areas of consensus and discrepancies, to guide future research and improve prostate cancer screening strategies in Europe.
- MeSH
- Early Detection of Cancer * MeSH
- European Union * MeSH
- Humans MeSH
- Prostatic Neoplasms * diagnosis MeSH
- Check Tag
- Humans MeSH
- Male MeSH
- Publication type
- Journal Article MeSH
- Review MeSH
- Geographicals
- United Kingdom MeSH
Cohort studies are a robust analytical observational study design that explore the difference in outcomes between two cohorts, differentiated by their exposure status. Despite being observational in nature, they are often included in systematic reviews of effectiveness, particularly when randomized controlled trials are limited or not feasible. Like all studies included in a systematic review, cohort studies must undergo a critical appraisal process to assess the extent to which a study has considered potential bias in its design, conduct, or analysis. Critical appraisal tools facilitate this evaluation. This paper introduces the revised critical appraisal tool for cohort studies, completed by the JBI Effectiveness Methodology Group, who are currently revising the suite of JBI critical appraisal tools for quantitative study designs. The revised tool responds to updates in methodological guidance from the Grading of Recommendations, Assessment, Development and Evaluation (GRADE) Working Group and reporting guidance from PRISMA 2020, providing a robust framework for evaluating risk of bias in a cohort study. Transparent and rigorous assessment using this tool will assist reviewers in understanding the validity and relevance of the results and conclusions drawn from a systematic review that includes cohort studies. This may contribute to better evidence-based decision-making in health care. This paper discusses the key changes made to the tool, outlines justifications for these changes, and provides practical guidance on how this tool should be interpreted and applied by systematic reviewers.
- MeSH
- Cohort Studies MeSH
- Humans MeSH
- Research Design * standards MeSH
- Bias * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
The UPPS-P Impulsive Behavior Model and the various psychometric instruments developed and validated based on this model are well established in clinical and research settings. However, evidence regarding the psychometric validity, reliability, and equivalence across multiple countries of residence, languages, or gender identities, including gender-diverse individuals, is lacking to date. Using data from the International Sex Survey (N = 82,243), confirmatory factor analyses and measurement invariance analyses were performed on the preestablished five-factor structure of the 20-item short version of the UPPS-P Impulsive Behavior Scale to examine whether (a) psychometric validity and reliability and (b) psychometric equivalence hold across 34 country-of-residence-related, 22 language-related, and three gender-identity-related groups. The results of the present study extend the latter psychometric instrument's well-established relevance to 26 countries, 13 languages, and three gender identities. Most notably, psychometric validity and reliability were evidenced across nine novel translations included in the present study (i.e., Croatian, English, German, Hebrew, Korean, Macedonian, Polish, Portuguese-Portugal, and Spanish-Latin American) and psychometric equivalence was evidenced across all three gender identities included in the present study (i.e., women, men, and gender-diverse individuals).
- MeSH
- Adult MeSH
- Factor Analysis, Statistical MeSH
- Gender Identity * MeSH
- Impulsive Behavior * MeSH
- Language MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Young Adult MeSH
- Surveys and Questionnaires MeSH
- Psychometrics MeSH
- Reproducibility of Results MeSH
- Cross-Cultural Comparison MeSH
- Check Tag
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Young Adult MeSH
- Male MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Validation Study MeSH
BACKGROUND: Effective diabetes management requires a multimodal approach involving lifestyle changes, pharmacological treatment, and continuous patient education. Self-management demands can be overwhelming for patients, leading to lowered motivation, poor adherence, and compromised therapeutic outcomes. In this context, digital health apps are emerging as vital tools to provide personalized support and enhance diabetes management and clinical outcomes. OBJECTIVE: This study evaluated the impact of the digital health application Vitadio on glycemic control in patients with type 2 diabetes mellitus (T2DM). Secondary objectives included evaluating its effects on cardiometabolic parameters (weight, BMI, waist circumference, blood pressure, and heart rate) and self-reported measures of diabetes distress and self-management. METHODS: In this 6-month, 2-arm, multicenter, unblinded randomized controlled trial, patients aged 18 years or older diagnosed with T2DM were randomly assigned (1:1) to an intervention group (IG) receiving standard diabetes care reinforced by the digital health app Vitadio or to a control group (CG) provided solely with standard diabetes care. Vitadio provided a mobile-based self-management support tool featuring educational modules, motivational messages, peer support, personalized goal setting, and health monitoring. The personal consultant was available in the app to provide technical support for app-related issues. The primary outcome, assessed in the intention-to-treat population, was a change in glycated hemoglobin (HbA1c) levels at 6 months. Secondary outcomes included changes in cardiometabolic measures and self-reported outcomes. Data were collected in 2 study centers: diabetologist practice in Dessau-Roßlau and the University of Dresden. RESULTS: Between November 2022 and June 2023, a total of 276 patients were screened for eligibility, with 149 randomized to in intervention group (IG; n=73) and a control group (CG; n=76). The majority of participants were male (91/149, 61%). The dropout rate at month 6 was 19% (121/149). While both groups achieved significant HbA1c reduction at 6 months (IG: mean -0.8, SD 0.9%, P<.001; CG: mean -0.3, SD 0.7%, P=.001), the primary confirmatory analysis revealed statistically significant advantage of the IG (adjusted mean difference: -0.53%, SD 0.15, 95% CI -0.24 to -0.82; P<.001; effect size [Cohen d]=0.67, 95% CI 0.33-1). Significant between-group differences in favor of the IG were also observed for weight loss (P=.002), BMI (P=.001) and systolic blood pressure (P<.03). In addition, Vitadio users experienced greater reduction in diabetes-related distress (P<.03) and obtained more pronounced improvements in self-care practices in the areas of general diet (P<.001), specific diet (P<.03), and exercise (P<.03). CONCLUSIONS: This trial provides evidence for the superior efficacy of Vitadio in lowering the HbA1c levels in T2DM patients compared to standard care. In addition, Vitadio contributed to improvements in cardiometabolic health, reduced diabetes-related distress, and enhanced self-management, highlighting its potential as an accessible digital tool for comprehensive diabetes management. TRIAL REGISTRATION: German Clinical Trials Registry DRKS00027405; https://drks.de/search/de/trial/DRKS00027405.
- MeSH
- Diabetes Mellitus, Type 2 * blood therapy MeSH
- Adult MeSH
- Glycated Hemoglobin analysis MeSH
- Middle Aged MeSH
- Humans MeSH
- Mobile Applications * MeSH
- Self Care MeSH
- Self-Management MeSH
- Aged MeSH
- Telemedicine MeSH
- Check Tag
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Multicenter Study MeSH
- Randomized Controlled Trial MeSH
JBI has long held the view that an inclusive approach to the conceptualization of what counts as evidence is important to the evidence-based movement. JBI's approach for appraising textual evidence had encompassed all forms of text (narrative, opinion, and policy), with one general tool used to guide critical appraisal. The proliferation of textual evidence and increase in textual evidence reviews demonstrate the need to reconceptualize JBI's methodological approach to critically appraising textual evidence. The objective of this paper is to outline the updated methodological approach to systematic reviews of textual evidence, especially in relation to the development of 3 separate critical appraisal tools for narrative, expert opinion, and policy text. Using an adapted Delphi approach, the JBI Textual Evidence Methodology Group convened over several rounds of meetings and discussions with international experts to reach consensus on the reconceptualization of critical appraisal tools for textual evidence sources. Strategies to effectively interrogate the legitimacy and authenticity of sources were found to be dependent upon the type of textual evidence under review. Therefore, 3 separate critical appraisal tools for narrative, expert opinion, and policy text were developed. This paper provides an overview of the development of 3 separate critical appraisal tools, highlighting the complex nature of textual evidence data sources.
Current European/US guidelines recommend that molecular testing in advanced non-small cell lung cancer (aNSCLC) be performed using next-generation sequencing (NGS). However, the global uptake of NGS is limited, largely owing to reimbursement constraints. We compared real-world costs of NGS and single-gene testing (SGT) in nonsquamous aNSCLC. This observational study was conducted across 10 pathology centers in 10 different countries worldwide. Biomarker data collected via structured questionnaires (1 January-31 December 2021) were used to feed micro-costing analyses for three scenarios ['Starting Point' (SP; 2021-2022), 'Current Practice' (CP; 2023-2024), and 'Future Horizons' (FH; 2025-2028)] in both a real-world model, comprising all biomarkers tested by each center, and a standardized model, comprising the same sets of biomarkers across centers. Testing costs (including retesting) encompassed personnel costs, consumables, equipment, and overheads. Overall, 4,491 patients with aNSCLC were evaluated. Mean per-patient costs decreased for NGS relative to SGT over time, with real-world model costs 18% lower for NGS than for SGT in the SP scenario, and 26% lower for NGS than for SGT in the CP scenario. Mean per-biomarker costs also decreased over time for NGS relative to SGT. In the standardized model, the tipping point for the minimum number of biomarkers required for NGS to result in cost savings (per patient) was 10 and 12 in the SP and CP scenarios, respectively. Retesting had a negligible impact on cost analyses, and results were robust to variation in cost parameters. This study provides robust real-world global evidence for cost savings with NGS-based panels over SGT to evaluate predictive biomarkers in nonsquamous aNSCLC when the number of biomarkers to be tested exceeds 10. Widespread adoption of NGS may enable more efficient use of limited healthcare resources.
- MeSH
- Cost-Benefit Analysis MeSH
- Genetic Testing economics methods MeSH
- Middle Aged MeSH
- Humans MeSH
- Biomarkers, Tumor * genetics MeSH
- Lung Neoplasms * genetics pathology diagnosis MeSH
- Health Care Costs MeSH
- Carcinoma, Non-Small-Cell Lung * genetics diagnosis pathology MeSH
- Aged MeSH
- High-Throughput Nucleotide Sequencing * economics MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Multicenter Study MeSH
- Observational Study MeSH
- Comparative Study MeSH
IMPORTANCE: Cigarette smoking is a strong risk factor for mortality in patients diagnosed with head and neck squamous cell carcinoma (HNSCC). However, little evidence supports which smoking metric best models the association between smoking and survival in HNSCC. OBJECTIVE: To determine which smoking metric best models a linear association between smoking exposure and overall survival (OS) in patients with HNSCC. DESIGN, SETTING, AND PARTICIPANTS: A retrospective multicenter cohort study of 6 clinical epidemiological studies was performed. Five were part of the Human Papillomavirus, Oral and Oropharyngeal Cancer Genomic Research (VOYAGER) consortium. Participants included patients 18 years and older with pathologically confirmed HNSCC. Data were collected from January 2002 to December 2019, and data were analyzed between January 2022 to November 2024. MAIN OUTCOMES AND MEASURES: The primary outcome was OS. The performance of 8 smoking metrics, including pack-years, duration, and log cig-years (calculated as log10[cigarettes smoked per day + 1] × number of years smoked) for modeling OS were compared. Metric performance was measured by the strength of association in Cox proportional hazard models, linearity based on P for linear trend, Akaike information criterion (AIC; lower value indicates better model fit), and visual assessment of spline curves. Secondary outcomes included modeling OS in clinicodemographic subgroups and HNSCC anatomic subsites. Exploratory outcomes included cancer-specific survival and noncancer survival. RESULTS: In total, 8875 patients with HNSCC (2114 [24%] female; median [IQR] age, 61 [54-69] years) were included. Of 8 smoking metrics evaluated, smoking duration (adjusted hazard ratio [aHR], 1.11 [95% CI, 1.03-1.19]) and log cig-years (aHR, 1.11 [95% CI, 1.04-1.18]) had the highest aHRs; both had a statistically significant linear association with OS. Log cig-years had the lowest AIC linear value and the most visually linear spline curve when modeling OS. Duration and log cig-years outperformed pack-years for modeling OS regardless of age, smoking status, and cancer stage. Both performed well in lip and oral cavity, laryngeal (only duration was significant), and human papillomavirus-negative oropharyngeal subsites. In an exploratory analysis, duration had the highest aHR (1.15 [95% CI, 1.02-1.29]), and log cig-years had the lowest AIC linear value when modeling noncancer survival. CONCLUSIONS AND RELEVANCE: In this cohort study, smoking duration and log cig-years best modeled a linear relationship with OS for patients with HNSCC. Both metrics maintained robust performance within specific clinicodemographic subgroups and anatomic subsites. Although most HNSCC survival models control for smoking exposure using smoking status or pack-years, duration and log cig-years may be superior metrics to account for the effects of smoking on survival.
- MeSH
- Squamous Cell Carcinoma of Head and Neck * mortality MeSH
- Cigarette Smoking * adverse effects MeSH
- Smoking * adverse effects MeSH
- Middle Aged MeSH
- Humans MeSH
- Survival Rate MeSH
- Head and Neck Neoplasms * mortality MeSH
- Proportional Hazards Models MeSH
- Retrospective Studies MeSH
- Risk Factors MeSH
- Aged MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Multicenter Study MeSH