Differential item functioning
Objectives. The presence of differential item functioning (DIF) in a psychological measurement tool can indicate bias between the groups studied. The present study analyses DIF between males and females in the Slovak version of the NEO Five-Factor Inventory (NEO-FFI). Sample and settings. The sample consists of 1016 respondents (40.6% males) from Slovakia, with a mean age of 25.97 years (SD = 13.42). Statistical analysis. DIF was analysed using Mantel's test in DIFAS 5.0, ordinal logistic regression, and Poly-SIBTEST. Results. The final results show 18 items with medium or large uniform DIF, 6 of them identified by all three methods. One item showed purely non-uniform DIF. Deleting the identified DIF items caused a notable change in the score difference between males and females on the Openness to Experience scale. Possible factors causing DIF are discussed. The presence of the identified DIF items does not suggest that the NEO-FFI is severely biased against males or females. Limitations. The main limitation of the present study is its use of a general sample. Future studies should verify these findings on age-specific samples.
- Keywords
- Big Five
- MeSH
- Adult MeSH
- Extraversion, Psychological MeSH
- Gender Identity * MeSH
- Data Interpretation, Statistical MeSH
- Introversion, Psychological MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Young Adult MeSH
- Neuroticism MeSH
- Personality Inventory * MeSH
- Aged, 80 and over MeSH
- Aged MeSH
- Self Report MeSH
- Check Tag
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Young Adult MeSH
- Aged, 80 and over MeSH
- Aged MeSH
Objectives. Differential item functioning (DIF) can be assessed with multiple methods based on classical test theory or item response theory. The study compares the items identified as functioning differentially between males and females by the Mantel-Haenszel chi-square procedure, logistic regression, and the SIBTEST method, and thereby the consistency of these methods. Sample and settings. The sample consists of 507 Slovak respondents, 258 men (50.9%) and 249 women (49.1%), aged 18 to 65 years (M = 42.44; SD = 12.75). Statistical analysis. Correlations between the effect sizes of the individual methods were computed, together with matching percentages in identified DIF items, matching percentages in items categorized as showing moderate or large DIF, and relative matching percentages in the categorization of DIF items before and after purification. Results. The results show very strong correlations between the individual methods and fairly high matching percentages in identified DIF items (57.15–100%) at p < 0.05, which indicates medium to high consistency of these methods. Matching percentages were lower for the categorization of items by DIF severity. The categorization based on logistic regression appears especially problematic: its agreement with the other methods on items labelled as moderate or large DIF was only 14.29–50%. The matching percentages between the Mantel-Haenszel procedure and SIBTEST were 57.15–83.33%. The relative matching percentages yielded similar findings. Items that all three methods categorized as showing moderate or large DIF are present in all three subscales. Limitations. The main limitation of this study is the size and diversity of the sample, which is sufficient for the analyses conducted; further research on other samples would be appropriate to confirm the findings.
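As a rough illustration of the Mantel-Haenszel approach named in the abstract above, the following minimal sketch computes the Mantel-Haenszel common odds ratio for one dichotomous item and converts it to the ETS delta scale (delta = -2.35 ln alpha). It shows the effect-size side of the method only, not the chi-square test, and it is not the procedure or software used in the study. The function name `mantel_haenszel_dif` and all counts are hypothetical, chosen purely for illustration.

```python
import math

def mantel_haenszel_dif(strata):
    """Mantel-Haenszel common odds ratio and ETS delta for one item.

    strata: one (a, b, c, d) tuple per matched total-score level, where
      a = reference group correct,  b = reference group incorrect,
      c = focal group correct,      d = focal group incorrect.
    """
    num = 0.0
    den = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue  # skip empty score levels
        num += a * d / n
        den += b * c / n
    if den == 0:
        raise ValueError("common odds ratio is undefined for these counts")
    alpha = num / den                # > 1: item favours the reference group
    delta = -2.35 * math.log(alpha)  # ETS delta scale; negative favours reference
    return alpha, delta

# Hypothetical counts for three total-score levels (illustration only).
example_strata = [(20, 10, 10, 20), (30, 10, 20, 20), (40, 5, 30, 15)]
alpha, delta = mantel_haenszel_dif(example_strata)
print(f"MH odds ratio = {alpha:.3f}, ETS delta = {delta:.3f}")
```

A commonly cited convention treats |delta| below 1 as negligible DIF and |delta| of at least 1.5 as large DIF, in both cases combined with significance tests that this sketch omits.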
We provide a tutorial on differential item functioning (DIF) analysis, an analytic method useful for identifying potentially biased items in assessments. After explaining a number of methodological approaches, we test for gender bias in two scenarios that demonstrate why DIF analysis is crucial for developing assessments, particularly because simply comparing two groups' total scores can lead to incorrect conclusions about test fairness. First, a significant difference between groups on total scores can exist even when items are not biased, as we illustrate with data collected during the validation of the Homeostasis Concept Inventory. Second, item bias can exist even when the two groups have exactly the same distribution of total scores, as we illustrate with a simulated data set. We also present a brief overview of how DIF analysis has been used in the biology education literature to illustrate the way DIF items need to be reevaluated by content experts to determine whether they should be revised or removed from the assessment. Finally, we conclude by arguing that DIF analysis should be used routinely to evaluate items in developing conceptual assessments. These steps will ensure more equitable, and therefore more valid, scores from conceptual assessments.
- MeSH
- Diagnostic Self Evaluation * MeSH
- Data Interpretation, Statistical MeSH
- Humans MeSH
- Surveys and Questionnaires * MeSH
- Psychometrics methods MeSH
- Reproducibility of Results MeSH
- Sensitivity and Specificity MeSH
- Models, Statistical * MeSH
- Bias (Epidemiology) * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
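The tutorial abstract in this record states that item bias can exist even when two groups have exactly the same distribution of total scores. The toy sketch below shows how that can happen on a deliberately tiny, hand-made two-item data set. It is not the simulated data set from the article, and the group labels, responses, and helper names are all hypothetical. Because the total score here includes the studied item, it only illustrates the idea and is not a usable DIF procedure.

```python
from collections import Counter, defaultdict

# Hypothetical responses to two items (1 = correct, 0 = incorrect).
reference = [(1, 0), (1, 0), (1, 1), (0, 0)]
focal = [(0, 1), (0, 1), (1, 1), (0, 0)]

def total_distribution(group):
    """Count how many respondents reach each total score."""
    return Counter(sum(responses) for responses in group)

def conditional_pass_rate(group, item):
    """Proportion answering `item` correctly at each total-score level."""
    by_total = defaultdict(list)
    for responses in group:
        by_total[sum(responses)].append(responses[item])
    return {total: sum(v) / len(v) for total, v in sorted(by_total.items())}

# Both groups have identical total-score distributions...
print(total_distribution(reference) == total_distribution(focal))

# ...yet among respondents with the same total score, item 0 behaves differently.
print(conditional_pass_rate(reference, 0))
print(conditional_pass_rate(focal, 0))
```

Comparing total scores alone would suggest the two groups are treated identically. The conditional pass rates show that respondents matched on total score still differ on the individual item, which is the pattern DIF methods are designed to detect.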
BACKGROUND: The Symptom Checklist (SCL) developed by the Health Behaviour in School-aged Children (HBSC) study is a non-clinical measure of psychosomatic complaints (e.g., headache and feeling low) that has been used in numerous studies. Several studies have investigated the psychometric characteristics of this scale; however, some psychometric properties remain unclear, among them especially a) dimensionality, b) adequacy of the Graded Response Model (GRM), and c) measurement invariance across countries. METHODS: Data from 229,906 adolescents aged 11, 13 and 15 from 46 countries that participated in the 2018 HBSC survey were analyzed. Adolescents were selected using representative sampling and surveyed by questionnaire in the classroom. Dimensionality was investigated using exploratory graph analysis. In addition, we investigated whether the GRM provided an adequate description of the data. Reliability over the latent variable continuum and differential test functioning across countries were also examined. RESULTS: Exploratory graph analyses showed that SCL can be considered as one-dimensional in 16 countries. However, a comparison of the unidimensional with a post-hoc bifactor GRM showed that deviation from a hypothesized one-dimensional structure was negligible in most countries. Multigroup invariance analyses supported configural and metric invariance, but not scalar invariance across 32 countries. Alignment analysis showed non-invariance especially for the items irritability, feeling nervous/bad temper and feeling low. CONCLUSION: HBSC-SCL appears to represent a consistent and reliable unidimensional instrument across most countries. This bodes well for population health analyses that rely on this scale as an early indicator of mental health status.
- MeSH
- Child MeSH
- Emotions MeSH
- Checklist * MeSH
- Humans MeSH
- Adolescent MeSH
- Surveys and Questionnaires MeSH
- Psychometrics MeSH
- Reproducibility of Results MeSH
- Schools * MeSH
- Check Tag
- Child MeSH
- Humans MeSH
- Adolescent MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
OBJECTIVE: The Boston Naming Test (BNT) is the most widely used test to assess visual confrontation naming in both research and clinical settings. Recently, an abbreviated Czech version of the BNT was described. The purpose of this study is to assess the validity of this new test at the item level with advanced psychometric methods to assess its equivalence with the original test. The rationale was to help busy clinicians in the differential diagnosis of language disorders. METHOD: We administered the BNT-30 (odd item form of BNT-60) (N = 535; 75.61 ± 9.11; 60-96 years) and the shortened BNT-15 (N = 754; 71.94 ± 7.88; 60-96 years) to a large sample of healthy older adults. RESULTS: Significant but low associations between BNT performance and age, education, and sex were found. We found strong evidence for the unidimensionality of both BNT-15/BNT-30 versions in healthy adults (p's < .001). CONCLUSION: In-depth psychometric analysis of the BNT-15 and BNT-30 Czech versions shows that the test stimuli function in a similar fashion to those of the original BNT. Normative values adjusting for the influence of age, education, and sex are provided for use in clinical settings and future cross-cultural comparisons.
- MeSH
- Language * MeSH
- Language Tests MeSH
- Humans MeSH
- Neuropsychological Tests MeSH
- Psychometrics MeSH
- Aged MeSH
- Check Tag
- Humans MeSH
- Aged MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Geographic names
- Czech Republic MeSH
Objective: The transition from categorical models of personality disorders to a dimensional approach to personality functioning, as conceptualized by the DSM-5 and ICD-11, challenges the clinical field to implement new diagnostic methods for personality disorders. The concept of personality functioning also appears to have broader application because of its ability to capture the overall severity of psychopathology. The psychometric properties of the Czech versions of two instruments assessing personality functioning have already been verified. However, their further use requires additional research. In this study, we therefore focus on their ability to discriminate between people without a psychiatric diagnosis and two groups of people with a psychiatric diagnosis. Method: The study was conducted on 163 respondents from three groups: 1) patients with personality disorders (n = 58), 2) patients with increased neuroticism scores (n = 50), 3) general population (n = 55). Personality functioning was assessed with the self-report questionnaire Level of Personality Functioning Scale-Brief Form 2.0 (LPFS-BF 2.0) and the clinician-rated Semi-structured Interview for Personality Functioning DSM-5 (STiP-5.1). Results: The three groups differed in their level of personality functioning. In general, people with personality disorders had the highest level of personality functioning impairment, the general population had the lowest, and the group with increased neuroticism scores fell in between. While the STiP-5.1 was able to differentiate even between the patient groups, the LPFS-BF 2.0 was less able to do so. Among the demographic variables, age correlated with personality functioning, suggesting a trend toward more mature personality functioning at older age. Higher education was also associated with lower personality functioning impairment.
Conclusion: The semi-structured interview STiP-5.1 and the screening questionnaire LPFS-BF 2.0 both showed adequate ability to discriminate between the three study groups. Both methods are user-friendly, and given the good psychometric properties of their Czech versions, they can be recommended for research purposes and, with caution, for clinical practice. The repertoire of modern methods available in the Czech Republic for assessing (not only) personality disorders is thus expanding.
- MeSH
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Neuroticism MeSH
- Personality * MeSH
- Personality Disorders * diagnosis MeSH
- Patient Health Questionnaire MeSH
- Aged MeSH
- Check Tag
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Research Support, Non-U.S. Gov't MeSH
BACKGROUND: There is a growing importance of loneliness measurement through valid and reliable instruments. However, to establish valid and reliable measures, there is a need to explore their psychometric properties in different research settings and language environments. For this reason, this study aimed to validate the Three Item Loneliness Scale (TILS) in the Czech Republic within a Slavonic language environment. METHODS: A sample of Czech adults (n = 3236) was used consisting primarily of university students. We utilized Classical Test Theory to assess TILS internal consistency, temporal stability, and factor structure. Item Response Theory (IRT) was used to estimate Differential Item Functioning (DIF), the discrimination and difficulty of the TILS items and to estimate the measurement precision of the whole scale. Construct validity was explored through the Spearman correlation coefficient using personality traits, depression, and anxiety. RESULTS: The results showed satisfactory reliability and validity of the TILS in the Czech Republic. The scale's internal consistency and temporal stability were found to be satisfactory (Cronbach's α = 0.81, McDonald's ω = 0.82, ICC = 0.71). The parallel analysis supported the unidimensionality of the TILS. The IRT results indicated that the highest measurement precision was reached in individuals with lower and above-average levels of loneliness. Significant correlations between the TILS scores, anxiety, depression, and personality traits supported the construct validity of the scale. Although the DIF analysis identified statistically significant differences in responses to items TILS_2 and TILS_3 based on education level and employment status (with no significant differences observed for TILS_1), the effect sizes of these differences were small. This indicates that, despite statistical significance, the practical impact on the scale's validity across these groups is minimal. 
CONCLUSIONS: The validated TILS provides a reliable and valid tool for assessing loneliness in the Czech Republic. Its brevity makes it a practical option for researchers and clinicians seeking to measure loneliness time-efficiently. Future studies should explore how adding new items could increase the measurement precision of the TILS.
- MeSH
- Depression psychology diagnosis MeSH
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Young Adult MeSH
- Loneliness * psychology MeSH
- Surveys and Questionnaires standards MeSH
- Psychometrics * MeSH
- Reproducibility of Results MeSH
- Anxiety psychology diagnosis MeSH
- Check Tag
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Young Adult MeSH
- Male MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Geographic names
- Czech Republic MeSH
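The TILS record above reports internal consistency as Cronbach's α. As a reminder of what that coefficient measures, here is a minimal sketch of the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores). The response matrix is hypothetical and far smaller than any real sample, so it does not reproduce the reported value. The function name `cronbach_alpha` is likewise an illustrative choice.

```python
from statistics import variance

def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents' item-score tuples."""
    k = len(rows[0])                      # number of items
    item_columns = list(zip(*rows))       # one tuple of scores per item
    item_var_sum = sum(variance(col) for col in item_columns)
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_var_sum / total_var)

# Hypothetical answers of five respondents to a three-item scale (1-3 points each).
responses = [(1, 1, 1), (2, 2, 1), (2, 3, 2), (3, 3, 3), (1, 2, 1)]
print(round(cronbach_alpha(responses), 3))
```

The sketch uses sample variances throughout. Using population variances instead would give the same α, because the normalising constant cancels in the ratio.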
The objective of the article is to present an item analysis of selected subtests of the Czech version of the WJ IV COG battery in a group of Romani children aged 7-11. The research sample consisted of 400 school-aged Romani children from the Czech Republic who were selected by quota sampling. The comparison sample for part of the analysis was the Czech population sample collected for the norms of the Czech edition by © Propsyco (n = 936). The Woodcock-Johnson IV COG was used as the research tool. Statistical analysis was performed in Winsteps software using Differential Item Functioning; differences between groups were expressed in logits and tested with the Rasch-Welch t-test. Item difficulty was higher in the verbal subtests, although variability in item difficulty was found across all subtests. The analysis of individual items makes it possible to discover which tasks are most culturally influenced.
- MeSH
- Child MeSH
- Ethnicity MeSH
- Humans MeSH
- Psychometrics MeSH
- Reproducibility of Results MeSH
- Roma * MeSH
- Electric Power Supplies MeSH
- Check Tag
- Child MeSH
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Geographic names
- Czech Republic MeSH
Sum fraction terms can approximate multi-variable functions on the basis of discrete observations, replacing a partial differential equation definition with polynomial elementary data relation descriptions. Artificial neural networks commonly transform the weighted sum of inputs to describe overall similarity relationships of trained and new testing input patterns. Differential polynomial neural networks form a new class of neural networks, which construct and solve an unknown general partial differential equation of a function of interest with selected substitution relative terms using non-linear multi-variable composite polynomials. The layers of the network generate simple and composite relative substitution terms whose convergent series combinations can describe partial dependent derivative changes of the input variables. This regression is based on trained generalized partial derivative data relations, decomposed into a multi-layer polynomial network structure. The sigmoidal function, commonly used as a nonlinear activation of artificial neurons, may transform some polynomial items together with the parameters with the aim to improve the polynomial derivative term series ability to approximate complicated periodic functions, as simple low order polynomials are not able to fully make up for the complete cycles. The similarity analysis facilitates substitutions for differential equations or can form dimensional units from data samples to describe real-world problems.
Introduction: A surgical sponge retained in the paraspinal area after surgery is a rare and mostly asymptomatic complication; in chronic cases, however, it may pose a problem for differential diagnosis. Case report: Our patient was a 69-year-old man with squamous cell carcinoma of the larynx and a history of right-sided lumbar discectomy at the L4–L5 and L5–S1 levels performed 20 years earlier. Computed positron emission tomography confirmed increased metabolism of fluorodeoxyglucose in the sacral region. T2-weighted MRI showed a hypointense soft tissue mass with a hyperintense peripheral rim. The intraoperative finding and biopsy confirmed a fibrous formation with embedded gauze fibers. Our case is among the published case reports with the longest interval between the primary spine surgery and the diagnosis of gossypiboma. Conclusions: Retained surgical sponges do not present with any specific clinical or radiological signs. They should be included in the differential diagnosis of soft tissue masses in the paraspinal region in patients after spinal surgery. Magnetic resonance imaging and intraoperative findings are the best modalities for the diagnosis of gossypiboma.
- Keywords
- textiloma
- MeSH
- Medical Errors MeSH
- Foreign Bodies diagnosis surgery complications MeSH
- Diagnosis, Differential MeSH
- Diskectomy * adverse effects MeSH
- Granuloma, Foreign-Body * diagnosis surgery pathology MeSH
- Humans MeSH
- Magnetic Resonance Imaging MeSH
- Neoplasm Metastasis diagnosis MeSH
- Intervertebral Disc surgery MeSH
- Spinal Diseases diagnosis etiology surgery MeSH
- Granuloma, Plasma Cell diagnosis surgery pathology MeSH
- Positron-Emission Tomography MeSH
- Propionibacterium acnes isolation & purification MeSH
- Aged MeSH
- Surgical Sponges * adverse effects MeSH
- Check Tag
- Humans MeSH
- Male MeSH
- Aged MeSH
- Publication type
- Case Reports MeSH