Using effect sizes for research reporting: examples using item response theory to analyze differential item functioning.
- Authors: Steinberg, Lynne; Thissen, David
- Year: 2006
- Journal: Psychological Methods
- PMID: 17154754
- DOI: 10.1037/1082-989X.11.4.402
The psychological literature currently emphasizes reporting the "effect size" of research findings in addition to the outcome of any tests of significance. However, some confusion may result from the fact that there are three distinct uses of effect sizes in the psychological literature, namely, power analysis, research synthesis, and research reporting. The authors review these uses of effect sizes and develop a case study of the description of effect size for research reporting in the context of item response theory. For many parametric models, hypotheses are tested by comparing the values of directly interpretable parameters. The authors show that the size of the effect can be expressed by a presentation of the values of the parameter estimates derived from the fitted model. Studies that use item response theory to detect differential item functioning provide illustrations.
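As a rough illustration of the abstract's point that fitted parameter values can themselves convey the size of an effect, the sketch below compares one item's parameters across two groups. It is not the authors' analysis: the two-parameter logistic (2PL) model, the group labels, and every numeric value are assumptions chosen only for illustration.

```python
import math


def p_2pl(theta, a, b):
    """Probability of endorsing an item under a 2PL model.

    theta: latent trait level; a: discrimination; b: location (threshold).
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical parameter estimates for one item in two groups.
# These numbers are invented for illustration, not taken from the paper.
reference = {"a": 1.5, "b": 0.0}
focal = {"a": 1.5, "b": 0.5}


def dif_summary(ref, foc, thetas=(-1.0, 0.0, 1.0)):
    """Summarize DIF as parameter differences plus response-probability gaps."""
    rows = []
    for theta in thetas:
        p_ref = p_2pl(theta, ref["a"], ref["b"])
        p_foc = p_2pl(theta, foc["a"], foc["b"])
        rows.append((theta, p_ref, p_foc, p_ref - p_foc))
    return {
        "a_difference": foc["a"] - ref["a"],  # difference in discrimination
        "b_difference": foc["b"] - ref["b"],  # threshold shift on the theta scale
        "rows": rows,
    }


summary = dif_summary(reference, focal)
print("b difference (focal - reference):", summary["b_difference"])
for theta, p_ref, p_foc, gap in summary["rows"]:
    print(f"theta={theta:+.1f}  P_ref={p_ref:.3f}  P_foc={p_foc:.3f}  gap={gap:.3f}")
```

In this sketch the threshold difference is read directly on the latent-trait scale, and the probability gaps show what that shift implies for respondents at a few trait levels. Reporting the estimates in this way is one possible reading of "presenting the parameter values"; the paper's own examples may use different models and summaries.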
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| Score-Based Tests With Fixed Effects Person Parameters in Item Response Theory: Detecting Model Misspecification Including Differential Item Functioning. | Debelak R et al. | - | 2026 | - |
| A Review of Some of the History of Factorial Invariance and Differential Item Functioning. | Thissen D | - | 2025 | - |
| Being Mindful About Overuse of Total Scores: a Comparison of Total Scores and Moderated Nonlinear Factor Analysis Scores in Assessing Mindfulness Across Race/Ethnicity, Age, and PTSD Diagnosis. | Lozano A et al. | - | 2025 | - |
| Differential Item Functioning Analysis of Likert Scales: An Overview and Demonstration of Rating Scale Tree Model. | Effatpanah F et al. | - | 2025 | - |
| Differential Item Functioning Effect Size Use for Validity Information. | Finch WH et al. | - | 2025 | - |
| Impacts of DIF Item Balance and Effect Size Incorporation With the Rasch Tree. | Asamoah NAB et al. | - | 2025 | - |
| Measuring a Critical Component of Contraceptive Decision Making: The Contraceptive Concerns and Beliefs Scale. | Rocca CH et al. | - | 2024 | - |
| Agency in Contraceptive Decision-Making in Patient Care: a Psychometric Measure. | Harper CC et al. | - | 2023 | - |
| A New Stopping Criterion for Rasch Trees Based on the Mantel-Haenszel Effect Size Measure for Differential Item Functioning. | Henninger M et al. | - | 2023 | - |
| Differential Item Functioning of the Youth Psychopathic Traits Inventory Across Race/Ethnicity and Gender Among a Sample of Justice-Involved Youth: An Item Response Theory Analysis. | Ray JV | - | 2023 | - |
| Implementing a Standardized Effect Size in the POLYSIBTEST Procedure. | Weese JD et al. | - | 2023 | - |
| Indicators of Tobacco Dependence Among Youth: Findings From Wave 1 (2013-2014) of the Population Assessment of Tobacco and Health Study. | Strong DR et al. | - | 2023 | - |
| Power Analysis for the Wald, LR, Score, and Gradient Tests in a Marginal Maximum Likelihood Framework: Applications in IRT. | Zimmer F et al. | - | 2023 | - |
| Validation of the Brief Young Adult Alcohol Consequences Questionnaire among student and nonstudent young adults. | Stamates AL et al. | - | 2023 | - |
| A longitudinal approach to understanding boredom during pandemics: The predictive roles of trauma and emotion dysregulation. | Bambrah V et al. | - | 2022 | - |
| An R toolbox for score-based measurement invariance tests in IRT models. | Schneider L et al. | - | 2022 | - |
| Differential Item Functioning Analyses of the Patient-Reported Outcomes Measurement Information System (PROMIS®) Measures: Methods, Challenges, Advances, and Future Directions. | Teresi JA et al. | - | 2021 | - |
| Differential Item Functioning Effect Size From the Multigroup Confirmatory Factor Analysis for a Meta-Analysis: A Simulation Study. | Park SE et al. | - | 2021 | - |
| Harmonizing altered measures in integrative data analysis: A methods analogue study. | Hussong AM et al. | - | 2021 | - |
| Improving the Delivery of Function-Directed Care During Acute Hospitalizations: Methods to Develop and Validate the Functional Assessment in Acute Care Multidimensional Computerized Adaptive Test (FAMCAT). | Cheville AL et al. | - | 2021 | - |
| Is Motor Milestone Assessment in Infancy Valid and Scaled Equally Across Sex, Birth Weight, and Gestational Age? Findings From the Millennium Cohort Study. | de Almeida Maia D et al. | - | 2021 | - |
| Longitudinal change in restricted and repetitive behaviors from 8-36 months. | Sifre R et al. | - | 2021 | - |
| Psychometrics of three Swedish physical pediatric item banks from the Patient-Reported Outcomes Measurement Information System (PROMIS)®: pain interference, fatigue, and physical activity. | Carlberg Rindestig F et al. | - | 2021 | - |
| Improving the assessment of measurement invariance: Using regularization to select anchor items and identify differential item functioning. | Belzak WCM et al. | - | 2020 | - |
| Simplifying the Assessment of Measurement Invariance over Multiple Background Variables: Using Regularized Moderated Nonlinear Factor Analysis to Detect Differential Item Functioning. | Bauer DJ et al. | - | 2020 | - |
| Testing Differential Item Functioning in Small Samples. | Belzak WCM | - | 2020 | - |
| Age differences in DSM-IV borderline personality disorder symptom expression: Results from a national study using item response theory (IRT). | McMahon K et al. | - | 2019 | - |
| An Item-Level Analysis for Detecting Faking on Personality Tests: Appropriateness of Ideal Point Item Response Theory Models. | Liu J et al. | - | 2019 | - |
| DIF in the Spanish Version of the Verbal Selective Reminding Test Using Samples From Hispanics in the United States, Mexicans, and Spaniards. | Morales-Ortiz M et al. | - | 2019 | - |
| Item Response Theory Analysis of the Psychopathic Personality Inventory-Revised. | Eichenbaum AE et al. | - | 2019 | - |
| Psychometric Evaluation of an Instrument to Measure Prospective Pregnancy Preferences: The Desire to Avoid Pregnancy Scale. | Rocca CH et al. | - | 2019 | - |
| Simplifying the implementation of modern scale scoring methods with an automated R package: Automated moderated nonlinear factor analysis (aMNLFA). | Gottfredson NC et al. | - | 2019 | - |
| UNC Perceived Message Effectiveness: Validation of a Brief Scale. | Baig SA et al. | - | 2019 | - |
| Examining sex differences in DSM-IV-TR narcissistic personality disorder symptom expression using Item Response Theory (IRT). | Hoertel N et al. | - | 2018 | - |
| The use of latent variable mixture models to identify invariant items in test construction. | Sawatzky R et al. | - | 2018 | - |
| A more general model for testing measurement invariance and differential item functioning. | Bauer DJ | - | 2017 | - |
| Identifying Unbiased Items for Screening Preschoolers for Disruptive Behavior Problems. | Studts CR et al. | - | 2017 | - |
| Indicators of dependence for different types of tobacco product users: Descriptive findings from Wave 1 (2013-2014) of the Population Assessment of Tobacco and Health (PATH) study. | Strong DR et al. | - | 2017 | - |
| Montreal Accord on Patient-Reported Outcomes (PROs) use series-Paper 7: modern perspectives of measurement validation emphasize justification of inferences based on patient reported outcome scores. | Sawatzky R et al. | - | 2017 | - |
| A commentary on randomized clinical trials: How to produce them with a good level of evidence. | Flecha OD et al. | - | 2016 | - |
| Assessing Validity of Measurement in Learning Disabilities Using Hierarchical Generalized Linear Modeling: The Roles of Anxiety and Motivation. | Sideridis GD | - | 2016 | - |
| Calibration of the Spanish PROMIS Smoking Item Banks. | Huang W et al. | - | 2016 | - |
| Differences in symptom expression between unipolar and bipolar spectrum depression: Results from a nationally representative sample using item response theory (IRT). | Hoertel N et al. | - | 2016 | - |
| Differential endorsement of suicidal ideation and attempt in bipolar versus unipolar depression: a testlet response theory analysis. | Weinstock LM et al. | - | 2016 | - |
| Differential item functioning magnitude and impact measures from item response theory models. | Kleinman M et al. | - | 2016 | - |
| Methodological Issues in Examining Measurement Equivalence in Patient Reported Outcomes Measures: Methods Overview to the Two-Part Series, "Measurement Equivalence of the Patient Reported Outcomes Measurement Information System® (PROMIS®) Short Forms". | Teresi JA et al. | - | 2016 | - |
| The Accuracy of Computerized Adaptive Testing in Heterogeneous Populations: A Mixture Item-Response Theory Analysis. | Sawatzky R et al. | - | 2016 | - |
| Are symptom features of depression during pregnancy, the postpartum period and outside the peripartum period distinct? Results from a nationally representative sample using item response theory (IRT). | Hoertel N et al. | - | 2015 | - |
| Assessing the Straightforwardly-Worded Brief Fear of Negative Evaluation Scale for Differential Item Functioning Across Gender and Ethnicity. | Harpole JK et al. | - | 2015 | - |
| Development of a brief questionnaire to assess contraceptive intent. | Raine-Bennett TR et al. | - | 2015 | - |
| Differential item functioning (DIF) of SF-12 and Q-LES-Q-SF items among french substance users. | Bourion-Bédès S et al. | - | 2015 | - |
| Differential reporting of depressive symptoms across distinct clinical subpopulations: what DIFference does it make? | Wanders RB et al. | - | 2015 | - |
| Item response theory analysis of the life orientation test-revised: age and gender differential item functioning analyses. | Steca P et al. | - | 2015 | - |
| Measurement of multiple nicotine dependence domains among cigarette, non-cigarette and poly-tobacco users: Insights from item response theory. | Strong DR et al. | - | 2015 | - |
| Quantifying 'problematic' DIF within an IRT framework: application to a cancer stigma index. | Edelen MO et al. | - | 2015 | - |
| Sex differences in DSM-IV posttraumatic stress disorder symptoms expression using item response theory: A population-based study. | Rivollier F et al. | - | 2015 | - |
| Examining sex differences in DSM-IV borderline personality disorder symptom expression using Item Response Theory (IRT). | Hoertel N et al. | - | 2014 | - |
| PROMIS® Parent Proxy Report Scales for children ages 5-7 years: an item response theory analysis of differential item functioning across age groups. | Varni JW et al. | - | 2014 | - |
| Development and psychometric properties of the PROMIS(®) pediatric fatigue item banks. | Lai JS et al. | - | 2013 | - |
| Examining faking on personality inventories using unfolding item response theory models. | Scherbaum CA et al. | - | 2013 | - |
| Integrative data analysis in clinical psychology research. | Hussong AM et al. | - | 2013 | - |
| Is surgical root coverage effective for the treatment of cervical dentin hypersensitivity? A systematic review. | Douglas de Oliveira DW et al. | - | 2013 | - |
| Measurement invariance of the SF-12 across European-American, Latina, and African-American postpartum women. | Desouky TF et al. | - | 2013 | - |
| PROMIS Pediatric Peer Relationships Scale: development of a peer relationships item bank as part of social health measurement. | Dewalt DA et al. | - | 2013 | - |
| Scale refinement and initial evaluation of a behavioral health function measurement tool for work disability evaluation. | Marfeo EE et al. | - | 2013 | - |
| Validation of the problem gambling severity index using confirmatory factor analysis and rasch modelling. | Miller NV et al. | - | 2013 | - |
| Young women's perceptions of the benefits of childbearing: associations with contraceptive use and pregnancy. | Rocca CH et al. | - | 2013 | - |
| Latent variable mixture models: a promising approach for the validation of patient reported outcomes. | Sawatzky R et al. | - | 2012 | - |
| Modifying measures based on differential item functioning (DIF) impact analyses. | Teresi JA et al. | - | 2012 | - |
| Patterns of alcohol dependence in Thai drinkers: a differential item functioning analysis of gender and age bias. | Srisurapanont M et al. | - | 2012 | - |
| PROMIS® Parent Proxy Report Scales: an item response theory analysis of the parent proxy report item banks. | Varni JW et al. | - | 2012 | - |
| PROMIS Pediatric Anger Scale: an item response theory analysis. | Irwin DE et al. | - | 2012 | - |
| Functioning of alcohol use disorder criteria among men and women with arrests for driving under the influence of alcohol. | McCutcheon VV et al. | - | 2011 | - |
| Is caregiver-adolescent disagreement due to differences in thresholds for reporting manic symptoms? | Freeman AJ et al. | - | 2011 | - |
| Validation of the Excited Component of the Positive and Negative Syndrome Scale (PANSS-EC) in a naturalistic sample of 278 patients with acute psychosis and agitation in a psychiatric emergency room. | Montoya A et al. | - | 2011 | - |
| An item response analysis of the pediatric PROMIS anxiety and depressive symptoms scales. | Irwin DE et al. | - | 2010 | - |
| Assessing the severity of hazardous drinking and related consequences among incarcerated women. | Strong DR et al. | - | 2010 | - |
| DSM-IV depressive symptom expression among individuals with a history of hypomania: a comparison to those with or without a history of mania. | Weinstock LM et al. | - | 2010 | - |
| DSM-IV nicotine dependence symptom characteristics for recent-onset smokers. | Rose JS et al. | - | 2010 | - |
| Exploring the role of a nicotine quantity-frequency use criterion in the classification of nicotine dependence and the stability of a nicotine dependence continuum over time. | McBride O et al. | - | 2010 | - |
| PROMIS Pediatric Pain Interference Scale: an item response theory analysis of the pediatric pain item bank. | Varni JW et al. | - | 2010 | - |
| Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS): An item response theory approach. | Teresi JA et al. | - | 2009 | - |
| Differential item functioning of DSM-IV depressive symptoms in individuals with a history of mania versus those without: an item response theory analysis. | Weinstock LM et al. | - | 2009 | - |
| Linking measures of adolescent nicotine dependence to a common latent continuum. | Strong DR et al. | - | 2009 | - |
| Use of item response theory to understand differential functioning of DSM-IV major depression symptoms by race, ethnicity and gender. | Uebelacker LA et al. | - | 2009 | - |
| Incorporating Measurement Non-Equivalence in a Cross-Study Latent Growth Curve Analysis. | Flora DB et al. | - | 2008 | - |
| Item response theory detected differential item functioning between healthy and ill children in quality-of-life measures. | Langer MM et al. | - | 2008 | - |
| Occurrences and sources of Differential Item Functioning (DIF) in patient-reported outcome measures: Description of DIF methods, and review of measures of depression, quality of life and general health. | Teresi JA et al. | - | 2008 | - |
| Methodological issues for building item banks and computerized adaptive scales. | Thissen D et al. | - | 2007 | - |