Evaluating Construct Equivalence of Youth Depression Measures Across Multiple Measures and Multiple Studies

George W. Howe, Getachew A. Dagne, C. Hendricks Brown, Ahnalee M. Brincks, William Beardslee, Tatiana Perrino, Hilda Pantin

Research output: Contribution to journalArticle


Construct equivalence of measures across studies is necessary for synthesizing results when combining data in meta-analysis or integrative data analysis. We discuss several assumptions required for construct equivalence, and review methods using individual-level data and item response theory (IRT) analysis for detecting or adjusting for violations of these assumptions. We apply IRT to data from 7 measures of depressive symptoms for 4,283 youth from 16 randomized prevention trials. Findings indicate that these data violate assumptions of conditional independence. Bifactor IRT models find that depression measures contain substantial reporter variance, and indicate that a single common factor model would be substantially biased. Separate analyses of ratings by youth find stronger evidence for construct equivalence, but factor invariance across sex and age does not hold. We conclude that data synthesis studies employing measures of youth depression should analyze results separately by reporter, explore more complex approaches to integrate these different perspectives, and explore methods that adjust for sex and age differences in item functioning.

Original languageEnglish (US)
JournalPsychological Assessment
StatePublished - Jan 1 2019



  • Differential item functioning
  • Harmonization
  • Integrative data analysis
  • Measurement
  • Youth depression

ASJC Scopus subject areas

  • Clinical Psychology
  • Psychiatry and Mental health

Cite this