Fitting Bayesian Item Response Theory Models Using Deep Learning Computational Frameworks

  • Nanyu Luo
  • Yuting Han
  • Jinbo He
  • Xiaoya Zhang
  • Feng Ji
  • University of Toronto
  • Beijing Language and Culture University
  • University of Florida

Research output: Contribution to journal; Article; peer-review

Abstract

PyTorch and TensorFlow are two widely adopted modern deep learning frameworks that provide comprehensive computational libraries for developing and fitting complex models. Motivated by the technical barriers in recent item response theory (IRT) work and the lack of practice-oriented tutorials, we demonstrate how modern deep learning platforms can be used for Bayesian IRT parameter estimation by providing a didactic yet in-depth introduction to PyTorch and TensorFlow in a psychometric context, framing IRT models as graphical models, and offering step-by-step guidance that bridges probabilistic machine learning and psychometrics. In this study, we illustrate how to leverage these platforms to estimate widely used psychometric models in educational testing, psychological measurement, and behavioral assessment, namely dichotomous and polytomous IRT models and their multidimensional extensions. We compare Hamiltonian Monte Carlo (HMC) and variational inference (VI) estimators for these models in a unified computational environment. Simulation studies show that both approaches yield parameter estimates with low mean squared error and bias in low-dimensional settings, while also indicating that VI might underestimate aspects of posterior uncertainty in higher-dimensional scenarios. Nonetheless, for practitioners who prioritize computational efficiency and scalability, especially when graphics processing unit (GPU) acceleration is available, VI remains a compelling option. Three empirical case studies further demonstrate how PyTorch- and TensorFlow-based implementations compare with established IRT software in applied settings. We conclude by discussing the broader potential of integrating contemporary deep learning tools and perspectives into psychometric research.

Original language: English
Journal: Journal of Educational and Behavioral Statistics
Publication status: Accepted/In press - Apr 2026

Keywords

  • deep learning
  • item response theory
  • Markov Chain Monte Carlo
  • PyTorch
  • TensorFlow
  • variational inference
