Hostname: page-component-745bb68f8f-g4j75 Total loading time: 0 Render date: 2025-01-27T13:56:10.797Z Has data issue: false hasContentIssue false

High-Dimensional Maximum Marginal Likelihood Item Factor Analysis by Adaptive Quadrature

Published online by Cambridge University Press:  01 January 2025

Stephen Schilling*
Affiliation:
School of Education, University of Michigan
R. Darrell Bock
Affiliation:
Center for Health Statistics, University of Illinois at Chicago
*
Requests for reprints should be sent to Stephen Schilling, Assistant Professor, University of Michigan, School of Education, Ann Arbor, MI 48109, USA. E-mail: schillsg@umich.edu

Abstract

Although the Bock–Aitkin likelihood-based estimation method for factor analysis of dichotomous item response data has important advantages over classical analysis of item tetrachoric correlations, a serious limitation of the method is its reliance on fixed-point Gauss-Hermite (G-H) quadrature in the solution of the likelihood equations and likelihood-ratio tests. When the number of latent dimensions is large, computational considerations require that the number of quadrature points per dimension be few. But with large numbers of items, the dispersion of the likelihood, given the response pattern, becomes so small that the likelihood cannot be accurately evaluated with the sparse fixed points in the latent space. In this paper, we demonstrate that substantial improvement in accuracy can be obtained by adapting the quadrature points to the location and dispersion of the likelihood surfaces corresponding to each distinct pattern in the data. In particular, we show that adaptive G-H quadrature, combined with mean and covariance adjustments at each iteration of an EM algorithm, produces an accurate fast-converging solution with as few as two points per dimension. Evaluations of this method with simulated data are shown to yield accurate recovery of the generating factor loadings for models of up to eight dimensions. Unlike an earlier application of adaptive Gibbs sampling to this problem by Meng and Schilling, the simulations also confirm the validity of the present method in calculating likelihood-ratio chi-square statistics for determining the number of factors required in the model. Finally, we apply the method to a sample of real data from a test of teacher qualifications.

Type
Original Paper
Copyright
Copyright © 2005 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Ahrens, J.H., Dieter, U. (1979). Computer methods for sampling from the exponential and normal distributions. Communications of the Association for Computing Machinery, 15, 873882.CrossRefGoogle Scholar
Ansari, A., Jedidi, K. (2000). Bayesian factor analysis for multilevel binary observations. Psychometrika, 65(4), 475496.CrossRefGoogle Scholar
Bartholomew, D.J., Knott, M. (1999). Latent Variable Models and Factor Analysis. Oxford: New York.Google Scholar
Bock, R.D. (1975/1985) Multivariate Statistical Methods in Behavioral Research. McGraw-Hill, New York; 1985 reprint, Chicago: Scientific Software InternationalGoogle Scholar
Bock, R.D., Lieberman, M. (1970). Fitting a response model for dichotomously scored items. Psychometrika, 35, 179197.CrossRefGoogle Scholar
Bock, R.D., Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: application of an EM algorithm. Psychometrika, 46, 443459.CrossRefGoogle Scholar
Bock, R.D., Gibbons, R.D., Muraki, E. (1987). Full information item factor analysis. Applied Psychological Measurement, 12(3), 261280.CrossRefGoogle Scholar
Bock, R.D., Schilling, S.G. (1997). High-dimensional full-information item factor analysis. In Birkane, M. (Eds.), Latent Variable Modeling and Applications to Causality (pp. 163176). New-York: Springer.CrossRefGoogle Scholar
Bock, R.D., Gibbons, R.D., Muraki, E., Schilling, S.G., Wilson, D.T., Wood, R. (1999). TESTFACT 3: Test Scoring, Item Statistics, and Full-information Item Factor Analysis. Chicago: Scientific Software International.Google Scholar
Divgi, D.R. (1979). Calculation of the tetrachoric correlation coefficient. Psychometrika, 44, 169172.CrossRefGoogle Scholar
Ferguson, G.A. (1941). The factorial interpretation of test difficulty. Psychometrika, 6, 323329.CrossRefGoogle Scholar
Fox, J.P., Glas, C.A.W. (2001). Bayesian estimation of a multilevel IRT model using Gibbs sampling. Psychometrika, 66(2), 271288.CrossRefGoogle Scholar
Guilford, J.P. (1941). The difficulty of a test and its factor composition. Psychometrika, 6, 6677.CrossRefGoogle Scholar
Haberman, S.J. (1977). Log-linear models and frequency tables with small expected cell counts. Annals of Statistics, 5, 11481169.CrossRefGoogle Scholar
Harman, H.H. (1987). Modern Factor Analysis. Chicago: University of Chicago Press.Google Scholar
Hedeker, D., Gibbons, R.D. (1994). A random-effects ordinal regression model for multilevel analysis. Biometrics, 50, 933944.CrossRefGoogle ScholarPubMed
Hill, H.C., Schilling, S.G., Ball, D.L. (2004) Developing measures of teachers mathematics knowledge for teaching. Elementary School Journal, in press.CrossRefGoogle Scholar
Householder, A.S. (1964). The Theory of Matrices in Numerical Analysis. New York: Blaisdell.Google Scholar
Kaiser, H.F. (1958). The varimax criterion for analytic rotation in factor analysis. Psychometrika, 23, 187200.CrossRefGoogle Scholar
Leonelli, B.T., Chang, C.H., Bock, R.D., Schilling, S.G. (2000). A full-information item factor analysis interpretation of the MMPI-2: Normative Sampling with Non-pathonomic Descriptors. Journal of Personality Assessment, 74(3), 400422.CrossRefGoogle ScholarPubMed
Lesaffre, E., Spiessens, B. (2001). On the effect of the number of quadrature points in a logistic random-effects model: an example. Applied Statistics, 50, 325335.Google Scholar
Lindstrom, M.J., Bates, D.M. (1990). Nonlinear mixed effects models for repeated measures data. Biometrics, 46, 673687.CrossRefGoogle ScholarPubMed
Liu, C., Rubin, D.B., Wu, Y.N. (1998). Parameter expansion to accelerate EM: The PX-EM algorithm. Biometrika, 85(4), 755770.CrossRefGoogle Scholar
Liu, Q., Pierce, D.A. (1994). A note on G-H quadrature. Biometrika, 81(3), 624629.Google Scholar
Meng, X.L., Schilling, S.G. (1996). Fitting full-information factor models and an empirical investigation of bridge sampling. Journal of the American Statistical Association, 91, 12541267.CrossRefGoogle Scholar
Mislevy, R.J. (1984). Estimating latent distributions. Psychometrika, 49(3), 359381.CrossRefGoogle Scholar
Muthén, B.O. (1984). A general structural equation model with dichotomous, ordered categorical and continuous latent variable indicators. Psychometrika, 49, 115132.CrossRefGoogle Scholar
Muthén, L. K., Muthén, B.O. (19982001). Mplus User’s Guide (Second edition). Muthén & Muthén, Los Angeles CA.Google Scholar
Naylor, J.C., Smith, A.F.M. (1982). Applications of a method for the efficient computation of posterior distributions. Applied Statistics, 31, 214225.CrossRefGoogle Scholar
Polak, E. (1971). Computational Methods in Optimization. New York: Academic Press.Google Scholar
Powell, M.J.D. (1964). An efficient method for several variables without calculating derivatives. Computer Journal, 7, 155162.CrossRefGoogle Scholar
Rabe-Hesketh, S., Pickles, A., Skrondal, A., (2001) GLLAMM Manual. Tech. rept. 2001/01. Department of Biostatistics and Computing, Institute of Psychiatry, King’s College, University of London. Downloadable from http://www.gllamm.org.Google Scholar
Rabe-Hesketh, S., Skrondal, A., Pickles, A. (2002). Reliable estimation of generalized linear mixed models using adaptive quadrature. The Stata Journal, 2, 121.CrossRefGoogle Scholar
Rabe-Hesketh, S., Skrondal, A., Pickles, A. (2005). Generalized multilevel structural equation modeling. Psychometrika, 69, 167190.CrossRefGoogle Scholar
Rabe-Hesketh, S., Skrondal, A., Pickles, A. (2005b) Maximum likelihood estimation of limited and discrete dependent variable models with nested random effects. Journal of Econometrics, in press.CrossRefGoogle Scholar
Ramsay, J.O. (1998). Estimating smooth monotone functions. Journal of the Royal Statistical Society, Series B, 60, 365375.CrossRefGoogle Scholar
Raudenbush, S.W., Yang, M.Yosef (2000). Maximum likelihood for generalized linear models with nested random effects via high-order, multivariate Laplace approximation. Journal of Computational and Graphical Statistics, 9(1), 141157.CrossRefGoogle Scholar
Raudenbush, S.W., Birk, A.S. (2002). Maximum likelihood for generalized linear models with nested random effects via high-order, multivariate Laplace approximation. Journal of Computational and Graphical Statistics, 9(1), 141157.CrossRefGoogle Scholar
Ripley, B.D. (1987). Stochastic Simulation. New York: Wiley.CrossRefGoogle Scholar
Schilling, S.G. (1993) Advances in Full Information Item Factor Analysis using the Gibbs Sampler. (Unpublished doctoral dissertation, University of Chicago)Google Scholar
Schrage, L. (1979). A more portable fortran random number generator. Association for Computing Machinery: Transactions on Mathematical Software, 5, 132138.CrossRefGoogle Scholar
Thurstone, L.L. (1947). Multiple Factor Analysis. Chicago: The University of Chicago Press.Google Scholar
Thurstone, L.L., Thurstone, T.G. (1941). Factorial studies of intelligence. Psychometric Monographs No. 2. Chicago: University of Chicago Press.Google Scholar
Tierney, L., Kadane, J.B. (1986). Accurate approximations for posterior moments and marginal densities. Journal of the American Statistical Association, 81, 8286.CrossRefGoogle Scholar
Wei, G.C.G., Tanner, M.A. (1990). A Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithm. Journal of the American Statistical Association, 85, 699704.CrossRefGoogle Scholar
Wood, R., Wilson, D.T., Gibbons, R.D., Schilling, S.G., Muraki, E., Bock, R.D. (2003). TESTFACF 4: Test Scoring, Item Statistics, and Full-information Item Factor Analysis. Chicago: Scientific Software International.Google Scholar
Zimowski, M.F., Muraki, E., Mislevy, R.J., Bock, R.D. (1995). BILOG-MG: multiple-group item analysis and test scoring. Chicago: Scientific Software International.Google Scholar