Hostname: page-component-745bb68f8f-g4j75 Total loading time: 0 Render date: 2025-01-07T18:41:15.715Z Has data issue: false hasContentIssue false

Analyzing Sequential Categorical Data: Individual Variation in Markov Chains

Published online by Cambridge University Press:  01 January 2025

William Gardner*
Affiliation:
University of Virginia
*
Requests for reprints should be addressed to William Gardner, Department of Psychology, University of Virginia, Charlottesville, VA 22903.

Abstract

Markov chains are probabilistic models for sequences of categorical events, with applications throughout scientific psychology. This paper provides a method for analyzing data consisting of event sequences and covariate observations. It is assumed that each sequence is a Markov process characterized by a distinct transition probability matrix. The objective is to use the covariate data to explain differences between individuals in the transition probability matrices characterizing their sequential data. The elements of the transition probability matrices are written as functions of a vector of latent variables, with variation in the latent variables explained through a multivariate regression on the covariates. The regression is estimated using the EM algorithm, and requires the numerical calculation of a multivariate integral. An example using simulated cognitive developmental data is presented, which shows that the estimation of individual variation in the parameters of a probability model may have substantial theoretical importance, even when individual differences are not the focus of the investigator's concerns.

Type
Original Paper
Copyright
Copyright © 1990 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

Research contributing to this article was supported by B.R.S. Subgrant 5-35345 from the University of Virginia. I thank the DADA Group, Bill Fabricius, Don Hartmann, William Griffin, Jack McArdle, Ivo Molenaar, Ronald Schoenberg, Simon Tavaré, and several anonymous reviewers for their discussion of these points.

References

Anderson, T., & Goodman, L. (1957). Statistical inference about Markov chains. Annals of Mathematical Statistics, 28, 89110.CrossRefGoogle Scholar
Atkinson, A. C. (1985). Plots, transformations, and regression: An introduction to graphical methods of diagnositc regression analysis, New York: Oxford University Press.Google Scholar
Bakeman, R., & Gottman, J. (1986). Observing Interaction, Cambridge: Cambridge University Press.Google Scholar
Bentler, P., & Lee, S. (1975). Some extensions of matrix calculus. General Systems, 20, 145150.Google Scholar
Bertler, P., & Tanaka, J. (1983). Problems with the EM algorithm for ML factor analysis. Psychometrika, 48, 247252.CrossRefGoogle Scholar
Brainerd, C., Howe, M., & Desrochers, A. (1982). The general theory of two-stage learning: A mathematical review with illustrations from memory development. Psychological Bulletin, 91, 634665.CrossRefGoogle Scholar
Budescu, D. (1987). A Markov model for generation of random binary sequences. Journal of Experimental Psychology: Human Perception and Performance, 13, 2539.Google Scholar
Chi, M., Feltovich, P., & Glaser, R. (1981). Categorization and representation of physics problems by novices. Cognitive Science, 5, 121152.CrossRefGoogle Scholar
Çinlar, E. (1975). Introduction to stochastic processes, Englewood Cliffs, NJ: Prentice Hall.Google Scholar
Dempster, A., Laird, N., & Rubin, D. (1977). Maximum likelihood from incomplete data via the EM algorithm with discussion. Journal of the Royal Statistical Society, 39, 138.CrossRefGoogle Scholar
Edlefsen, E., & Jones, S. (1986). GAUSS programming language manual, Kent, WA: Aptech Systems.Google Scholar
Gardner, W. (1989). Analysis of individual variation in Markov chains, Charlottesville: University of Virginia, Department of Psychology.Google Scholar
Gardner, W., & Hartmann, D. (1984). Markov analysis of social interaction. Behavioral Assessment, 6, 229236.Google Scholar
Gottman, J. (1979). Detecting cyclicity in social interaction. Psychological Bulletin, 36, 338348.CrossRefGoogle Scholar
Gottman, J., & Roy, A. (1989). Sequential analysis: A guide for behavioral researchers, New York: Cambridge University Press.Google Scholar
Greeno, J. (1974). Representation of learning as a discrete transition in a finite state space. In Krantz, D., Atkinson, R., Luce, R., & Suppes, P. (Eds.), Contemporary developments in mathematical psychology: Vol. 1: Learning, memory, and thinking, San Francisco: W. H. Freeman.Google Scholar
Hanushek, E., & Jackson, J. (1977). Statistical methods for social scientists, New York: Acadmeic Press.Google Scholar
Heckman, J., & Singer, B. (1982). Population heterogeneity in demographic models. In Land, K., Rogers, A. (Eds.), Multidimensional mathematical demography, New York: Academic Press.Google Scholar
Louis, T. (1982). Finding the observed information matrix when using the EM algorithm. Journal of the Royal Statistical Society, 44, 226233.CrossRefGoogle Scholar
Schotenberg, R. (1985). Latent variables in the analysis of limited dependent variables. In Tuma, N. (Eds.), Sociological methodology 1985, San Francisco: Jossey-Bass.Google Scholar
Townsend, J., & Ashby, G. (1983). The stochastic modeling of elementary psychological processes, New York: Cambridge University Press.Google Scholar
White, H. (1982). Maximum likelihood estimation of misspecified models. econometrica, 50, 125.CrossRefGoogle Scholar
Wilkinson, A. (1982). Partial knowledge and self correction: Developmental studies of a quantitative concept. Development Psychology, 18, 876893.CrossRefGoogle Scholar