Does OpenMx estimate the ordered family using a link function like probit or logit? Or may I open a feature request on github?

I found one years old Q&A in this forum here stating the the threshold model was akin to the probit. And this other Q&A mentions how to specify a probit model. So I am asking this question for a clearer answer.

It puzzles me that I cannot find software (besides perhaps Stata) that enables estimating a SEM with latent factors using family link functions (e.g. ordered + logit). As far as I understand, only with that method, the factors are estimated in relation to the distribution of the items. Please let me know if you believe there are reasons that this is mistaken or not important.

"Categorical Threshold Estimation -- Models with categorical outcomes can be estimated, including thresholds for the categories."

- openmx-features

Maximum likelihood estimation for ordinal variables is done by generating expected covariance and mean matrices for the latent continuous variables underlying the set of ordinal variables, then integrating the multivariate normal distribution defined by those covariances and means. The likelihood for each row of the data is defined as the multivariate integral of the expected distribution over the interval defined by the thresholds bordering that row’s data.

OpenMx uses Alan Genz’s SADMVN routine for multivariate normal integration (see http://www.math.wsu.edu/faculty/genz/software/software.html for more information). When continuous variables are present, OpenMx utilizes a block decomposition to separate the continuous and ordinal covariance matrices for FIML. The likelihood of the continuous variables is calculated normally. The effects of the point estimates of the continuous variables is projected out of the expected covariance matrix of the ordinal data. The likelihood of the ordinal data is defined as the multivariate integral over the distribution defined by the resulting ordinal covariance matrix.

Hi

I'm not sure I understand your question, but I'm going to give it a shot:

There are two main approaches for analyzing ordinal (ordered factor) data in OpenMx. One is to supply the raw data and request the maximum likelihood fit function, which proceeds by generating the expected covariance matrix and means (though these may be zero if the thresholds are being estimated) then calculating the integral of the MVN distribution between the thresholds defined by the particular values of the ordinal variable. So, e.g., analyzing just one 3 category 0/1/2 variable, the likelihood, given the model's parameter values (covariances or a path model that defines them in terms of other parameters) would be the MVN integral from minus infinity to the first threshold for a score of 0, from threshold 1 to threshold 2 for those scoring 1, and from threshold 2 to plus infinity for those scoring 2.

WLS initially operates somewhat similarly, but it first estimates all the polychorics and the thresholds, and the covariance matrix of the parameters, whose inverse is used for the weights. Essentially model fitting then proceeds based on the observed and expected statistics: (o-e)' inv(W) (o-e) where o and e are the previously estimated correlations and thresholds.

Does that help?

Hi Neal,

Thank you very much for that kind response.

This was already helpful. I take away that the WLS estimator does not offer what I ask for. But please let me take a step back: Can you help me figure out whether it is possible to use OpenMx a bit like the Stata software's GSEM command (as in Generalized SEM)? -- (a) would that be by the Full Information ML method you described? (b) does it use the probit link function? (c) would you recommend testing for multivariate normality MVN of the ordered items before using this method?

GSEM enables estimating latent factors using ologit, ordered logit, where logit is a link function and the ordered/ordinal scale is a distribution family (see e.g. the documentation). As far as I understand, this estimation technique is necessary to ensure that the model is closely informed by the distribution of the data; the ordered items. In contrast, computing from the ordered items a polychoric correlation matrix piped into a model implies that the model (perhaps estimated with WLS) will be less informed by the actual distribution of the data.

So far, for some days ago, I picked up the lines below from the documentation (leaving it aside to ask the question you help answering):

Ohh yeah, and sure, the question was phrased a bit weird. Sorry about that. I guess I meant to ask whether OpenMx is able to estimate models using a (probit/logit) link function to the ordered family distribution.

(I was not able to edit my initial response you your answer, so this comes in a second response)