Structural Equation Modeling, Part 2

PSY 356

But first: what is SEM?

Structural equation modeling (SEM) "can be viewed as a combination of factor analysis and regression or path analysis." (Hox and Bechger, 2011)

...OK, but what are those?

Factor analysis

Factor analysis aims to explain common variance among observed indicators in terms of latent variables. It is often referred to as a measurement model.

Math

Verbal

Arithmetic

Matrix reasoning

Word problems

Analogies

Reading comprehension

Vocabulary

Path analysis

Path analysis aims to find directional paths from one observed variable to another. It is often referred to as a structural model.

Math instruction

Math self-efficacy

Math performance

Attitudes toward school

Structural equation modeling

Structural equation modeling is a general term that refers to any combination of a measurement model and a structural model.

Why use SEM?

Some would say that SEM more closely corresponds to the theoretical mechanism of change than regression.
...but when there is just one predictor and one mediator (as in the example we just did) there isn't really a benefit.
- As we'll see in lab, we get the same results
And the conceptual challenges and caveats are still the same!
The benefit arises once things become more complicated -- i.e., once we start adding in more variables and paths between them. Then we can test model fit.

Model fit

How well does our model replicate our data's means and covariances?
- Every model gives us a model-implied covariance matrix and model-implied means.
- The discrepancy between those model-implied values and our actual values indicates how well the model fits.
- The likelihood represents a function of this discrepancy, which gets higher if the model fits better and lower if the model fits worse

Back to our example

Math instruction

Math self-efficacy

Math performance

Attitudes toward school

Back to our example

Math instruction

Math self-efficacy

Math performance

Attitudes toward school

Means

Covariances

Means and covariances of school attitudes, math instruction, math self-efficacy, and math performance, respectively*

The key question is: how well does our model replicate these means and covariances?

*Note that these are not the actual values. You will see the real values in lab.

Model fit

The basic premises:
1. SEM implies that some set of relationships exists among the variables. This means that some relationships do not exist among the variables - i.e., that we aren't modeling every possible relationship that could exist.
2. If we could model every possible relationship among the variables, we could replicate the means and covariances perfectly. We would also require a ton of parameters and the model would not be informative.
3. How can we balance the fit of the model against not having too many parameters?

Fit indices

Fit indices are a function of the likelihood and the number of parameters.
We start with a \(\chi^2\) distribution.
- Degrees of freedom are # means, variances, and covariances - # parameters in the model
  - Significant test = model does not fit
  - The \(\chi^2\) itself is almost always significant. What is interesting are the fit indices we can construct from this statistic.

"Absolute" fit indices

Root mean squared error of approximation (RMSEA)
- Rule: Values between 0 and 0.08 are OK; below 0.10 are marginal.
Tucker-Lewis Index (TLI) and Comparative Fit Index (CFI)
- Rule: Values above 0.9 are OK; above .95 is better.
Squared Root Mean Residual (SRMR)
- Rule: Values below 0.08 are OK.

Honestly, these "rules" are highly subjective and fallible. You should at some point learn how these fit indices are formulated, but today doesn't need to be that day.

Factor analysis

Factor analysis aims to explain common variance among observed indicators in terms of latent variables. It is often referred to as a measurement model.

Math

Verbal

Arithmetic

Matrix reasoning

Word problems

Analogies

Reading comprehension

Vocabulary

Factor Analysis: EFA vs CFA

Factor Analysis: Statistical method used to identify latent variables (factors) that explain the correlations among observed variables
Two main approaches:
- Exploratory Factor Analysis (EFA): Explores the possible underlying factor structure without imposing a predetermined model
- Confirmatory Factor Analysis (CFA): Tests how well measured variables represent the number of constructs based on theory
Key differences relate to:
- Purpose (explore vs. confirm)
- Theory requirements (minimal vs. substantial)
- Factor structure specification (determined by data vs. researcher)

Exploratory Factor Analysis (EFA)

Purpose: Identify underlying factor structure when relationships are uncertain
Key choice points:
- Number of factors to extract (eigenvalues, scree plot, parallel analysis)
- Extraction method (Principal Components, Maximum Likelihood, etc.)
- Rotation method
Interpretation: Based on pattern of factor loadings (correlations between items and factors)
Limitations:
- Subjective decisions required
- Sample-specific results
- Difficult to replicate exactly

A 2-factor EFA

Arithmetic

Matrix reasoning

Word problems

Analogies

Reading comprehension

Vocabulary

A 1-factor EFA

Arithmetic

Matrix reasoning

Word problems

Analogies

Reading comprehension

Vocabulary

EFA: Rotation Methods

Why rotate? To improve interpretability by maximizing high loadings and minimizing low loadings

Orthogonal rotations (assume factors are uncorrelated):
- Varimax: Maximizes variance of squared loadings within factors
- Quartimax: Simplifies rows of loading matrix
- Equamax: Combination of varimax and quartimax
Oblique rotations (allow factors to correlate):
- Promax: Computationally efficient rotation
- Direct oblimin: Control degree of correlation
- Geomin: Useful for complex factor structures
Don't worry about all the -maxes and -mins. Just know that we need to use oblique rotation most of the time. Why do you think that is?

Confirmatory Factor Analysis (CFA)

Purpose: Test how well measured variables represent constructs based on theory
Requirements:
- Strong theoretical foundation
- A priori specification of:
  - Number of factors
  - Which variables load on which factors
  - Whether factors correlate
Features:
- Tests specific hypothesized measurement model
- Provides fit indices to evaluate model quality
- Allows for constraining or freeing parameters
- Can test competing theoretical models

CFA's are SEM's!

CFA is the measurement component of SEM
- Defines how latent variables are measured by observed variables
- Establishes construct validity before testing structural relationships
Two-step approach to SEM (Anderson & Gerbing, 1988):
- Step 1: Validate measurement model using CFA
- Step 2: Test structural model with paths between latent variables
Benefits of this approach:
- Separates measurement issues from structural issues
- Ensures valid measurement before testing structural relationships
- Makes identification problems easier to locate
- Reduces interpretational confounding

deck

By Veronica Cole

Structural Equation Modeling, Part 2

But first: what is SEM?

Factor analysis

Path analysis

Structural equation modeling

Why use SEM?

Model fit

Back to our example

Back to our example

Model fit

Fit indices

"Absolute" fit indices

Factor analysis

Factor Analysis: EFA vs CFA

Exploratory Factor Analysis (EFA)

A 2-factor EFA

A 1-factor EFA

EFA: Rotation Methods

Confirmatory Factor Analysis (CFA)

CFA's are SEM's!

deck

More from Veronica Cole