PSY 356
In the presence of nested data, the assumption of independence of errors is not met.
Multilevel models allow us to...
...model group-level effects, if they are of substantive interest
...determine the relative magnitude of within-group effects
...generalize to a population of group effects
In the presence of nested data, the assumption of independence of errors is not met.
This is key, because a standard linear model assumes that between-group differences are the only source of variation
If we ignore nesting of data, we obtain biased estimates of coefficients and standard errors
Standard errors often downwardly biased, leading to inflated Type I error rate
Randomly sample one observation from each group
Fixed effects model
i.e., including a regression coefficient for each grouping
Two approaches which only estimate marginal effects
Generalized estimating equations
Adjustments to standard errors which adjust for clustering
Huber-White SEs, so-called "sandwich" estimator
Multilevel models allow us to...
...model group-level effects, if they are of substantive interest
e.g., How does school climate affect math scale scores?
...determine the relative magnitude of within-group effects
e.g., How much do school-level factors affect math scale scores, relative to child-level scale scores?
...generalize to a population of group effects
e.g., How much do school-level factors affect math scale scores, relative to child-level scale scores, above and beyond the schools we are currently considering?
Multilevel models, at least theoretically, allow us to model the interplay between these levels.
Nationally representative study children's cognitive and social development from kindergarten to eighth grade
\(N=15305\) children sampled starting in 1998
Here we look at math skills in a cross-section of students in third grade
Children nested within school
Questions we wish to answer
To what extent do differences between children in math ability owe to differences between schools?
Which child-level factors are associated with higher math scores?
Which school-level factors are associated with higher math scores?
Do the effects of child-level factors vary from one school to the next?
where \(i\) indexes children and \(j\) indexes school.
\(\beta_{0j}\) is a subject's predicted math score, given that they are a student at school \(j\).
\(r_{ij}\) is the subject-specific deviation from this predicted mean.
\(\sigma^2\) is the within-school variance.
\(\gamma_{00}\) is the grand mean math score across schools.
\(u_{0j}\) is the school-specific deviation from this grand mean.
\(\tau_{00}\) is the between-school variance.
grand mean across schools
some school-specific deviation from that grand mean
some child-specific deviation from the school-implied value
Total variance = \(\tau_{00}\) +\(\sigma^2\)
\(ICC = \frac{BetweenGroups Variance}{Total Variance}\)
\(ICC = \frac{\tau_{00}}{\tau_{00}+\sigma^2}\)
Questions we wish to answer
To what extent do differences between children in math ability owe to differences between schools?
Which child-level factors are associated with higher math scores?
Which school-level factors are associated with higher math scores?
Do the effects of child-level factors vary from one school to the next?
\(\beta_{0j}\) is the predicted math score for a child who watches no TV, given that they are a student at school \(j\).
\(\beta_{1j}\) is the effect of hours of TV watched on math score for school \(j\). Note that it is the same for all schools here.
where \(i\) indexes children and \(j\) indexes school.
random
fixed
random
fixed
Note that this model could also be run (erroneously) as a standard linear regression by getting rid of random effects!
We predict math scale score from the number of hours of TV watched after dinner without accounting for nesting within schools
We find a fairly precipitous drop, predicting a 2.32-point reduction in math score for each hour of TV watched.
Questions we wish to answer
To what extent do differences between children in math ability owe to differences between schools?
Which child-level factors are associated with higher math scores?
Which school-level factors are associated with higher math scores?
Do the effects of child-level factors vary from one school to the next?
Here \(\gamma_{01}\) conveys the effect of \(PctFRL_j\) (the percentage of students qualifying for free or reduced lunch at school \(j\)) on the overall predicted math score for school \(j\).
\(\beta_{0j}\) is the predicted math score for a child who watches no TV, given that they are a student at school \(j\).
\(\beta_{1j}\) is the effect of hours of TV watched on math score for school \(j\). Note that it is the same for all schools here.
random
fixed
Note that even though \(PctFRL_j\) is a school-level variable and \(HoursTV_i\) is a child-level variable, both are fixed effects.
Questions we wish to answer
To what extent do differences between children in math ability owe to differences between schools?
Which child-level factors are associated with higher math scores?
Which school-level factors are associated with higher math scores?
Do the effects of child-level factors vary from one school to the next?
Here \(\gamma_{01}\) conveys the effect of \(PctFRL_j\) (the percentage of students qualifying for free or reduced lunch at school \(j\)) on the overall predicted math score for school \(j\), and \(\gamma_{01}\) conveys the effect of \(PctFRL_j\) on the effect of \(HoursTV_i\).
Here \(\gamma_{01}\) conveys the effect of \(PctFRL_j\) (the percentage of students qualifying for free or reduced lunch at school \(j\)) on the overall predicted math score for school \(j\).
random
fixed
fixed