3 Single and Factorial ANOVA

For this exercise, we will use a related but different problem.

We are interested in how children’s language development and their parents’ socio-economic status (SES) are related. We also want to see whether gender has an effect on early language development. SES is a well-known variable in sociolinguistics (and in sociology more generally), here with three levels: ‘low’, ‘middle’ and ‘high’. We use a binary gender variable with two levels, ‘female’ and ‘male’.

We recruit ten 3-year-old children for each combination of these two factors (3 × 2 = 6 groups, 60 children in total). We record mother-child dialogs in comparable situations, and calculate the MLU for each child. The data look like the following:

subject	SES	gender	MLU
1	low	male	1.81
11	low	female	1.56
21	middle	male	2.96
31	middle	female	3.64
41	high	male	3.02
51	high	female	2.79
⋮	⋮	⋮	⋮

You can get the full dataset here.

Note: the data (somewhat) match the results in the literature, but they were randomly generated for this demonstration. Your results will not necessarily match reality.

In all questions, assume an α-level of 0.05.

Exercise 3.1. Create two box plots, one describing MLU by SES and the other by gender. Which differences do you expect to be statistically significant?

Exercise 3.2. Check the normality of MLU within each SES group using normal Q-Q plots. Do the distributions look normal?
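To make concrete what a normal Q-Q plot displays, the following is a minimal sketch in Python (not part of the SPSS workflow). It pairs each sorted observation with the standard-normal quantile at the same rank; points that fall roughly on a straight line suggest an approximately normal distribution. The function name `qq_points`, the plotting position (i − 0.5)/n, and the example values are assumptions made for illustration; statistical packages may use a slightly different plotting-position formula.

```python
from statistics import NormalDist

def qq_points(values):
    """Pair each sorted observation with its standard-normal quantile."""
    ordered = sorted(values)
    n = len(ordered)
    std_normal = NormalDist()  # mean 0, standard deviation 1
    # The plotting position (i - 0.5) / n is an assumed convention here.
    return [
        (std_normal.inv_cdf((i - 0.5) / n), x)
        for i, x in enumerate(ordered, start=1)
    ]

# Hypothetical MLU values for one SES group (NOT the course dataset).
example_group = [1.81, 1.56, 2.10, 1.95, 1.40]
for theoretical, observed in qq_points(example_group):
    print(f"{theoretical:6.2f}  {observed:.2f}")
```

Plotting the first column against the second gives the Q-Q plot for that group; the same would be repeated for each SES level.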

Exercise 3.3. Perform an appropriate ANOVA to investigate the effect of SES on children’s MLU. Make sure to include the test for homogeneity of variances in the options dialog, and also include pairwise comparisons using the Bonferroni correction.

Is the homogeneity of variances assumption met? Which part of the output tells you this?
Do you get a significant effect of SES?
Which levels (groups) of SES differ significantly from each other?
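For readers who want to see what the one-way ANOVA computes, here is a minimal sketch in Python using only the standard library. It splits the total variability into between-group and within-group sums of squares and forms the F ratio, then shows the Bonferroni-adjusted α for all pairwise comparisons among three groups. The function name `one_way_anova` and the toy numbers are assumptions for illustration, not the course data, and the sketch does not compute a p-value.

```python
from math import comb
from statistics import mean

def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of groups of numbers."""
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    # Variability of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variability of the observations around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Toy data for three groups (hypothetical, NOT the course dataset).
toy_groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat, df_b, df_w = one_way_anova(toy_groups)
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")

# Bonferroni: divide alpha by the number of pairwise comparisons.
n_comparisons = comb(3, 2)            # three SES levels -> 3 pairs
adjusted_alpha = 0.05 / n_comparisons
print(f"adjusted alpha = {adjusted_alpha:.4f}")
```

Some software applies the same correction by multiplying each pairwise p-value by the number of comparisons and comparing the result with the original α, which leads to the same decisions.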

Exercise 3.4. Perform a two-way ANOVA using both factors, SES and gender. Make sure to include the test for homogeneity, effect sizes, and the interaction plot in the SPSS output (TIP: interpretation is easier if you put gender on the x-axis and plot SES as separate lines).

Do you see any interaction patterns between the two factors in the interaction graph?
Which main effects are statistically significant?
Given whether the interaction term is significant, can you interpret the main effects directly?
How do you interpret the effect sizes for the significant effects you have found?
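As an aid to reading the effect-size column, the sketch below shows how partial eta squared is defined: the sum of squares for an effect divided by the sum of that effect's and the error sums of squares. This is the effect size that SPSS's univariate ANOVA output typically reports, though you should confirm the column label in your own output. The function name and the input values are hypothetical.

```python
def partial_eta_squared(ss_effect, ss_error):
    """Proportion of (effect + error) variability attributed to the effect."""
    return ss_effect / (ss_effect + ss_error)

# Hypothetical sums of squares (NOT taken from the course dataset).
print(partial_eta_squared(ss_effect=26.0, ss_error=6.0))
```

In practice, the two sums of squares would be read from the "Type III Sum of Squares" column for the effect and for the error row of the ANOVA table.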
