1 Simple regression

It is claimed that caregivers adapt their language to the language abilities of children during language acquisition. We follow a particular child and record an hour conversation between the child and her mother once every month between the child’s second and fourth birth dates. We calculate a well-known measure of language complexity/competency mean length of utterance (MLU), for the child and her mother for each recording session. The data is given below.1

Age (months)Child’s MLUMother’s MLU
25 1.46 5.42
26 1.41 5.69
27 1.66 6.27
28 1.74 6.10
29 1.90 6.06
30 1.91 5.98
31 1.85 6.10
32 2.06 6.09
33 2.27 6.10
34 2.43 6.14
35 2.70 6.42
36 2.81 6.35
37 2.69 6.21
38 2.72 6.07
39 2.64 5.84
40 3.05 6.17
41 3.22 5.74
42 3.42 6.11
43 3.70 6.41
44 3.90 5.50
45 3.57 6.00
46 3.49 6.90
47 3.66 6.65
48 3.64 6.40

You can also get the data as an SPSS data file here.

Exercise 1.1. Draw a scatter plot of Mother’s MLU vs. Child’s MLU. Also draw a fitted regression line over the scatter plot.

Mark or clearly describe the most influential data point in the scatter plot.

Exercise 1.2. Perform a linear regression analysis that predicts the mother’s MLU using the child’s MLU as the predictor. Answer the questions below using the analysis performed.
  1. What are the estimated coefficients, intercept (a) and slope (b).
  2. Assuming the linear regression model is correct, how do you interpret the values of intercept and the slope?
  3. Check whether residuals are normally distributed or not. Which part of the output is useful for answering this question?
  4. We hypothesize that caregivers adjust their language based on children’s language proficiency. As a result, we expect the mother’s MLU to increase as the child’s MLU increases.

    State whether this hypothesis is supported or disproved by the regression model at α-level 0.05.

  5. What is the rate or percentage of variation in the mother’s MLU explained by the child’s MLU? Which statistics in the output express the explained variance?