Inference
In Chapter 3, we learned how to estimate the parameters (βj) of our model using OLS. But an estimate from one sample is just that—one estimate. How confident can we be in it? How do we test our economic theories?
To perform inference, we need to know the sampling distribution of our OLS estimators. We add one final assumption to the Gauss-Markov assumptions:
Assumption MLR.6 (Normality):
The population error term u is independent of the explanatory variables and is normally distributed with mean 0 and variance σ².
Why? This assumption implies that the OLS estimators (β̂j) are also normally distributed. This is the foundation that lets us use the t-test and F-test.
This full set of six assumptions is called the Classical Linear Model (CLM) assumptions.
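In symbols, the key consequence of adding MLR.6 (conditional on the explanatory variables) is the standard result that

$$
\hat{\beta}_j \sim \mathrm{Normal}\big(\beta_j,\ \mathrm{Var}(\hat{\beta}_j)\big)
\qquad\text{and}\qquad
\frac{\hat{\beta}_j - \beta_j}{\mathrm{se}(\hat{\beta}_j)} \sim t_{\,n-k-1},
$$

which is exactly the sampling distribution that the t-tests and confidence intervals below rely on.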
The most common test we run is whether a variable xj has a statistically significant effect on y. We are testing a hypothesis about the unknown population parameter βj.
The null hypothesis (H0): this is the "boring" case, the theory we want to test against. Most often:
H0: βj = 0
(xj has no ceteris paribus effect on y)
The alternative hypothesis (H1): what you believe if the null is false. It can be two-sided (H1: βj ≠ 0) or one-sided (H1: βj > 0, or H1: βj < 0).
The t-statistic: this tells us how many standard errors our estimate is from the hypothesized value (usually 0).
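Written out, for the usual null H0: βj = 0 the t-statistic is simply the estimate divided by its standard error:

$$
t_{\hat{\beta}_j} = \frac{\hat{\beta}_j}{\mathrm{se}(\hat{\beta}_j)}
$$

(For a null of the form H0: βj = aj, subtract aj in the numerator.)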
We compare our t-statistic to a critical value from the t-distribution. If our t-stat is "extreme" enough, we reject the null hypothesis.
For a 5% two-sided test, we reject H0 if |t-statistic| > critical value. This happens if the t-stat falls in either of the 2.5% tails.
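As a quick illustration of where that critical value comes from, here is a minimal sketch using scipy (not part of the text; the degrees of freedom are an illustrative placeholder):

```python
from scipy import stats

df_resid = 120                       # n - k - 1; illustrative placeholder
alpha = 0.05
# Two-sided test: put alpha/2 = 2.5% in each tail
crit = stats.t.ppf(1 - alpha / 2, df_resid)
print(round(crit, 3))                # roughly 1.98 for 120 degrees of freedom
```

With more degrees of freedom the critical value shrinks toward the familiar 1.96 from the standard normal distribution.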
The p-value is a more informative way to summarize the evidence against the null hypothesis.
"The p-value is the smallest significance level at which we could reject the null hypothesis. It's the probability of observing a t-statistic as extreme as we did, if the null hypothesis were true."
Small p-value (e.g., < 0.05): Strong evidence against H0. We say the result is "statistically significant".
Large p-value (e.g., > 0.10): Weak evidence against H0. We "fail to reject" the null.
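The same logic in code, again as a sketch with placeholder numbers rather than results from the text: the two-sided p-value is the probability mass in both tails beyond the observed |t|.

```python
from scipy import stats

t_stat = 2.30                        # placeholder t-statistic
df_resid = 120                       # placeholder degrees of freedom (n - k - 1)
# Two-sided p-value: area in both tails beyond |t_stat| under the null
p_value = 2 * stats.t.sf(abs(t_stat), df_resid)
print(round(p_value, 3))             # about 0.02 -> significant at 5%, not at 1%
```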
Your regression output for the effect of `education` on `log(wage)` shows:
Coefficient = 0.092, Standard Error = 0.007
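Working this through with the numbers above: t = 0.092 / 0.007 ≈ 13.1. That is far beyond any conventional critical value (roughly 1.96 for a 5% two-sided test, about 2.58 at 1%), so the estimated return to education is statistically significant at any standard level and the p-value is essentially zero.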
Instead of just a point estimate, a confidence interval gives us a plausible range of values for the true population parameter βj.
Interpretation: "If we drew many random samples and constructed a 95% CI for each, we would expect 95% of those intervals to contain the true population parameter βj."
A useful shortcut: A 95% CI is roughly the point estimate plus or minus two standard errors.
If the 95% CI does not contain 0, it's equivalent to rejecting H0: βj = 0 at the 5% level against a two-sided alternative.
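Applying this to the education example above, the two-standard-error shortcut gives 0.092 ± 2(0.007), or roughly [0.078, 0.106]. Zero is nowhere near this interval, so we reject H0: βeduc = 0 at the 5% level, which matches the very large t-statistic computed earlier.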
What if we want to test if a group of variables has an effect on y? For example, in a salary regression, do performance metrics as a group matter?
We can't just check their t-stats individually. We need a joint hypothesis test.
The unrestricted model: the full model with all the variables.
log(sal) = β0 + β1yrs + β2games + β3bavg + β4hruns + u
The restricted model: the model with the null hypothesis (H0: β3 = 0, β4 = 0) imposed.
log(sal) = β0 + β1yrs + β2games + u
The F-test checks if the R-squared increases enough when we move from the restricted to the unrestricted model to justify adding the variables.
The F-statistic is calculated from the R-squareds of the two models (see the formula after this list), where:
R²ur = R-squared from the unrestricted model (the bigger one).
R²r = R-squared from the restricted model.
q = Number of restrictions (variables dropped).
n - k - 1 = Degrees of freedom in the unrestricted model.
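Putting these pieces together, the standard R-squared form of the F-statistic is

$$
F = \frac{(R^2_{ur} - R^2_{r})\,/\,q}{(1 - R^2_{ur})\,/\,(n - k - 1)},
$$

which, under H0 and the CLM assumptions, follows an F distribution with (q, n − k − 1) degrees of freedom.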
A large F-statistic provides evidence against the null hypothesis, suggesting the variables are "jointly significant."
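As a hedged sketch of how such a joint test can be run in practice, the snippet below uses statsmodels (not referenced in the text) with simulated stand-in data for the salary example, so the printed numbers are illustrative only:

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Simulated stand-in for the salary data (illustrative only, not the real data set)
rng = np.random.default_rng(0)
n = 353
data = pd.DataFrame({
    "yrs": rng.uniform(1, 20, n),
    "games": rng.uniform(200, 2500, n),
    "bavg": rng.normal(260, 30, n),
    "hruns": rng.poisson(60, n),
})
data["lsal"] = 11 + 0.07 * data["yrs"] + 0.0002 * data["games"] + rng.normal(0, 0.6, n)

unrestricted = smf.ols("lsal ~ yrs + games + bavg + hruns", data=data).fit()
restricted = smf.ols("lsal ~ yrs + games", data=data).fit()

# F-test of H0: the coefficients on bavg and hruns are both zero (q = 2)
f_stat, p_value, q = unrestricted.compare_f_test(restricted)
print(f"F = {f_stat:.2f}, p-value = {p_value:.3f}, restrictions = {q:.0f}")
```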
You run a regression and find that three variables (x1, x2, x3) are individually not statistically significant (their t-stats are small). However, the F-test for their joint significance (H0: β1=β2=β3=0) has a very small p-value.
How is this possible?
This is a classic symptom of multicollinearity. The variables x1, x2, and x3 are likely highly correlated with each other.
Because they move together, it's hard for OLS to disentangle their individual effects, leading to high standard errors and insignificant t-statistics. However, the F-test shows that as a group, they still have significant explanatory power.
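A toy simulation (not from the text) makes the pattern concrete: three regressors that are nearly copies of one underlying factor come out individually insignificant but jointly very significant.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(42)
n = 200
common = rng.normal(size=n)
# Three regressors that are almost identical copies of the same underlying factor
x1 = common + 0.05 * rng.normal(size=n)
x2 = common + 0.05 * rng.normal(size=n)
x3 = common + 0.05 * rng.normal(size=n)
y = 1 + 0.5 * common + rng.normal(size=n)

X = sm.add_constant(np.column_stack([x1, x2, x3]))
res = sm.OLS(y, X).fit()

print(res.tvalues[1:])   # individual t-statistics: typically all small
print(res.f_pvalue)      # p-value of the joint F-test: typically near zero
```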
The distinction between statistical significance and economic (practical) significance is one of the most important lessons in econometrics. The two concepts are not the same!
Statistical significance: determined by the size of the t-statistic (or p-value). It tells us how confident we are that an effect is not zero.
It is heavily influenced by sample size; with a huge sample, even tiny effects can become "statistically significant".
Economic (practical) significance: determined by the size and sign of the coefficient (β̂j). It tells us whether the variable's effect is large enough to be important in the real world.
This requires subject-matter knowledge and judgment.
Always discuss both! An effect can be statistically significant but too small to matter, or economically large but estimated too imprecisely to be statistically significant (especially in small samples).