12.8 Interpreting the Confidence Interval
Now that we have spent some time constructing confidence intervals, it is important to pause and think about what a confidence interval means, and how it fits with other concepts we have studied so far.
Confidence Intervals Are About the DGP
One common misconception about confidence intervals is that they
define lower and upper cutoffs for where .95 of the
But that was just a method for calculating the interval, not a
definition of what the interval actually refers to. It is important to
remember that we developed the concept of confidence interval by
mentally moving the sampling distribution of
If we did want to know the range of possible sample
Error in an Estimate
The
But being the best doesn’t mean it’s right. The point estimate is
almost certainly wrong. It might be too low or it might be too high, but
we don’t know which way it is wrong. And to make matters worse, we can’t
know for sure how far off it is from the true DGP unless we know what
the true
The confidence interval provides us with a way of addressing this problem. It tells us how wrong we could be, or put another way, how much error there might be in our estimate given a certain desired level of confidence.
If the confidence interval is relatively wide given the situation, as it is in the tipping study, we would be saying something like, “the estimated effect of adding a smiley face to the check is 6.05 percentage points. But there is a lot of error in the estimate. We can say with 95% confidence that the true effect could be as low as 0 or slightly below that, or as high as 13.”
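To make "how wrong we could be" concrete, here is a minimal Python sketch that turns a point estimate and its interval into a width and a largest plausible error. The function name `summarize_interval` is hypothetical, and the bounds 0 and 13 are the rounded values from the sentence above, not the exact interval from the study.

```python
def summarize_interval(estimate, lower, upper):
    """Return the interval's width and the largest gap between the
    point estimate and either end of the interval."""
    width = upper - lower
    largest_gap = max(estimate - lower, upper - estimate)
    return width, largest_gap


# Point estimate from the text; bounds rounded from the prose (an assumption).
estimate = 6.05
lower, upper = 0.0, 13.0

width, largest_gap = summarize_interval(estimate, lower, upper)
print(f"Interval width: {width:.2f} percentage points")
print(f"Estimate could be off by as much as about {largest_gap:.2f} points")
```

A wider interval means a larger possible gap between the estimate and the true value, which is why width is a natural way to talk about error in an estimate.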
The width of the confidence interval (CI) tells us what the true
It is important to note that when we talk about the error in an estimate, we are using the term error to mean something a little different from what it has meant up to now. Previously, when we developed the concept of error (as in DATA = MODEL + ERROR), we were referring to the gap between the predicted tip for each table based on a model, and the actual tip left by that table. The errors were the individual residuals for each table.
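The earlier sense of error, one residual per table, can be sketched in a few lines of Python. The tip percentages below are made up for illustration (they are not the tipping study's data), and the names `tips`, `group_means`, `residuals`, and `estimated_effect` are hypothetical.

```python
from statistics import mean

# Hypothetical tip percentages for a few tables in each condition.
tips = {
    "control": [25.0, 30.0, 20.0, 29.0],
    "smiley": [35.0, 28.0, 33.0, 36.0],
}

# MODEL: predict each table's tip with the mean of its group.
group_means = {group: mean(values) for group, values in tips.items()}

# ERROR: one residual per table (actual tip minus predicted tip).
residuals = {
    group: [tip - group_means[group] for tip in values]
    for group, values in tips.items()
}

# The parameter estimate is a single number for the whole sample,
# so it has no per-table residual of its own.
estimated_effect = group_means["smiley"] - group_means["control"]

print("Residuals:", residuals)
print("Estimated effect:", estimated_effect)
```

Every table gets its own residual, but there is only one estimated effect. Error in that estimate has to be thought about at the level of the whole sample, not table by table.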
When we think about error around a parameter estimate, though, we’re
not thinking about individual tables anymore. A single table can’t have
a
Because we generally don’t know what the true
In the case of the tipping experiment, we started with a point
estimate of the
What Does the 95% Mean?
One question you might have is this: what does it mean to have 95% confidence?
Let’s start by explaining what it does not mean. It
does not mean that there is a .95 probability that the true
One reason they will correct you is that
The other reason they will correct you is that there isn’t actually a
.95 chance that the
Because of this issue, someone (actually, a mathematician named Jerzy Neyman, in 1937) came up with the idea of saying “95% confident” instead of “95% probable.” Our guess is that all the statisticians and mathematicians breathed a sigh of relief over this.
When you construct a 95% confidence interval, therefore, you are
saying that you are 95% confident (alpha = .05) that the true parameter in the DGP falls within the interval.
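One way to build intuition for "95% confident" is to simulate a DGP where we get to choose the true effect, then check how often intervals built from random samples capture it. The Python sketch below is only an illustration under simplifying assumptions: the true effect, group mean, standard deviation, and sample size are all made up, the population standard deviation is treated as known, and the interval uses a normal critical value. The function name `coverage_rate` is hypothetical.

```python
import random
from statistics import NormalDist, mean


def coverage_rate(true_effect=6.0, sigma=11.0, n_per_group=22,
                  n_sims=2000, seed=1):
    """Simulate many two-group studies from a known DGP and return the
    proportion of 95% intervals that contain the true effect."""
    rng = random.Random(seed)
    z = NormalDist().inv_cdf(0.975)          # about 1.96
    # Standard error of a difference in means when sigma is known.
    standard_error = sigma * (2 / n_per_group) ** 0.5
    margin = z * standard_error

    hits = 0
    for _ in range(n_sims):
        control = [rng.gauss(27.0, sigma) for _ in range(n_per_group)]
        smiley = [rng.gauss(27.0 + true_effect, sigma)
                  for _ in range(n_per_group)]
        estimate = mean(smiley) - mean(control)
        # Does this sample's interval capture the true effect?
        if estimate - margin <= true_effect <= estimate + margin:
            hits += 1
    return hits / n_sims


rate = coverage_rate()
print(f"Proportion of intervals containing the true effect: {rate:.3f}")
```

In a simulation like this the true effect never moves. What varies from sample to sample is the interval. The 95% describes how often the procedure produces an interval that captures the true value, which is one reason "confident" is a safer word than "probable" for any single interval.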