Path: blob/master/notebook-for-learning/Chapter-8-Inferences-on-a-Population-Mean.ipynb
388 views
Chapter 8 - Inferences on a Population Mean
Confidence Intervals
A confidence interval for an unknown parameter θ is an interval that contains a set of plausible values of the parameter
It is associated with a confidence level 1 - α, which measures the probability that the confidence interval actually contains the unknown parameter value
Inference methods on a population mean based upon the t-procedure are appropriate for large sample sizes n ≥ 30 and also for small sample sizes as long as the data can reasonably be taken to be approximately normally distributed
Two-sided t-Interval
A confidence interval with confidence level 1 − α for a population mean based upon a sample of n continuous data observations with a sample mean and a sample standard deviation is where
The central limit theorem ensures that the distribution of X is approximately normal for large sample sizes
Interval length
If we want to know the interval length smaller than a certain amount, we need at least samples:
One-Sided t-Interval: One-sided confidence intervals with confidence levels 1-α for a population mean : for a lower bound and for an upper bound
Z-intervals
If we want to construct a confidence interval with a known value for the population standard deviation , then we have
Hypothesis testing
A null hypothesis H 0 for a population mean is a statement that designates possible values for the population mean.
It is associated with an alternative hypothesis , which is the “opposite” of the null hypothesis.
Two-sided set of hypotheses
versus
One-sided set of hypoteses
Can be either:
versus
or
versus
Interpretation of -values
Types of error
Type I error: An error committed by rejecting the null hypothesis when it is true.
Type II error: An error committed by accepting the null hypothesis when it is false.
Significance level
is specified as the upper bound of the probability of type I error.
-values of a test
The p-value of a test is the probability of obtaining a given data set or worse when the null hypothesis is true. A data set can be used to measure the plausibility of null hypothesis through the construction of a -value.
The smaller the -value, the less plausible is the null hypothesis.
Rejection and acceptance of the Null Hypothesis
Rejection: -value is smaller than the significance level, then is rejected
Acceptance: -value larger than the significance level, then is plausible
Calculation of -values
where
Two-sided t-test
If we test versus then
-value =
One-sided t-test
If we test versus then
-value = n = 40
If we test versus then
-value =
Significance level of size
A hypothesis test with a significance level of size α
rejects the null hypothesis if a p-value smaller than α is obtained
accepts the null hypothesis if a p-value larger than α is obtained.
Two-sided problems
A size test for versus reject if the test statistics is in the rejection region and accepts if in the acceptance region
One-sided problems
A size test for versus rejects if the test statistics is in the rejection region and accepts if in the acceptance region
-tests
We test similarly to the t-statistics, but with data with sample size from assuming is known
The Z-statistics is given by:
Power of a hypothesis test
Definition:
power = 1 - P(Type II error )
which is the probability that the null hypothesis is rejected when it is false.
Computation of the power of a hypothesis test
We want to test vs with significance level . We assume a sample size from
If
Determination of sample size in hypotheses testing
Find for which with
When , we need more samples