Hypothesis Testing

statistical hypothesis : a statement about the numerical value of a population parameter

null hypothesis : the status quo; assumed to be true unless there’s convincing data otherwise. Denoted $H_{0}$

alternative hypothesis : hypothesis that’s only accepted if there’s compelling evidence, denoted $H_{a}$

test statistic : sample statistic, computed from information in the sample, used to decide between null and alternative hypothesis.

Type I error : (relationship doesn’t actually exist); researcher rejects the null hypothesis in favor of the alternative, when $H_{0}$ is true. The probability of committing Type I error is denoted $α$ . aka false positive

Type II error : (relationship really does exist); when $H_{0}$ is accepted, but it’s actually false. The probability of this error is denoted by $β$ . aka false negative

Power (statistics)- rejection region :: the possible values where the researcher will reject $H_{0}$ in favor of $H_{α}$ .

criterion : probability that defines whether it’s unlikely to represent the underlying population.

critical value : the score that marks the inner edge of the region of rejection

one-tailed test : the alternative case is strictly greater than the specific value (e.g. “strength of pipe must be > 2400 psi”)

two-tailed test : alternative hypothesis exists within some bounds (e.g. tolerance of a machined part)

p-value : aka observed significance level, is the probability of observing a value of the test statistic that is at least as contradictory to the null hypothesis and supportive of the alternative hypothesis, as the actual one computed form the sample data. Reworded: The likelihood that a test statistic overlaps with an interval, given a confidence value.

power of the test : probability that the test will correctly lead to the rejection of the null hypothesis for a particular value of $μ$ or $p$ in the alternative hypothesis. The power is equal to $(1 - β)$ .

significant : It means that this result is unlikely to have occured due to a sampling error; Indicates a rejection of the null hypothesis $H_{0}$

significance level : This is $α$ . $1 - α$ is the confidence level.

inferential statistics : procedures for deciding whether sample data represent a particular relationship in the population

parametric statistics : inferential stats for computing the mean; Require certain assumptions about the raw score population represented by the sample

nonparametric statistics : used for median and mode; don’t require assumptions about the raw score population of the sample

When a sample statistic exists outside of the “rejection region”, we can’t reject the null hypothesis.. but rather say “the sample evidence is insufficient to reject $H_{0}$ at $α = .05$ “.

Elements of a hypothesis test

Null hypothesis ( $H_{0}$ ).
Alternative (research) hypothesis ( $H_{α}$ ).
Test statistic.
Rejection region, which uses $α$ (also referred to as the “level of significance”).
Assumtpions: clear statements about the population being sampled.
Experiment & calculation of test statistic: the computation of the test statistic
Conclusion: rejection (with possible type I error) if the value is in the rejection region. Insufficient evidence to reject if it isn’t (given we don’t know the probability of $β$ , which is the likelihood of type II error).

Formulating a hypothesis

Pick an alternative hypothesis
1. upper-tailed ( $H_{a} : μ > 2400$ )
2. lower-tailed ( $H_{a} : μ < 2400$ )
3. two-tailed ( $H_{a} : μ \neq = 2400$ )
select null hypothesis ( $H_{0} : μ = 2400$ ).

Calculating a p-value

Determine the value of the test stat $z$ corresponding with the result of the sampling experiment.
If the test is one-tailed, the p-value is the area above or below (depending on which tail) the observed z-value. If it’s two-tailed, p-value is 2x the tail area beyond the z-value in the direction of z.

In practice, this means that you take the CDF of the z value (or perhaps, 1-z value, depending on direction) and check if it’s smaller than the $α$ you’ve chosen. If so, you should reject the null hypothesis, b/c it lives within the rejection region.

Calculating a p-value from a z score

You just need to take the cdf of the z-score. Ensure you’re capturing the tail(s) that you care about.

Converting a two-tailed p-value to a one-tailed p-value

p = \frac{reported p-value}{2} if ⎩ ⎨ ⎧ H_{a} is of the form > and z is positive H_{a} is of the form < and z is negative

p = 1 - (\frac{reported p-value}{2}) if ⎩ ⎨ ⎧ H_{a} is of the form > and z is negative H_{a} is of the form < and z is positive

p-value for proportions / chi

draw curve
draw region
label areas / test stat
compute the p-value

to compute the p-value, compute the chi-square CDF for the null hypothesis.

Statistical tests

Statistical tests often take the form of:

(observed difference - what we expect if null is true) / average variation

Z statistic testing

Statistical tests Used when you have a large (n>=30) sample Test statistic: $z_{c} = \frac{x ˉ - μ _{0}}{σ / n}$ Rejection region depend on alternative hypothesis.

$H_{a} : μ < μ_{0}, z_{c} < - z_{α}$
$H_{a} : μ > μ_{0}, z_{c} > z_{α}$

This requires:

a random sample
the sample size is large enough ( $n \geq 30$ )

T statistic testing

Statistical tests Used for small samples Test statistic: $t_{c} = \frac{x ˉ - μ _{0}}{s / n}$

Rejection region depend on alternative hypothesis.

$H_{a} : μ < μ_{0}, t_{c} < - t_{α}$
$H_{a} : μ > μ_{0}, t_{c} > t_{α}$

This requires:

A random sample
The population is approximately normal

Hypothesis about a population proportion

Test Statistic: $z = \frac{p ^ - p _{0}}{p _{0} q _{0} / n}$ Rejection region depend on alternative hypothesis.

$H_{a} : p < p_{0}, z_{c} < - z_{α}$
$H_{a} : p > p_{0}, z_{c} > z_{α}$

This requires:

A random sample of a binomial population
Sample size is n large ( $n p_{0} \geq 15, n q_{0} \geq 15$ )

Calculating $β$ for a mean

Calculate $\overset{x}{ˉ}$ that corresponds to the border between acceptance/rejection regions, denoted $\overset{x}{ˉ}_{0}$ . $\overset{x}{ˉ}_{0} = μ_{0} + z_{α} σ_{\overset{x}{ˉ}}$ for upper tailed tests. (minus for lower, and both plus and minus w/ $\frac{α}{2}$ instead for two tailed)
Specify the value of $μ_{a}$ in the alternative hypothesis to calculate for $β$ . Convert $\overset{x}{ˉ}_{0}$ to z-values, using the alternative. $z = \frac{x ˉ _{0} - μ _{a}}{σ _{\overset{x}{ˉ}}}$ .

Example

$H_{0} : μ = 2400, H_{α} : μ > 2400$ test statistic: $z = \frac{x ˉ - 2400}{σ / n}$ Rejection region: $z > 1.645$ for $α = .05$ . (note: this is only one tailed) s: 200 n: 50

—

\overset{x}{ˉ} = μ_{0} + 1.645 σ_{\overset{x}{ˉ}} = 2400 + 1.645 (\frac{σ}{n}) \approx 2400 + 1.645 (\frac{s}{n} = 2400 + 1.645 (\frac{200}{5 0}) = 2400 + 1.645 (28.28) = 2446.5

Then find the z-value for $\overset{x}{ˉ}_{0}$ (the border between rejection/acceptance).

Unsure: Where does 2425 come from? Do you just pick it out of a hat b/c it’s in the alternative hypothesis?

z = \frac{x ˉ _{0} - 2425}{σ _{\overset{x}{ˉ}}} \approx \frac{x ˉ _{0} - 2425}{s / n} = \frac{2446.5 - 2425}{28.28} = .76

so z = .76. The area under the curve is $β = .5 + .2764 = .7764$ .

This B value comes from Table II of our book, which gets the area under the curve. Likely need to +.5 if it’s bigger than the mean.

Calculating $β$ for a p (proportion)

Same as above, but we use $\frac{p _{0} q _{0}}{n}$ instead of $σ_{\overset{x}{ˉ}}$ aka $\frac{s}{n}$

Hypothesis about population variance

NOTE: Be careful on if you’re talking about variance or stddev and adjust accordingly.

test statistic: $χ_{a}^{2} = \frac{( n - 1 ) s ^{2}}{σ _{0}^{2}}$

Assumes:

random sample
population is approximately normal

For CVA (common value approach), we get the critical values by doing an inverse chi-square passing alpha and 1-alpha (assuming 2 tailed).

For PVA (p value approach): It’s the chi-cdf of the test stat, accounting for tail’dness.

The notes of Justin Abrahms

Recently updated

tests for quartz

Zero Knowledge Proofs (ZKP)

Sprint Ceremony input/outputs

Explorer

Hypothesis Testing

Elements of a hypothesis test

Formulating a hypothesis

Calculating a p-value

Calculating a p-value from a z score

Converting a two-tailed p-value to a one-tailed p-value

p-value for proportions / chi

Statistical tests

Z statistic testing

T statistic testing

Hypothesis about a population proportion

Calculating $β$ for a mean

Example

Calculating $β$ for a p (proportion)

Hypothesis about population variance

Graph View

Table of Contents

Backlinks

The notes of Justin Abrahms

Recently updated

tests for quartz

Zero Knowledge Proofs (ZKP)

Sprint Ceremony input/outputs

Explorer

Hypothesis Testing

Elements of a hypothesis test

Formulating a hypothesis

Calculating a p-value

Calculating a p-value from a z score

Converting a two-tailed p-value to a one-tailed p-value

p-value for proportions / chi

Statistical tests

Z statistic testing

T statistic testing

Hypothesis about a population proportion

Calculating β for a mean

Example

Calculating β for a p (proportion)

Hypothesis about population variance

Graph View

Table of Contents

Backlinks

Calculating $β$ for a mean

Calculating $β$ for a p (proportion)