Requires:
- All conditions contain independent samples
- The dependent scores are normally distributed interval or ratio scores
- The variances of the populations are homogeneous
The n of each condition doesn't need to be equal, but it's way easier if they are. ANOVA only tests a two-tailed hypothesis (the means differ in either direction), but it's carried out as a one-tailed test because deviations in either direction produce the same positive F, and F can never be below 0.
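These requirements can be sanity-checked in code. A minimal sketch using SciPy (the score lists are the example data from the table further down; `shapiro` and `levene` are standard `scipy.stats` functions):

```python
from scipy import stats

# Example scores from the data table below, one list per condition.
easy      = [9, 12, 4, 8, 7]
medium    = [4, 6, 8, 2, 10]
difficult = [1, 3, 4, 5, 2]

# Shapiro-Wilk: rough check that each condition's scores look normal.
for name, scores in [("easy", easy), ("medium", medium), ("difficult", difficult)]:
    w, p = stats.shapiro(scores)
    print(f"{name}: Shapiro-Wilk p = {p:.3f}")

# Levene's test: homogeneity of variance across the conditions.
stat, p = stats.levene(easy, medium, difficult)
print(f"Levene p = {p:.3f}")  # a large p means no evidence the variances differ
```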
Diagram of an example study
Factor A: independent variable of perceived difficulty

Level A1: Easy | Level A2: Medium | Level A3: Difficult |
---|---|---|
X | X | X |
X | X | X |

k = 3 b/c there are 3 conditions.
This is similar to the two-sample t-test, but we can't just do a bunch of pairwise t-tests because the probability of making a Type I error gets too high: each comparison carries its own .05 chance of a Type I error, and those chances compound across comparisons. ANOVA limits the experiment-wise Type I probability to $\alpha$ (here .05).
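To see how that aggregation works: if each of $c$ comparisons is run at $\alpha = .05$ and we treat them as independent (an approximation), the chance of at least one Type I error is $1 - (1 - .05)^c$. A quick sketch:

```python
# Experiment-wise Type I error rate for c comparisons, each at alpha = .05,
# treating the comparisons as independent (an approximation).
alpha = 0.05
for c in (1, 3, 10):
    print(f"{c} comparisons -> P(at least one Type I error) = {1 - (1 - alpha) ** c:.3f}")
# 1  -> 0.050
# 3  -> 0.143   (the 3 pairwise t-tests for k = 3 conditions)
# 10 -> 0.401
```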
“sum of squares” or “SS” is really short for “sum of the squared deviations”
Definitions
experiment-wise error rate : The probability of making a Type I error somewhere among the comparisons in an experiment.
Tukey’s HSD test : HSD = Honestly Significant Difference, a post-hoc procedure done after ANOVA to compare the means of all pairs of levels when all levels have equal n's.
Example
$H_0$: $\mu_1 = \mu_2 = \mu_3$. $H_a$: not all $\mu$ are equal.
So given the above table:
easy | medium | difficult | |
---|---|---|---|
9 | 4 | 1 | |
12 | 6 | 3 | |
4 | 8 | 4 | |
8 | 2 | 5 | |
7 | 10 | 2 | totals |
sum(X): 40 | 30 | 15 | 85 |
sum(X^2): 354 | 220 | 55 | 629 |
n: 5 | 5 | 5 | N=15 |
xbar: 8 | 6 | 3 | k=3 |
Calculate the total sum of squares:

$SS_{tot} = \Sigma X^2_{tot} - \frac{(\Sigma X_{tot})^2}{N} = 629 - \frac{85^2}{15} = 629 - 481.67 = 147.33$

Then calculate the sum of squares between groups:

$SS_{bn} = \Sigma \frac{(\Sigma X \text{ in column})^2}{n \text{ in column}} - \frac{(\Sigma X_{tot})^2}{N} = \left(\frac{40^2}{5} + \frac{30^2}{5} + \frac{15^2}{5}\right) - 481.67 = 545 - 481.67 = 63.33$

Then calculate the sum of squares within groups:

$SS_{wn} = SS_{tot} - SS_{bn} = 147.33 - 63.33 = 84$

Compute the degrees of freedom:

$df_{bn} = k - 1 = 2$, $df_{wn} = N - k = 12$, $df_{tot} = N - 1 = 14$

Then get the mean square between groups ($MS_{bn}$):

$MS_{bn} = \frac{SS_{bn}}{df_{bn}} = \frac{63.33}{2} = 31.67$

Within groups ($MS_{wn}$):

$MS_{wn} = \frac{SS_{wn}}{df_{wn}} = \frac{84}{12} = 7$

then:

$F_{obt} = \frac{MS_{bn}}{MS_{wn}} = \frac{31.67}{7} = 4.52$
Which leaves us with:
Source | Sum of squares | df | mean square | f_obt |
---|---|---|---|---|
between | 63.33 | 2 | 31.67 | 4.52 |
within | 84 | 12 | 7 | |
total | 147.33 | 14 |
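As a cross-check, the whole summary table can be reproduced from the raw scores with a few lines of NumPy. A sketch (nothing here is assumed beyond the data table above):

```python
import numpy as np

groups = {
    "easy":      np.array([9, 12, 4, 8, 7]),
    "medium":    np.array([4, 6, 8, 2, 10]),
    "difficult": np.array([1, 3, 4, 5, 2]),
}

all_scores = np.concatenate(list(groups.values()))
N, k = all_scores.size, len(groups)                 # N = 15, k = 3

correction = all_scores.sum() ** 2 / N              # (sum X_tot)^2 / N = 481.67
ss_total   = (all_scores ** 2).sum() - correction   # 147.33
ss_between = sum(g.sum() ** 2 / g.size for g in groups.values()) - correction  # 63.33
ss_within  = ss_total - ss_between                  # 84

ms_between = ss_between / (k - 1)                   # 63.33 / 2 = 31.67
ms_within  = ss_within / (N - k)                    # 84 / 12   = 7
f_obt      = ms_between / ms_within                 # 4.52

print(round(ss_between, 2), round(ss_within, 2), round(ss_total, 2), round(f_obt, 2))
```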
Distribution
Unlike the t and z distributions, the F-distribution is positively skewed, b/c there's no upper limit on how big F can be, but it can never be lower than 0. Also unlike those distributions, finding $F_{crit}$ requires both $df_{bn}$ and $df_{wn}$ to look it up in the "F-table".
For the example above, $F_{crit}(2, 12) \approx 3.89$ at $\alpha = .05$, and $F_{obt} = 4.52$ exceeds it, so this is sufficient evidence to reject the null hypothesis.
This leads us to conclude that there does appear to be a relationship between perceived difficulty and score. We don't know which specific conditions differ from each other, though; that's what the post-hoc test is for.
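The same F and its p-value come straight out of SciPy, and $F_{crit}$ can be pulled from the F-distribution instead of a printed table. A sketch using `scipy.stats.f_oneway` and `scipy.stats.f`:

```python
from scipy import stats

easy, medium, difficult = [9, 12, 4, 8, 7], [4, 6, 8, 2, 10], [1, 3, 4, 5, 2]

# One-way ANOVA across the three conditions.
f_obt, p_value = stats.f_oneway(easy, medium, difficult)
print(f"F_obt = {f_obt:.2f}, p = {p_value:.3f}")   # F_obt = 4.52, p ~ .034

# Critical F at alpha = .05 with df_bn = 2 and df_wn = 12.
f_crit = stats.f.ppf(1 - 0.05, 2, 12)
print(f"F_crit = {f_crit:.2f}")                    # ~ 3.89
# F_obt > F_crit (equivalently p < .05), so reject the null hypothesis.
```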
Post-hoc test
The $q_k$ value for Tukey's HSD is found in a table called "Values of the Studentized Range Statistic" (table 5 in the appendix of Behavioral Sciences Stats). For $k = 3$ and $df_{wn} = 12$ at $\alpha = .05$, $q_k = 3.77$. Then $HSD = q_k\sqrt{\frac{MS_{wn}}{n}} = 3.77\sqrt{\frac{7}{5}} = 4.46$.
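SciPy (1.7 or newer) also exposes the studentized range distribution, so the table lookup and the HSD can be reproduced directly; a sketch, assuming a recent SciPy:

```python
from math import sqrt
from scipy.stats import studentized_range

k, df_wn, ms_wn, n = 3, 12, 7, 5

# q_k: the alpha = .05 critical value of the studentized range for k means and df_wn.
q_k = studentized_range.ppf(1 - 0.05, k, df_wn)   # ~ 3.77
hsd = q_k * sqrt(ms_wn / n)                       # ~ 4.46
print(f"q_k = {q_k:.2f}, HSD = {hsd:.2f}")
```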
Get all the differences of level-mean combinations:
$\bar{X}_1 = 8$ (easy), $\bar{X}_2 = 6$ (medium), $\bar{X}_3 = 3$ (difficult)

$\bar{X}_1 - \bar{X}_2 = 2$
$\bar{X}_1 - \bar{X}_3 = 5$
$\bar{X}_2 - \bar{X}_3 = 3$
Compare each difference to the HSD. If the absolute difference is greater than the HSD, the two means differ significantly (so only easy vs. difficult, with a difference of 5 > 4.46, is a significant difference).
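For comparison, statsmodels can run the full post-hoc in one call, which is a handy cross-check on the hand computation; a sketch assuming statsmodels is installed (`pairwise_tukeyhsd` lives in `statsmodels.stats.multicomp`):

```python
from statsmodels.stats.multicomp import pairwise_tukeyhsd

scores = [9, 12, 4, 8, 7,  4, 6, 8, 2, 10,  1, 3, 4, 5, 2]
groups = ["easy"] * 5 + ["medium"] * 5 + ["difficult"] * 5

# Tukey HSD over every pair of levels at alpha = .05.
result = pairwise_tukeyhsd(endog=scores, groups=groups, alpha=0.05)
print(result.summary())   # only easy vs. difficult should show reject = True
```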