They’re like z-tests, but for when we don’t know the population standard deviation.

Definitions

independent-samples t-test : parametric procedure used to test the difference between the means of two independent samples

independent samples : samples created by selecting each participant for one condition without regard to the participants selected for any other condition

homogeneity of variance : the requirement that the populations represented in a study have equal variances ($\sigma_1^2 = \sigma_2^2$)

sampling distribution of differences between means : take infinitely many pairs of samples from one population, then plot the frequency distribution of the differences between their means ($\bar{X}_1 - \bar{X}_2$). In this distribution, $\mu_{\bar{X}_1 - \bar{X}_2}$ is 0 (i.e. the population means are equal)

pooled variance ($s_{pool}^2$) : weighted average of the sample variances in a two-sample t-test

effect size : the amount of influence that changing the conditions of the independent variable had on the dependent scores

Cohen’s d : a measure of effect size in a two-sample experiment that shows the magnitude of the difference between the means

squared point-biserial correlation coefficient ($r_{pb}^2$) : proportion of variance in dependent scores that is accounted for by the independent variable in a two-sample experiment

One-sample t-test

A one-sample t-test assumes you have a one-sample experiment using interval or ratio scores.

Because the t-distribution changes shape based on the size of the samples used (n), the critical value $t_{crit}$ changes too (i.e. it’s not just 1.96/1.645 like the z-test). You can look this value up in a t-table using the degrees of freedom ($df = n - 1$ for a one-sample test). When $df \ge 120$, it’s basically the Normal Distribution.

When df gets large, tables start bucketing the critical values into ranges (e.g. your df = 65, but you’ll only see rows for df = 60 & df = 120). At two-tailed $\alpha = .05$, those rows give $t_{crit} = 2.000$ and $t_{crit} = 1.980$. If your $t_{obt}$ is less than the smaller number (1.980), you’re not significant. If it’s bigger than the larger number (2.000), you are significant. If it’s somewhere in the middle, either use software like JASP/SPSS or do something called linear interpolation.
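
A minimal sketch of that interpolation, using the two-tailed $\alpha = .05$ rows above (the helper name is just for illustration):

def interp_tcrit(df, df_lo=60, t_lo=2.000, df_hi=120, t_hi=1.980):
  # linearly interpolate between the two t-table rows that bracket df
  frac = (df - df_lo) / (df_hi - df_lo)
  return t_lo + frac * (t_hi - t_lo)

print(interp_tcrit(65))  # ~1.998, between 1.980 and 2.000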

Steps:

  1. Create a 1- or 2-tailed $H_0$/$H_a$ and choose $\alpha$.
  2. Compute $t_{obt}$
    1. Compute $\bar{X}$ and $s_X^2$
    2. Compute $s_{\bar{X}} = \sqrt{s_X^2 / n}$
    3. Compute $t_{obt} = \frac{\bar{X} - \mu}{s_{\bar{X}}}$
  3. Use $df = n - 1$ to find $t_{crit}$ in the t-table

Another way to think of $t_{obt}$:

from math import sqrt

def t_obt(xbar, s2x, n, u):
  # estimated standard error of the mean
  sxbar = sqrt(s2x / n)
  # how many standard errors the sample mean falls from the hypothesized mu
  return (xbar - u) / sxbar

Determining a population parameter w/ interval estimation

We can do this two ways: “point estimation” allows us to say $\mu$ is equal to our sample mean, but that is subject to sampling error.

Instead, we can use interval estimation, which results in a statement plus a margin of error. We calculate this by determining the highest and lowest likely values of $\mu$ that would still be represented by our sample mean. Because we want both the upper and lower bounds, we always use the two-tailed version of $t_{crit}$.

The formula for the confidence interval: $(s_{\bar{X}})(-t_{crit}) + \bar{X} \le \mu \le (s_{\bar{X}})(+t_{crit}) + \bar{X}$
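
In the same style as t_obt above, a small sketch (tcrit is the two-tailed table value for your df):

from math import sqrt

def conf_interval(xbar, s2x, n, tcrit):
  # mu is likely within tcrit standard errors on either side of the sample mean
  sxbar = sqrt(s2x / n)
  return (xbar - tcrit * sxbar, xbar + tcrit * sxbar)

print(conf_interval(7.78, 3.69, 9, 2.306))  # ~(6.30, 9.26) with the example data below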

Example

We are testing the optimism scores for a group of men (Behavioral Sciences Stats pg 129) and have the following data. We are trying to test if their mean differs significantly from the known population $\mu$.

| participant | scores (X) | $X^2$ |
| --- | --- | --- |
| 1 | 9 | 81 |
| 2 | 8 | 64 |
| 3 | 10 | 100 |
| 4 | 7 | 49 |
| 5 | 8 | 64 |
| 6 | 8 | 64 |
| 7 | 6 | 36 |
| 8 | 4 | 16 |
| 9 | 10 | 100 |
| sums | 70 | 574 |
| mean | 7.7777778 | |

We can then calculate the estimated standard error, but first we need the estimated population variance:

$s_X^2 = \frac{\Sigma X^2 - \frac{(\Sigma X)^2}{n}}{n - 1} = \frac{574 - \frac{70^2}{9}}{8} \approx 3.69$

then:

$s_{\bar{X}} = \sqrt{\frac{s_X^2}{n}} = \sqrt{\frac{3.69}{9}} \approx 0.64$

which lets us determine $t_{obt}$ (note: $\mu$ is the thing we’re testing for in $H_0$; $t_{obt}$ is the value we found that we need to compare to the critical value to determine if it’s significant):

$t_{obt} = \frac{\bar{X} - \mu}{s_{\bar{X}}} = \frac{7.78 - \mu}{0.64}$

With the book’s value for $\mu$, $t_{obt}$ falls beyond $t_{crit} = \pm 2.306$ ($df = 8$, two-tailed $\alpha = .05$), so there is sufficient evidence to reject the null hypothesis here.
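
The same arithmetic, reusing the t_obt helper from earlier (the null-hypothesis $\mu$ is left as a parameter here since it comes from the book):

scores = [9, 8, 10, 7, 8, 8, 6, 4, 10]
n = len(scores)
sum_x = sum(scores)                        # 70
sum_x2 = sum(x * x for x in scores)        # 574
s2x = (sum_x2 - sum_x ** 2 / n) / (n - 1)  # ~3.69, estimated population variance
xbar = sum_x / n                           # ~7.78
# t_obt(xbar, s2x, n, u) is then compared against t_crit = +/-2.306 (df = 8)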

Two-sample t-test

We usually don’t know $\mu$, so research often uses the two-sample t-test. We presume that the condition 1 mean $\bar{X}_1$ represents the $\mu_1$ you’d get if you tested the whole population under that condition. Then, we can conduct an altered condition 2 to see how its $\mu_2$ changes relative to that presumptive $\mu_1$.

The two-sample test comes in two varieties: the independent-samples t-test and the related-samples t-test. Independent samples might be: you got a bunch of rats, gave half food A and half food B, and checked whether there was a difference.

Related samples: You got a bunch of rats. You fed them food A. Tested them. Then fed them food B. Tested again. The two samples (A & B) are related (b/c they’re the same rats).
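
A sketch of both varieties (the independent version assumes homogeneity of variance; D below is the per-rat difference score):

from math import sqrt

def t_independent(x1, x2):
  # pooled variance: weighted average of the two sample variances
  n1, n2 = len(x1), len(x2)
  m1, m2 = sum(x1) / n1, sum(x2) / n2
  ss1 = sum((x - m1) ** 2 for x in x1)
  ss2 = sum((x - m2) ** 2 for x in x2)
  s2_pool = (ss1 + ss2) / (n1 + n2 - 2)
  # standard error of the difference between means; df = n1 + n2 - 2
  se_diff = sqrt(s2_pool * (1 / n1 + 1 / n2))
  return (m1 - m2) / se_diff

def t_related(before, after):
  # difference scores D; df = n - 1
  d = [b - a for b, a in zip(before, after)]
  n = len(d)
  dbar = sum(d) / n
  s2_d = sum((x - dbar) ** 2 for x in d) / (n - 1)
  return dbar / sqrt(s2_d / n)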

Effect size

This is “cool. It changed something.. but like.. how much did it move the needle?”

We can compute this two ways:

  • Cohen’s d (how much does this move the needle?)
  • proportion of variance accounted for (does it have a consistent effect?)

Cohen’s d

This measures the size of the changes in the scores.

Independent tests use $d = \frac{\bar{X}_1 - \bar{X}_2}{\sqrt{s_{pool}^2}}$. Related tests use $d = \frac{\bar{D}}{\sqrt{s_D^2}}$.

Cohen proposed these guidelines:

| d | interpretation |
| --- | --- |
| .2 | small effect |
| .5 | medium effect |
| .8 | large effect |
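
Those two formulas as code (s2_pool and the difference scores are as computed in the two-sample sketch above):

from math import sqrt

def cohens_d_independent(m1, m2, s2_pool):
  # difference between the means, in pooled-standard-deviation units
  return (m1 - m2) / sqrt(s2_pool)

def cohens_d_related(dbar, s2_d):
  # mean difference score, in standard-deviation-of-D units
  return dbar / sqrt(s2_d)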

Proportion of variance accounted for

This measures how consistently the scores change.

Given this data from a related-sample study:

| Before therapy | After |
| --- | --- |
| 10 | 5 |
| 11 | 6 |
| 12 | 7 |

We can see that some of the variance in the scores comes from individual differences (10 != 11 != 12), and some is accounted for by the independent variable: therapy lowers every score by exactly 5, a perfectly consistent effect.

We describe this via the “squared point-biserial correlation coefficient” ($r_{pb}^2$):

$r_{pb}^2 = \frac{(t_{obt})^2}{(t_{obt})^2 + df}$

e.g. if $t_{obt} = 2$ and $df = 8$, then $r_{pb}^2 = \frac{4}{4 + 8} = .33$
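
As a one-liner (same symbols as the formula):

def r2_pb(t_obt, df):
  # proportion of variance in the dependent scores accounted for by the IV
  return t_obt ** 2 / (t_obt ** 2 + df)

print(r2_pb(2, 8))  # ~0.33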

Guidelines for interpreting $r_{pb}^2$:

| $r_{pb}^2$ | interpretation |
| --- | --- |
| < .09 | small effect |
| .10 to .25 | moderate / relatively common |
| > .25 | large / rare |

ANOVA

When calculating effect size for ANOVA, we’re looking for eta squared ($\eta^2$), which plays the same role as $r_{pb}^2$: $\eta^2 = \frac{SS_{bn}}{SS_{tot}}$ (sum of squares between groups over the total sum of squares).
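
As code (the SS values come from the ANOVA summary table; the names are just illustrative):

def eta_squared(ss_between, ss_total):
  # proportion of total variance accounted for by the factor
  return ss_between / ss_total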