One-way ANOVA Computations

This discussion covers the most crucial computational background for one-way Analysis of Variance (ANOVA). You will see that this is an extension of hypothesis testing, and is related to regression. For a review of hypothesis testing, please refer to Module 1, and for regression to Modules 2, 3, 4 and 5.

The objective of this section is to give you necessary theoretical background, formulae and tools to help you understand the background and basics of one-way ANOVA. This includes formulation of the ANOVA hypothesis test and conducting the test using already familiar hypothesis test steps.

So far, in hypothesis testing the discussion has concentrated on at most two population cases. For example, we conducted tests on the equality of population means (t-test), or on the equality of population variances (F-test). In this section we will expand the hypothesis testing to more than two populations. As an example, we would like to test the hypothesis that k population means are equal against the hypothesis that at least two population means are not equal. The hypotheses are commonly stated as follows:

The term Analysis of Variance (ANOVA) refers to a technique in which the total variability in the data is divided into components of variability each of which can be attributed to specific distinct sources of variation. In a One-Way ANOVA model there is thought to be one factor or one source of variation in addition to randomness, which may cause the population means to be unequal. The test is then to determine whether the contribution of this one factor is significant enough to make the population means significantly different.

The One Factor Model

The one factor model may be written as

Note: The term treatment is used to identify the population of concern.

The factor i effect is a measure of the deviation of the ith treatment mean µi from the overall mean µ. If there is no difference between the ith treatment mean µi and the overall mean µ then i=0. Hence, it is sufficient to consider the following hypothesis test:

This hypothesis test statement is equivalent to

The test is based on a comparison of two independent estimates of the overall variance 2 using the same approach as in regression. It is assumed in the one-factor experiment that the variation between the treatment averages can be attributed to two components:

The random variation within the treatment or factor is measured by the error sum of squares (SSE). The variation due to differences between the treatments or factors, the so called factor effect, is measured by the sum of squares between the treatments or factors (SSA). The total variability in the data, measured by the total sum of squares (SST) is divided into these two components like in regression:

Here

It can be shown that

If the treatment effect i=0 for all treatments i, then

is an unbiased estimator of 2. The estimator s12 is unbiased because

The second expectation, E(SSE) above, is not affected by the treatment differences, or factor effects i. Therefore

is always an unbiased estimator of 2.

Recall the above hypothesis

The hypothesis H0 is true if the estimator s12 is unbiased, otherwise the hypothesis is false. If the hypothesis H0 is false, then there is at least one i 0.

Based on the above discussion we can use an F-test to determine if the estimates from the two sources of variation are significantly different.

The estimator s12 is called the treatment mean square, mSA, and the estimator s2 is called the error mean square, mSE, because the estimators are the respective sums of squares divided by their respective degrees of freedom. These degrees of freedom, in turn, depend on the problem size, i.e. number of treatments and number of observations within treatments.

Hence, for the factor A we can write

For the random variation we obtain

Now, when H0 is true the ratio

gives values of an F-random variable, which follows an f-distribution with (k-1) and k(n-1) degrees of freedom. Because s12 overestimates 2 when H0 is false, the critical region is entirely on the right tail of the f-distribution. The decision rule to reject H0 at the level of significance becomes