Type a search term to find related articles by LIMS subject matter experts gathered from the most trusted and dynamic collaboration tools in the laboratory informatics industry.
Cochran's test,[1] named after William G. Cochran, is a one-sided upper limit variance outlier statistical test . The C test is used to decide if a single estimate of a variance (or a standard deviation) is significantly larger than a group of variances (or standard deviations) with which the single estimate is supposed to be comparable. The C test is discussed in many text books [2][3][4] and has been recommended by IUPAC[5] and ISO.[6] Cochran's C test should not be confused with Cochran's Q test, which applies to the analysis of two-way randomized block designs.
The C test assumes a balanced design, i.e. the considered full data set should consist of individual data series that all have equal size. The C test further assumes that each individual data series is normally distributed. Although primarily an outlier test, the C test is also in use as a simple alternative for regular homoscedasticity tests such as Bartlett's test, Levene's test and the Brown–Forsythe test to check a statistical data set for homogeneity of variances. An even simpler way to check homoscedasticity is provided by Hartley's Fmax test,[3] but Hartley's Fmax test has the disadvantage that it only accounts for the minimum and the maximum of the variance range, while the C test accounts for all variances within the range.
The C test detects one exceptionally large variance value at a time. The corresponding data series is then omitted from the full data set. According to ISO standard 5725 [6] the C test may be iterated until no further exceptionally large variance values are detected, but such practice may lead to excessive rejections if the underlying data series are not normally distributed. The C test evaluates the ratio:
where:
The C test tests the null hypothesis (H0) against the alternative hypothesis (Ha):
The sample variance of data series j is considered an outlier at significance level α if Cj exceeds the upper limit critical value CUL. CUL depends on the desired significance level α, the number of considered data series N, and the number of data points (n) per data series. Selections of values for CUL have been tabulated at significance levels α = 0.01,[6][7][8] α = 0.025,[8] and α = 0.05.[6][7][8] CUL can also be calculated from:[8][9]
Here:
The C test can be generalized to include unbalanced designs, one-sided lower limit tests and two-sided tests at any significance level α, for any number of data series N, and for any number of individual data points nj in data series j.[8][9]