Totals
a+c
b+d
n
If there were no systematic association between the variables A and B within the population from which the cell frequencies are randomly drawn, the probability of any particular possible array of cell frequencies, a, b, c, d, given fixed values for the marginal totals a+b, c+d, etc., would be given by the hypergeometric rule
which for computational purposes reduces to
Also, the degree of disproportion within any array of cell frequenciesin effect, the degree of ostensible association between variables A and B within the samplecan be measured by the absolute difference
For any particular observed array of cell frequencies, the programming embedded in this page calculates the probability of that particular array plus the probabilities of all other possible arrays whose degree of disproportion is equal to or greater than that of the observed array. Thus, for the observed array
2
7
9
8
2
10
10
9
19
the one-tailed probability would be the sum of the separate probabilities for the arrays
probability
2
7
8
2
0.01754
1
8
9
1
0.00097
0
9
10
0
0.00001
sum = 0.01852
(one-tailed probability)
And the two-tailed probability would be that sum plus the sum of the separate probabilities for the arrays of equal or greater disproportion at the other extreme:
probability
8
1
2
8
0.00438
9
0
1
9
0.00011
sum = 0.00449
two-tailed probability = 0.01852 + 0.00449 = 0.02301
Return to Top
©Richard Lowry 1998-
All rights reserved.