In an r x c table, two cross-classified variables are
independent if all the joint probabilities equal the
product of the corresponding marginal probabilities, that is,
pij = pi.p.j
for i = 1,2,...,r and j = 1,2,...,c .
(2.6)
Under the assumption of independence of the two variables, the ML
estimate of pij, denoted p^ij is
^ p
ij
=
^ p
i.
^ p
.j
nijN
=
æ ç è
ni.N
ö ÷ ø
æ ç è
n.jN
ö ÷ ø
=
ni. n.jN2
and, under the same assumption,
^ m
ij
= N
^ p
ij
= N
^ p
i.
^ p
.j
= nij =
ni. n.jN
(2.7)
Consequence. Independence is true when we demonstrate
nij =
^ mij
, where
^ mij
=
ni . n. jN
.
(2.8)
In order to test the hypothesis of independence, we introduce two statistics:
The Chi-square statistic [Pearson 1900]
c2 =
r
å
i
c
å
j
(nij -
^ mij
)2
^ mij
,
(2.9)
and the Likelihood ratio statistic (e.g., L76)
G2 = 2
æ ç è
r
å
i
c
å
j
nij log
æ ç è
nij
^ mij
ö ÷ ø
ö ÷ ø
.
(2.10)
When independence is true, both c2 and
G2 have asymptotic (as N ®
¥) Chi-square distribution with degrees of freedom
(r-1)(c-1). For either statistics, larger values provide
more evidence against the hypothesis (null hypothesis) of
independence (For more information about the likelihood statistic
consult C90 and BFH75).