Hi Ed,
You wrote:
>>>
In particular, it makes no sense to standardize a dummy variable. First, you obliterate the real information in the variable. Second, the variance of a dichotomous variable is p times 1 - p, where p is the fraction of 1's, and can never be equal to 1. So standardization creates an impossibility.
>>>
I am not clear why you would say that. Consider the following.
> Flag <- c(rep(0,25),rep(1,75))
> mean(Flag);sd(Flag)
[1] 0.75
[1] 0.4351941
> zFlag <- (Flag - mean(Flag))/sd(Flag)
> mean(zFlag);sd(zFlag)
[1] -2.775558e-17
[1] 1
> table(Flag, zFlag)
zFlag
Flag -1.72336879396141 0.574456264653803
0 25 0
1 0 75
>
zFlag has a variance of 1 and appears to retain all the information in Flag. What have I misunderstood?
I have always thought of the 0/1 coding as simply a matter of convenience that makes the effect coefficients easier to interpret (Cohen, Cohen, West & Aiken, 2002).
Keith
------------------------
Keith A. Markus
John Jay College of Criminal Justice, CUNY
http://jjcweb.jjay.cuny.edu/kmarkus
Frontiers of Test Validity Theory: Measurement, Causation and Meaning.
http://www.routledge.com/books/details/9781841692203/