win-vector.com
Modeling Trick: Impact Coding of Categorical Variables with Many Levels
One of the shortcomings of regression (both linear and logistic) is that it doesn’t handle categorical variables with a very large number of possible values (for example, postal codes). You c…