Here's the variables I think we should log transform, all representing income/wages/etc.
VERSIONS = {
...
'log_transform_feats': '''INTP OIP PAP RETP SEMP SSIP SSP WAGP PERNP
PINCP'''.split(),
Only issue is that some of these variables can be negative (for losses). So I guess the transformation for those should be x = log(x - min(x)) or something?
Once we figure that out it should be easy to put this into get_dummies.
Here's the variables I think we should log transform, all representing income/wages/etc.
Only issue is that some of these variables can be negative (for losses). So I guess the transformation for those should be x = log(x - min(x)) or something?
Once we figure that out it should be easy to put this into get_dummies.