Hi,
I built a data set for scoring purpose. And it needed multiple merges between many tables especially left joins.
that means I have many missing values for customers who aren't present in the second table of the join.
For example, the variable indicating the number of times the customer has opened an email will be missing for those who have not an email.
So a missing value is a different information from a 0 which indicates that the customer recieved emails but didn't open them!
I replaced missing values with 0, but I think it's not optimal.
My question is is there a way to distinguish between missing values and 0, so that it can be taken into account by the model (logistic regression which don't accept missing values)
Any advice is appreciated!
Thanks,
MK
... View more