I read the following (below) in some article on here:
For categorical variables, the most common methodology is “count” wherein you fill the missing values with the most common level of the categorical variable.
How is this performed? I can't find any information on it.
You need to first understand how and why the values are missing before you can say what an appropriate method is. Using the largest group isn't a great method. An alternative is to actually model the data to predict the category - using logistic regression or discriminant analysis. These are both covered in PROC MI and both have examples in the documentation, 79.4 & 79.5 Examples
I read the following (below) in some article on here:
For categorical variables, the most common methodology is “count” wherein you fill the missing values with the most common level of the categorical variable.
How is this performed? I can't find any information on it.
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Registration is open
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss. Register now and lock in 2025 pricing—just $495!