1. Which of the following statements is false regarding categorical input variables and cardinality?
a. Cardinality is the number of distinct levels in a categorical variable.
b. Categorical variables with high cardinality can lead to overfitting in a predictive model.
c. Calculating cardinality is an important exploratory tool before transforming the categorical input.
d. Categorical variables with a low cardinality ratio have more distinct levels than variables with a high cardinality ratio.
e. All are false.
I would suggest you look up "cardinality" and "cardinality ratio" so you understand what those terms mean.
d
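To see why d is the false statement, here is a minimal sketch in Python. It assumes the common definition of cardinality ratio as the number of distinct levels divided by the number of non-missing observations. The helper names and the sample data (`region`, `customer_id`) are made up for illustration and do not come from any SAS procedure.

```python
def cardinality(values):
    """Count the distinct non-missing levels of a categorical variable."""
    return len({v for v in values if v is not None})


def cardinality_ratio(values):
    """Distinct levels divided by non-missing observations (assumed definition)."""
    non_missing = [v for v in values if v is not None]
    if not non_missing:
        return 0.0
    return len(set(non_missing)) / len(non_missing)


# Hypothetical data: two variables with the same number of rows (100 each).
region = ["N", "S", "E", "W"] * 25                 # few levels, repeated often
customer_id = [f"C{i:03d}" for i in range(100)]    # every row has its own level

for name, column in [("region", region), ("customer_id", customer_id)]:
    print(name, cardinality(column), cardinality_ratio(column))
```

With the same number of rows, the variable with the lower ratio (region) has fewer distinct levels, not more, which is the opposite of what d claims. A variable like customer_id, with a ratio near 1, is the kind of high-cardinality input that statement b warns about.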