1. Which of the following statements is false regarding categorical input variables and cardinality?
a. Cardinality is the number of distinct levels in a categorical variable.
b. Categorical variables with high cardinality can lead to overfitting in a predictive model.
c. Calculating cardinality is an important exploratory tool before transforming the categorical input.
d. Categorical variables with a low cardinality ratio have more distinct levels than variables with a high cardinality ratio.
e. All are false.
I would suggest you google "cardinality" and "cardinality ratio" so you understand what those terms mean.
d
Nearly 200 sessions are now available on demand in the Innovate Hub.
Watch Now →Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.