1. Which of the following statements is false regarding categorical input variables and cardinality?
a. Cardinality is the number of distinct levels in a categorical variable.
b. Categorical variables with high cardinality can lead to overfitting in a predictive model.
c. Calculating cardinality is an important exploratory tool before transforming the categorical input.
d. Categorical variables with a low cardinality ratio have more distinct levels than variables with a high cardinality ratio.
e. All are false.
I would suggest you google "cardinality" and "cardinality ratio" so you understand what those terms mean.
d
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.