1. Which of the following statements is false regarding categorical input variables and cardinality?
a. Cardinality is the number of distinct levels in a categorical variable.
b. Categorical variables with high cardinality can lead to overfitting in a predictive model.
c. Calculating cardinality is an important exploratory tool before transforming the categorical input.
d. Categorical variables with a low cardinality ratio have more distinct levels than variables with a high cardinality ratio.
e. All are false.
I would suggest you google "cardinality" and "cardinality ratio" so you understand what those terms mean.
d
Catch the best of SAS Innovate 2025 — anytime, anywhere. Stream powerful keynotes, real-world demos, and game-changing insights from the world’s leading data and AI minds.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.