BookmarkSubscribeRSS Feed
🔒 This topic is locked. We are no longer accepting replies to this topic. Need further help? Please sign in and ask a new question.
TrishPaquette
Calcite | Level 5

In the case study solution to Module 3, the author rejects variables with a high percent mode (>97%). Is this done in the metadata node? I can choose statistics to filter on, but mode is not an option. 

3 REPLIES 3
RobertBlanchard
SAS Employee

Hello Trish,

 

Yes, the variables with the large mode percentage are set to rejected using the metadata node.  

 

One alternative to the provided solution may be to set the data partition node to perform a stratified simple random sample, stratifying on those variables with a large mode to ensure all values in the variables exist in each partition.  To do this, click the variables ellipsis in the data partition node.  Then change the Partition Role value from default to  Stratification for the categorical variables of interest.  If a level is VERY rare, then sometimes this approach may not work and you'll receive an error from the node.

 

Good luck!

Best,

  Robert 

TrishPaquette
Calcite | Level 5
Revisiting this because I can't seem to reject automatically in
the Metadata node. I am offered some statistics on which to reject, but
percent mode is not one of them. How can I do this automatically?
RobertBlanchard
SAS Employee

Hey,

 

I believe one would have to reject the variables manually in this scenario.

 

Best,

  Robert

 

This is a knowledge-sharing community for learners in the Academy. Find answers to your questions or post here for a reply.
To ensure your success, use these getting-started resources:

Estimating Your Study Time
Reserving Software Lab Time
Most Commonly Asked Questions
Troubleshooting Your SAS-Hadoop Training Environment

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 3 replies
  • 1954 views
  • 1 like
  • 2 in conversation