TrishPaquette
Calcite | Level 5

In the case study solution to Module 3, the author rejects variables with a high percent mode (>97%). Is this done in the metadata node? I can choose statistics to filter on, but mode is not an option. 

3 REPLIES
RobertBlanchard
SAS Employee

Hello Trish,

Yes, the variables with a large mode percentage are set to Rejected using the Metadata node.

One alternative to the provided solution is to set the Data Partition node to perform a stratified simple random sample, stratifying on the variables with a large mode so that all values of those variables appear in each partition. To do this, click the Variables ellipsis in the Data Partition node, then change the Partition Role value from Default to Stratification for the categorical variables of interest. If a level is very rare, this approach may not work, and you will receive an error from the node.
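
To illustrate the idea behind stratifying on a variable with a dominant mode, here is a minimal Python sketch. It is not how the Data Partition node is implemented; the function name `stratified_partition`, the `train_fraction` and `seed` parameters, and the `ValueError` for very rare levels are all assumptions made for illustration. It samples within each level so that every level lands in both partitions.

```python
import random
from collections import defaultdict


def stratified_partition(rows, strata_key, train_fraction=0.5, seed=12345):
    """Split rows into (train, validate), sampling within each level of strata_key.

    Hypothetical helper for illustration only; not the SAS implementation.
    """
    rng = random.Random(seed)

    # Group the rows by the level of the stratification variable.
    by_level = defaultdict(list)
    for row in rows:
        by_level[row[strata_key]].append(row)

    train, validate = [], []
    for level, members in by_level.items():
        # A level with a single row cannot appear in both partitions.
        if len(members) < 2:
            raise ValueError(f"level {level!r} is too rare to stratify on")
        rng.shuffle(members)
        # Keep at least one row of this level on each side of the split.
        cut = min(max(1, round(len(members) * train_fraction)), len(members) - 1)
        train.extend(members[:cut])
        validate.extend(members[cut:])
    return train, validate
```

With 97 rows at one level and 3 at another, both partitions still contain the rare level, which a plain random split does not guarantee.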

 

Good luck!

Best,

  Robert 

TrishPaquette
Calcite | Level 5
Revisiting this because I can't seem to reject variables automatically in the Metadata node. I am offered some statistics on which to reject, but percent mode is not one of them. How can I do this automatically?
RobertBlanchard
SAS Employee

Hey,

I believe one would have to reject the variables manually in this scenario.
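
To decide which variables to reject by hand, it can help to compute the percent mode for each variable outside the node first. The following is a minimal Python sketch of that calculation, assuming the data is available as a dictionary of column name to list of values. The names `percent_mode` and `variables_to_reject` are hypothetical, and the 97% threshold comes from the case study described above. Missing values are not treated specially here.

```python
from collections import Counter


def percent_mode(values):
    """Return (mode, percent) for the most frequent value in a column."""
    counts = Counter(values)
    mode, freq = counts.most_common(1)[0]
    return mode, 100.0 * freq / len(values)


def variables_to_reject(columns, threshold=97.0):
    """Return the names of columns whose percent mode exceeds the threshold."""
    return [
        name
        for name, values in columns.items()
        if values and percent_mode(values)[1] > threshold
    ]


# Made-up example data: "promo" is dominated by one value, "region" is not.
columns = {
    "promo": ["N"] * 98 + ["Y"] * 2,
    "region": ["A"] * 60 + ["B"] * 40,
}
print(variables_to_reject(columns))
```

The resulting list names the variables to set to Rejected manually in the Metadata node.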

 

Best,

  Robert

 
