- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Hi, I am trying to understand how SAS EM conducts the fuzzy method for reject inference. According to the documentation (Reject Inference Node or Reject Inference Techniques Implemented in Credit Scoring for SAS Enterprise Miner), SAS EM creates two observations in the augmented data set for each original observation in the rejects data set. In the first observation, a target value of 0 is assigned. In the second observation, a target value of 1 is assigned. The two observations are then individually weighted by the posterior probabilities, P(non-event) and P(event), respectively. The posterior probabilities, P(non-event) and P(event), are estimated from the model that was trained on the accepts (or known good-bad) data set.
A common frequency weight, called the reject weight, is then assigned to both observations to account for any over-sampling or under-sampling of the rejects data. The reject weight is computed as follows:
- rejection rate = 300 / (600) ?
- Nrejects = 300?
- What is our Naccepts in this example?
Thank you.