BookmarkSubscribeRSS Feed
deleted_user
Not applicable
Hi,

I am trying to identify common point/trend. For example, if I have 10 million records, of which 200,000 records are considered as "bad" , leaving 800,000 as "good" records. Each ID can have multiple set of records, but once an ID is tagged as "bad", no more records will occur. What I need to find out is if the records prior to going bad had a common theme/pattern.

I've used proc freq in SAS to basically categorize and state that if variable ABC is linked to 100 IDs, and variable DEF is linked to 300 IDs before going bad, then I assume that DEF is the common point. However, the problem I am facing is that if DEF naturally has lot more volume then it's not really considered as the bad link. Is there an algorithm to finding the bad link in SAS?

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 0 replies
  • 675 views
  • 0 likes
  • 1 in conversation