Hi all,
What I am trying to achieve is the following :
I am trying to count the categories by the ID i have and to keep only the top two frequent for every ID.
I have tried
count distinct categories grouping by ID with a proc sql statement by I get stuck on the next step where i want to keep the top 2 by every ID, for example if i just wanted the top value per ID i would take the max but how does that work for getting the second max value as well?
Kind regards and thank you in advance
One way, not necessarily the slickest but relatively easy to understand.
Proc freq data=have order=freq noprint;
table id*categoricalvar/ out=temp;
run;
data want;
set temp;
by id;
retain counter;
if first.id then counter=1;
else counter+1;
if counter le 2 then output;
run;
One way, not necessarily the slickest but relatively easy to understand.
Proc freq data=have order=freq noprint;
table id*categoricalvar/ out=temp;
run;
data want;
set temp;
by id;
retain counter;
if first.id then counter=1;
else counter+1;
if counter le 2 then output;
run;
Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.
Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.