BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
psnorrod
Obsidian | Level 7

Hello All!

 

Learning SAS user back again.

 

I'm working with ecological data, and (thanks to the SAS community!!!) successfully used code that counts the number of events across three variables (year, sex, age_group). The data is all events, so there are no "zeros" in the original data set. Now, I need to either generate zeros for the missing groups that do not have events or merge a data set that has "zeros" using a conditional statement. My final intent is to obtain rates using PROC GENMOD employing a nb or zero-inflated model. All that said, I'm uncertain as to how to generate or create the zeros in the sql data set for my final analysis.

 

Code to create the count:

proc sql;
create table work.sql as
select year, sex, age_group, count(*) as count
from one
group by year, sex, age_group
;
quit;

 

Variables: year, site, sex=1 or 2, age_group=1, 2, or 3

 

Sample sql data set:

year   site  sex  age_group  counts

2003   66      2         3                  11

2003   66      2         3                   21

2003   66      2          2                  17

2003   66     1           1                   3

2003   66      1          2                   1

 

As you likely see, there are no counts or "zeros" for age_groups 1 and 2 where sex=2 and so on.

 

Thank you for all of your help!

 

 

 

1 ACCEPTED SOLUTION

Accepted Solutions
ed_sas_member
Meteorite | Level 14

Hi @psnorrod 

You can use the sparse option in PROC FREQ

proc freq data=one noprint;
	tables year * sex * age_group / out=sql (drop=percent) sparse;
	weight counts;
quit;

NB: I believe that the variable COUNTS is the weight variable but not sure. Please feel free to remove it.

View solution in original post

5 REPLIES 5
ed_sas_member
Meteorite | Level 14

Hi @psnorrod 

You can use the sparse option in PROC FREQ

proc freq data=one noprint;
	tables year * sex * age_group / out=sql (drop=percent) sparse;
	weight counts;
quit;

NB: I believe that the variable COUNTS is the weight variable but not sure. Please feel free to remove it.

psnorrod
Obsidian | Level 7
working on running now.
ballardw
Super User

One approach might be the Proc Summary with the COMPLETETYPES options using your SAMPLE1 data set from your other question:

proc summary data=sample1 completetypes nway;
 class year sex age_group;
 output out=work.summary (drop=_type_ rename=(_freq_=count));
run;

which will have ALL of the class variable levels with counts of 0 for the ones that did not actually occur. So you will sex 1 & 2 with age_group 1, 2 & 3 for each year.

psnorrod
Obsidian | Level 7

Thank you all so very much!!!

 

I was able to get the solution posted by @ed_sas_member to work. And the code posted by @ballardw also worked!

 

So grateful for the help from both of you and the Programming community! I was soooo frustrated over the weekend and yesterday.

 

Again, thank you @ed_sas_member and @ballardw 

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 5 replies
  • 1289 views
  • 4 likes
  • 3 in conversation