Proposed steps for Question1.
data parks monuments; set pg1.np_summary; where type in ('NM', 'NP'); Campers= sum(OtherCamping, RVCampers, TentCampers, BackcountryCampers); format campers comma20.; length Parktype $ 8; if type='NP' then do parktype= 'Park'; output parks; end; else if type='NM' then do parktype='Monument'; output monuments; end; run;
I don't understand why we would use code in pink? It made more sense for me to do the below code in red and not include the where type in statement? Also, I included my answer for question2. Instead of having to figure out exactly how many digits the sum will contain, is it ok to simply use a high number (20) and see if any data is missing?
My code:
data parks monuments;
set PG1.NP_SUMMARY;
campers= sum(OtherCamping, RVCampers, TentCampers, BackcountryCampers);
format campers comma20.;
length Parktype $ 8;
if Type= 'NP' then do Parktype='Park';
output parks;
end;
else if type='NM' then do ParkType='Monument';
output monuments;
end;
keep Reg ParkName DayVisits OtherCamping Campers ParkType;
run;
A few comments ....
The code in red works (as you have probably seen by now), but would look strange to most programmers. The suggestion by @novinosrin is the more common way of specifying a DO group.
Why use the code in pink? It is mainly a matter of efficiency. Consider the contents of the incoming data. Maybe that data:
In that case, much of the work of the DATA step would be reading in data that we never need. The WHERE statement allows SAS to inspect the data before reading it in. The DATA step never needs to read in the data that it doesn't need, saving time.
if Type= 'NP' then
do;
Parktype='Park';
output parks;
end;
else if type='NM' then
do;
ParkType='Monument';
output monuments;
end;
I don't understand why we would use code in pink? It made more sense for me to do the below code in red and not include the where type in statement?
For this trivial example, it likely doesn't matter, however, it is inefficient. If you'll only be working with two types, reducing your data set to just those two types means it will be faster since you have less records to process.
Also, I included my answer for question2. Instead of having to figure out exactly how many digits the sum will contain, is it ok to simply use a high number (20) and see if any data is missing?
Yes, this is fine and a common thing to do.
As mentioned before, both pieces of code are incorrect and I'd expect you to get syntax errors. You have END statements but no DO statements that match. If you have a DO statement with something after DO, then it doesn't enter a DO/END situation, it does that command and the DO is finished. Review the documentation for a DO/END and DO statements.
Hi @Reeza Good afternoon, The reason there isn't any syntax errors is becuase sas treats the do statement as "do iterative" rather than a conditional do group. Not needed but funny
Example:
/*iterate once*/
data w;
set sashelp.class;
if sex='F' then do gender='Female';
end;
run;
/*iterate more than once below*/
data w;
set sashelp.class;
if sex='F' then do gender='Female',"fem";
output;
end;
run;
A few comments ....
The code in red works (as you have probably seen by now), but would look strange to most programmers. The suggestion by @novinosrin is the more common way of specifying a DO group.
Why use the code in pink? It is mainly a matter of efficiency. Consider the contents of the incoming data. Maybe that data:
In that case, much of the work of the DATA step would be reading in data that we never need. The WHERE statement allows SAS to inspect the data before reading it in. The DATA step never needs to read in the data that it doesn't need, saving time.
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Follow along as SAS technical trainer Dominique Weatherspoon expertly answers all your questions about SAS Libraries.
Find more tutorials on the SAS Users YouTube channel.