BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
cassylovescats
Calcite | Level 5

Hello, I am trying to create a new column of a count based on multiple conditions.

 

Current Code:

libname Example "~/my_courses/Homework/FinalHomework";

data Sugar;
length DistrictGroup $ 30;
infile '~/my_courses/Homework/FinalHomework/CaneData2.csv/' dsd firstobs=2;
input District $ DistrictGroup $ DistrictPosition $ SoilID SoilName $ Area Variety $ Ratoon $ Age HarvestMonth HarvestDuration TonnHect Fibre Sugar Jul96 Aug96 Sep96 Oct96 Nov96 Dec96 Jan97 Feb97 Mar97 Apr97 May97 Jun97 Jul97 Aug97 Sep97 Oct97 Nov97 Dec97;
;
run;

data SugarLong; 
set Sugar;
array mon{*} _numeric_;
do _n_=9 to dim(mon);
Month=vname(mon[_n_]);
Count=mon[_n_];
output;
end;
drop Month District DistrictPosition SoilID SoilName Area Variety Ratoon Age HarvestMonth HarvestDuration TonnHect Fibre Sugar Jul96 Aug96 Sep96 Oct96 Nov96 Dec96 Jan97 Feb97 Mar97 Apr97 May97 
Jun97 Jul97 Aug97 Sep97 Oct97 Nov97 Dec97;
run;

data SugarLongResult;
  set SugarLong;
  select;
    when (Count > 0) Result='Yes';
    otherwise Result='No';
    end;

Proc print data=SugarLongResult (obs=10);
print;

 

My current output:                                                             DistrictGroup     Count    Result

123456789101112131415161718

Cairns/Mulgrave(dry)0.0No
Cairns/Mulgrave(dry)6.0Yes
Cairns/Mulgrave(dry)0.0No
Cairns/Mulgrave(dry)308.0Yes
Cairns/Mulgrave(dry)35.0Yes
Cairns/Mulgrave(dry)353.5Yes
Cairns/Mulgrave(dry)505.0Yes
Cairns/Mulgrave(dry)476.0Yes
Cairns/Mulgrave(dry)453.0Yes
Cairns/Mulgrave(dry)0.0No
Cairns/Mulgrave(dry)220.0Yes
Cairns/Mulgrave(dry)127.0Yes
Cairns/Mulgrave(dry)47.0Yes
Cairns/Mulgrave(dry)48.0Yes
Cairns/Mulgrave(dry)73.0Yes
Cairns/Mulgrave(dry)66.0Yes
Cairns/Mulgrave(dry)198.0Yes
Cairns/Mulgrave(dry)222.0Yes

What I'm looking to do is to get the data grouped by district group and yes/no (From above Data):

     District Group           Result          CountFinal

Cains/Mulgrave(Dry)       Yes                 15

Cains/Mulgrave(Dry)       No                   3

 

All help is appreciated (:!

1 ACCEPTED SOLUTION

Accepted Solutions
r_behata
Barite | Level 11

Proc Freq Code for your reference :

 


proc freq data=SugarLongResult noprint;
tables DistrictGroup      * Result / out= want(keep=DistrictGroup Result count) ;

run;

View solution in original post

5 REPLIES 5
Reeza
Super User

Try a PROC FREQ instead of SQL for your last step. 

 


@cassylovescats wrote:

Hello, I am trying to create a new column of a count based on multiple conditions.

 

Current Code:

libname Example "~/my_courses/Homework/FinalHomework";

data Sugar;
length DistrictGroup $ 30;
infile '~/my_courses/Homework/FinalHomework/CaneData2.csv/' dsd firstobs=2;
input District $ DistrictGroup $ DistrictPosition $ SoilID SoilName $ Area Variety $ Ratoon $ Age HarvestMonth HarvestDuration TonnHect Fibre Sugar Jul96 Aug96 Sep96 Oct96 Nov96 Dec96 Jan97 Feb97 Mar97 Apr97 May97 Jun97 Jul97 Aug97 Sep97 Oct97 Nov97 Dec97;
;
run;

data SugarLong; 
set Sugar;
array mon{*} _numeric_;
do _n_=9 to dim(mon);
Month=vname(mon[_n_]);
Count=mon[_n_];
output;
end;
drop Month District DistrictPosition SoilID SoilName Area Variety Ratoon Age HarvestMonth HarvestDuration TonnHect Fibre Sugar Jul96 Aug96 Sep96 Oct96 Nov96 Dec96 Jan97 Feb97 Mar97 Apr97 May97 
Jun97 Jul97 Aug97 Sep97 Oct97 Nov97 Dec97;
run;

data SugarLongResult;
  set SugarLong;
  select;
    when (Count > 0) Result='Yes';
    otherwise Result='No';
    end;

Proc print data=SugarLongResult (obs=10);
print;

 

My current output:                                                             DistrictGroup     Count    Result

123456789101112131415161718

Cairns/Mulgrave(dry) 0.0 No
Cairns/Mulgrave(dry) 6.0 Yes
Cairns/Mulgrave(dry) 0.0 No
Cairns/Mulgrave(dry) 308.0 Yes
Cairns/Mulgrave(dry) 35.0 Yes
Cairns/Mulgrave(dry) 353.5 Yes
Cairns/Mulgrave(dry) 505.0 Yes
Cairns/Mulgrave(dry) 476.0 Yes
Cairns/Mulgrave(dry) 453.0 Yes
Cairns/Mulgrave(dry) 0.0 No
Cairns/Mulgrave(dry) 220.0 Yes
Cairns/Mulgrave(dry) 127.0 Yes
Cairns/Mulgrave(dry) 47.0 Yes
Cairns/Mulgrave(dry) 48.0 Yes
Cairns/Mulgrave(dry) 73.0 Yes
Cairns/Mulgrave(dry) 66.0 Yes
Cairns/Mulgrave(dry) 198.0 Yes
Cairns/Mulgrave(dry) 222.0 Yes

What I'm looking to do is to get the data grouped by district group and yes/no (From above Data):

     District Group           Result          CountFinal

Cains/Mulgrave(Dry)       Yes                 15

Cains/Mulgrave(Dry)       No                   3

 

All help is appreciated (:!


 

r_behata
Barite | Level 11

Proc Freq Code for your reference :

 


proc freq data=SugarLongResult noprint;
tables DistrictGroup      * Result / out= want(keep=DistrictGroup Result count) ;

run;
cassylovescats
Calcite | Level 5

Would I be able to run chi-square off this approach?

 

 

Reeza
Super User
PROC FREQ does the chi square test, add the CHISQ option to the TABLE statement.

https://stats.idre.ucla.edu/sas/modules/an-overview-of-statistical-tests-in-sas/
cassylovescats
Calcite | Level 5

Oh my goodness, sorry, my brain is fried, I've been working on this code for about 6 hours today.

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

SAS Enterprise Guide vs. SAS Studio

What’s the difference between SAS Enterprise Guide and SAS Studio? How are they similar? Just ask SAS’ Danny Modlin.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 5 replies
  • 1308 views
  • 0 likes
  • 3 in conversation