BookmarkSubscribeRSS Feed
jkf91
Calcite | Level 5

Hi all,

I have the following code (continuation from an earlier post):

data want;

  do until (last.id);

    set have;

    by id;

    if first.id then do;

      n=0;

            total=0;

    end;

    if month le 6 then total+packs and n+1;

  end;

  do until (last.id);

    set all_no09.Hh_all12w_hhsize_short;

    by id;

        if first.id then

       average=total/n;

      output;

    end;

run;

For some reason, it's not using the if condition (month le 6)...

Does anyone know why?

Thanks,

C

5 REPLIES 5
art297
Opal | Level 21

Its not working because the statement isn't written correctly.  I think you wanted something like:

  if month le 6 then do;

    total+packs;

    n+1;

end;

jkf91
Calcite | Level 5

still doesn't work. still adds up everything disregarding the "month le 6" condition.

data want;

  do until (last.id);

    set have;

    by id;

              if first.id then do;

                          n=0;

                          total=0;

              end;

              if month<6 then do;

       total +packs;

                                   n+1;

    end;

  end;

  do until (last.id);

    set have;

    by id;

        if first.id then average=total/n;

           output;

        end;

run;

art297
Opal | Level 21

Depends upon what you are trying to do.  The statements, as I wrote them, will exclude records from the two sums, but not exclude them from the output.  If you want to exclude all data that has a month that isn't less than 6, then I would simply include a where option in your two set statements.

I can't test the specific syntax at the moment, but you could probably use something like:

  set have (where=(month>5));

MikeZdeb
Rhodochrosite | Level 12

Hi ... here are a couple suggestions, seems to work OK (is that output what you actually want ?).

data have;

input id month packs @@;

datalines;

1 1 10 1 2 20 1 8 99 1 5 30 1 1 30

2 1 100 2 3 200 2 7 999 2 3 300 2 8 888

3 10 999 3 11 999

;

run;

data want;

total = 0; n = 0;

do until (last.id);

set have;

  by id;

  total + packs * (month lt 6);

  n + (month lt 6);

end;

if n then average = total / n;

do until (last.id);

  set have;

  by id;

  output;

end;

run;


id    month    packs    total   n    average

1       1       10       90     4      22.5

1       2       20       90     4      22.5

1       8       99       90     4      22.5

1       5       30       90     4      22.5

1       1       30       90     4      22.5

2       1      100      600     3     200.0

2       3      200      600     3     200.0

2       7      999      600     3     200.0

2       3      300      600     3     200.0

2       8      888      600     3     200.0

3      10      999        0     0        .

3      11      999        0     0        .

ballardw
Super User

I think you may not be getting some of your output because of the two DO UNTIL(LAST.ID); loops. With an explicit OUTPUT in the second one then that is the ONLY place SAS is outputting the data. You may want another OUTPUT in the first loop as well.

I would guess that you might be looking for a:

IF LAST.ID THEN OUTPUT;

And you might want a RETAIN TOTAL N .; at the begining of your data step.

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 5 replies
  • 4269 views
  • 0 likes
  • 4 in conversation