BookmarkSubscribeRSS Feed
turcay
Lapis Lazuli | Level 10

Hello everyone,

 

I would like to ask you question related to data manipulation. I want to drop the noncurent year and add the current year into my data set (HAVE). Also I want to do this when my data set (HAVE) includes 12 different yearmonth value.

 

Actually, I made my desired output as below(data set want) but I am not sure whether it is the correct way to do this.

 

ıf somebody has more practical method, does s/he help me, please?

 

Data Have;
Input  ID YEARMONTH $;
Datalines;
1 201606
1 201607
1 201608
1 201609
1 201610
1 201611
1 201612
1 201701
1 201702
1 201703
1 201704
1 201705
;
Run;
Data Have2;
Input  ID YEARMONTH $;
Datalines;
1 201706
;
Run;

PROC SQL NOPRINT;
    SELECT COUNT(DISTINCT(YEARMONTH)) INTO :COUNTYEAR FROM Have;
QUIT;
%PUT &COUNTYEAR;

%MACRO LOOP;
%IF &COUNTYEAR=12 %THEN %DO;
PROC SQL;
CREATE TABLE HAVE AS
SELECT * FROM HAVE
HAVING YEARMONTH NE MIN(YEARMONTH);
QUIT;
PROC SQL;
CREATE TABLE WANT AS 
SELECT * FROM HAVE
 OUTER UNION CORR 
SELECT * FROM HAVE2
;
Quit;
%END;
%MEND LOOP;
%LOOP;

Thank you,

 

 

 

 

9 REPLIES 9
ChrisNZ
Tourmaline | Level 20

That's one way:

proc sql noprint;
  select count(distinct(YEARMONTH)), min(YEARMONTH) into :countyear,:minyear from HAVE;
quit;

data WANT;
  if &countyear=12 then set HAVE(where=(YEARMONTH ne "&minyear")) HAVE2;
  else set HAVE HAVE2;
run;

 

turcay
Lapis Lazuli | Level 10

Hello,

 

Thank you for your response. However, what if my have2 table come 201705 again, I do not want to multiplixity. I want to prevent it. I think we need to add some more controls, right?

 

Data Have;
Input  ID YEARMONTH $;
Datalines;
1 201606
1 201607
1 201608
1 201609
1 201610
1 201611
1 201612
1 201701
1 201702
1 201703
1 201705
;
Run;
Data Have2;
Input  ID YEARMONTH $;
Datalines;
1 201705
;
Run;

proc sql noprint;
  select count(distinct(YEARMONTH)), min(YEARMONTH),max(YEARMONTH) into :countyear,:minyear, :maxyear from HAVE;
quit;
%PUT &countyear.;
%PUT &minyear.;
%PUT &maxyear.;


data WANT;
  if &countyear=12 then set HAVE(where=(YEARMONTH ne "&minyear")) HAVE2;
  else set HAVE ;
run;

@ChrisNZ 

Thanks

PaigeMiller
Diamond | Level 26

There's rarely a need to use macros or macro variables to do simple data manipulation like this.


From what I can see, HAVE has 11 rows , and WANT has 12 rows, where one more month in the sequence is added. So, this code gets you there. (Or perhaps I'm not understanding the problem entirely...)

 

You also make your life much simpler by handling calendar values as actual SAS date values rather than a text string.

 

data want;
    set have end=eof;
	ym=input(yearmonth,yymmn6.);
    output;
    if eof then do;
		ym=intnx('month',ym,1,'b');
		output;
	end;
	format ym yymm6.;
run;
        

 

 

--
Paige Miller
turcay
Lapis Lazuli | Level 10

Thank you for your response but I think we both need to control 12 month count and min and max yearmonth control. If there is 12 distinct yearmonth and have data set max(yearmonth) is not equal to Have2 table max(yearmonth)

 

It should be like following but becaust it is character variable it does not work. Can you help me about it?

 

proc sql noprint;
  select count(distinct(YEARMONTH)), min(YEARMONTH),max(YEARMONTH) into :countyear,:minyear, :maxyear from HAVE;
quit;
%PUT &countyear.;
%PUT &minyear.;
proc sql noprint;
  select max(YEARMONTH) into :maxyear from HAVE2;
quit;
%PUT &maxyear.;


data WANT;
  if &countyear=12 and max(YEARMONTH) ne "&maxyear" then set HAVE(where=(YEARMONTH ne "&minyear")) HAVE2;
  else set HAVE ;
run;
PaigeMiller
Diamond | Level 26

Thanks for the explanation. As far as I can see, there is nothing that requires macros or macro variables here, but as I will not have the time to work on this the rest of today, I will try to come up with something tomorrow.

--
Paige Miller
turcay
Lapis Lazuli | Level 10

Thank you I also tried to do with following way but it does not work;

 

proc sql;
create table have3 as
select * from have
union all
select * from have2 where YEARMONTH not in(select YEARMONTH from have) ;
quit;

proc sort data=have3;
by ID descending yearmonth;
run;

proc sql noprint;
  select count(distinct(yearmonth)) into :countyear from have3;
quit;
%put &countyear;

data have;
set have3;
by id ;
if not last.id and &countyear.>12;
Run;

any suggestions? @ChrisNZ @Ksharp @Reeza @novinosrin @LinusH @Kurt_Bremser @Quentin @SuryaKiran 

 

Reeza
Super User
I think you're seriously over complicating this. Are you trying to ensure that you have a full 12 month for each ID and that it's the current month, not the last month? Please confirm if this is correct and if so, I'll post a revised solution.
PaigeMiller
Diamond | Level 26

@Reeza wrote:
I think you're seriously over complicating this. Are you trying to ensure that you have a full 12 month for each ID and that it's the current month, not the last month? Please confirm if this is correct and if so, I'll post a revised solution.

@Reeza makes a very good point. Is it correct? You haven't given this explanation, and quite honestly, the more I think about what you said, the less clear I am on what you want. The explanation you have given doesn't seem general enough to program, except for the very specific original data set you provided, so its not difficult to provide a solution for that one original data set with 11 observations (and I have done that) but obviously you have a more general problem and we are not sure exactly what that more general problem is.

--
Paige Miller

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 9 replies
  • 2679 views
  • 2 likes
  • 5 in conversation