BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
pbhatt
Calcite | Level 5

I am trying to create a count variable based on a numeric date variable (format yyq6.), but I am unable to get what I want where each new date value is ennumerated

DATE      TCOUNT

2013Q1   1

2013Q1   1

2013Q2   2

2013Q2   2

2013Q2   2

2013Q3   3

2013Q3   3

2013Q3   3

2013Q4   4

2013Q4   4

2014Q1   5

2014Q1   5

2014Q2   6

2014Q2   6

 

Below is the code that I have:

 

DATA WORK.TEST1;
SET HAVE;
LABEL TCOUNT = "Time Count";
TCOUNT = 0;
RUN;

 

PROC SORT DATA = WORK.TEST1;
BY DATE ;
RUN;

 

DATA WORK.TEST1;
SET WORK.TEST1;
BY DATE;
IF FIRST.DATE THEN TCOUNT = TCOUNT + 1;
RUN;

 

The output is the following:

DATE      TCOUNT

2013Q1   1

2013Q1   0

2013Q1   0

2013Q1   0

2013Q1   0

2013Q2   0

2013Q2   0

2013Q2   0

2013Q3   0

2013Q3   0

2013Q4   0

2013Q4   0

2013Q4   0

2013Q4   0

2014Q1   0

2014Q1   0

2014Q1   0

...             0

1 ACCEPTED SOLUTION

Accepted Solutions
PaigeMiller
Diamond | Level 26

You are assigning the value zero to TCOUNT for every observation, but that is the problem. Also, RETAIN TCOUNT; is not wrong, but it is also not necessary.

 

data work.test1;
    set have;
    label tcount = "time count";
    /*tcount = 0;*/
run;

proc sort data = work.test1;
    by date ;
run;

data work.test1;
    set work.test1;
/*    retain tcount; */
    by date;
    if first.date then tcount + 1;
run;

Thus, the code can be simplified to

 

proc sort data = have;
    by date ;
run;

data work.test1;
    set have;
    by date;
    if first.date then tcount + 1;
run;

 

--
Paige Miller

View solution in original post

6 REPLIES 6
jimbarbour
Meteorite | Level 14

Have you tried putting a RETAIN on TCOUNT?  Or, alternatively, change it to TCOUNT + 1 instead of TCOUNT = TCOUNT + 1.  TCOUNT + 1 syntax causes an automatic retain.

 

I would also get rid of the Data step before the Sort.  Apply the label in the final step.  If you create TCOUNT before the final Data step, TCOUNT will not be a counter.  TCOUNT will get read in as zero each time.

 

I would fix it like this:

DATA	Have;
	FORMAT	DATE	YYQ6.;
	INFILE DATALINES	TRUNCOVER;
	INPUT	DATE	:	YYQ6.;
DATALINES;
2013Q1
2013Q1
2013Q2
2013Q2
2013Q2
2013Q3
2013Q3
2013Q3
2013Q4
2013Q4
2014Q1
2014Q1
2014Q2
2014Q2
;
RUN;

PROC SORT DATA 		= 	Have
			OUT		=	WORK.TEST1;
	BY	DATE;
RUN;

DATA WORK.TEST1;
	SET WORK.TEST1;
		BY	DATE;

	LABEL TCOUNT	= 	"Time Count";

	IF FIRST.DATE 		THEN
		TCOUNT 		+ 	1;
RUN;

Results:

jimbarbour_0-1628709139759.png

 

Jim

pbhatt
Calcite | Level 5
I tried the following but it resulted in the same output. I also referenced this article but am still running into hurdles
https://stats.idre.ucla.edu/sas/faq/how-can-i-create-an-enumeration-variable-by-groups/

DATA WORK.TEST1;
SET HAVE;
LABEL TCOUNT = "Time Count";
TCOUNT = 0;
RUN;

PROC SORT DATA = WORK.TEST1;
BY DATE ;
RUN;

DATA WORK.TEST1;
SET WORK.TEST1;
RETAIN TCOUNT;
BY DATE;
IF FIRST.DATE THEN TCOUNT + 1;
RUN;
PaigeMiller
Diamond | Level 26

You are assigning the value zero to TCOUNT for every observation, but that is the problem. Also, RETAIN TCOUNT; is not wrong, but it is also not necessary.

 

data work.test1;
    set have;
    label tcount = "time count";
    /*tcount = 0;*/
run;

proc sort data = work.test1;
    by date ;
run;

data work.test1;
    set work.test1;
/*    retain tcount; */
    by date;
    if first.date then tcount + 1;
run;

Thus, the code can be simplified to

 

proc sort data = have;
    by date ;
run;

data work.test1;
    set have;
    by date;
    if first.date then tcount + 1;
run;

 

--
Paige Miller
pbhatt
Calcite | Level 5
Thanks so much to Paige and Jim! It worked. I was under the impression I needed to initialize TCOUNT by setting it to zero after I created its label, but that is not the case.
jimbarbour
Meteorite | Level 14

Yes, actually quite the opposite.  By adding TCOUNT to the dataset before the Data step counting TCOUNT, you caused TCOUNT to be read in each time the SET statement was executed, wiping out any count and replacing it with the initial value (zero).  You could have initialized TCOUNT in the same Data step as the counting, but it would not have been necessary.

 

Glad we got you working,

 

Jim

jimbarbour
Meteorite | Level 14

Read my response in detail.  I added an example after I initially posted it.  

 

This Data step is screwing things up.  It needs to be deleted.  

DATA WORK.TEST1;
SET HAVE;
LABEL TCOUNT = "Time Count";
TCOUNT = 0;
RUN;

Jim

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 6 replies
  • 1747 views
  • 1 like
  • 3 in conversation