BookmarkSubscribeRSS Feed
hnb_matt_d
Obsidian | Level 7

I have a dataset that has counts of activity types grouped by month and year (for a 13 month period).  This dataset will be compiled monthly (at the beginning of each month for the preceding 13 months), so the immediately preceding month (December, in January's instance) will have two (2) appearances.  One for the month immediately preceding (December 2022) and one for 13 months ago (December 2021). 

 

I've also been tasked with re-creating the last seven (7) months' data (June 2022 through December 2022).  Everything is working outside the proc transpose not ordering dates (months/years - or however this needs to be formatted) chronologically from left to right.  More specifically, March 2022 continually appears as the last month listed (to the right) with every other month/year appearing in order as wanted.  

 

data volumes;
set volumes_1;
format month_2 monyy.;
month_2 = input(month !! "01",yymmdd8.);
run;

^This code creates a new variable, month_2, in a date format.  Full disclosure, I've run the proc transpose (included below) using a date formatted month_2 and a character formatted month variable with the same results (March 2022 not cooperating and appearing in order).  

 

proc sort data=volumes (drop=month);
by transaction_type_code month_2;
run;


proc transpose data=volumes out=PIVOT (drop=_:);
by transaction_type_code;
id month_2;
var volume;
run;

^There's the sort and subsequent proc transpose, to result in:

hnb_matt_d_0-1674758057721.png

 

I've utilized multiple date formats for month_2, with none of them working properly.  HELP!  

 

 

14 REPLIES 14
data_null__
Jade | Level 19

Create a dummy level of 

transaction_type_code=.

with missing value and one record for each MONTH in the correct order.

ballardw
Super User

If you don't have all of the months in every level of the by variable, such as your transaction_type_code variable then that is likely. The order of appearance of the ID values sets which order the variable appears.

 

I generally don't worry about the order of variables in a data set as anytime people need to read things you specify the order in the report generating code.

 

You can quite often complicate a task by placing actual date into variable names like this.

 

Without actual data or an example of what you need for the output it is a tad difficult to suggest another approach.

 

PaigeMiller
Diamond | Level 26

If the goal is to create a report or an Excel output with calendar names as the column headings, then PROC TRANSPOSE simply is not the way to go. There is nothing in PROC TRANSPOSE that will force the result into calendar order.

 

Instead you want to use PROC REPORT with an ACROSS variable and the option ORDER=INTERNAL. If you do this, then this problem you are having goes away, and furthermore doesn't involve you creating a transposed data set. EXAMPLE

 

 

--
Paige Miller
SASJedi
SAS Super FREQ

Since you didn't provide sample data, I took the liberty of creating some to play with:

/*Make fake data*/
data volumes;
	call streaminit(12345);
	do transaction_type_code = "CHECK","CASH","CARD";
		DO YEAR=2021 TO 2022;
			do Month=1 TO 12;
				month_2 = input(CATS(YEAR,PUT(month,Z2.),"01"),yymmdd8.);
				Volume=rand('INTEGER',10,1000);
				IF month_2 >= '01DEC2021'd then output;
			end;
		end;
	end;
format month_2 monyy.; DROP YEAR;
run;

This produced an input data set that looked like this:

SASJedi_2-1674760877309.png

Then I ran your code against that data. The only mod I did was to drop YEAR, which you didn't have in your original data:

proc sort data=volumes (drop=month year);
	by transaction_type_code month_2;
run;

proc transpose data=volumes out=PIVOT (drop=_:);
   by transaction_type_code;
   id month_2; 
   var volume;
run;

This is the result from SAS 9.4 M7, Windowing environment (PC SAS):

SASJedi_3-1674761071977.png

From Enterprise Guide 8.3 running on SAS 9.4M7 (I run in dark mode)

SASJedi_4-1674761164678.png

And from Viya 2023. (also in dark mode):

SASJedi_6-1674761343520.png

I can't seem to replicate your results using your code. Is it possible to share a few rows of the original volumes data set without violating privacy?

 

Check out my Jedi SAS Tricks for SAS Users
hnb_matt_d
Obsidian | Level 7

Can't really share the specific transaction_type _code.  I can tell you there are ~400 of them with ~2.5 billion occurrences across those 400 in the last 13 months.  

 

Also, this will be an automated code execution and output, so I'd rather not hard-code dates/years.  Need this to be as self-sustainingly dynamic as possible.  

PaigeMiller
Diamond | Level 26

@hnb_matt_d wrote:

Need this to be as self-sustainingly dynamic as possible.  


PROC REPORT! I have already provided a small example.

--
Paige Miller
hnb_matt_d
Obsidian | Level 7

Reviewing this now.  I may take overnight to digest, @PaigeMiller.  It's been an interesting Thursday thus far.  

hnb_matt_d
Obsidian | Level 7

This really looks like the same thing that proc tabulate was doing (which seemed to work correctly), but I need the Output Data formatted and not just the Results tab to be correct.  

PaigeMiller
Diamond | Level 26

but I need the Output Data formatted and not just the Results tab to be correct.  

 

I don't understand what this means, but there are a gazillion and 12 different formatting options in PROC REPORT.

--
Paige Miller
hnb_matt_d
Obsidian | Level 7

The proc report example provided also doesn't work with the date formatting needed.  These transaction code volumes are being aggregated by month.  Having a date of 1/1/2022 appearing to represent January 2022 won't work.  The inconsistency in the presentation will be noticed and called out.  This, unfortunately, is a compliance-based document and has to adhere to established guidelines.  Not my call on that.  

PaigeMiller
Diamond | Level 26

@hnb_matt_d wrote:

These transaction code volumes are being aggregated by month.  Having a date of 1/1/2022 appearing to represent January 2022 won't work.  


All you have to do is change the format.

--
Paige Miller
Reeza
Super User

Building on @SASJedi solution a brute force method, that's not ideal is to select the columns in the date order from a list of macro variables. There's a few ways of doing this but it's data driven. Basically adds two more steps to your code. See the last two steps in the code below. Note because it's data driven, if you change from 7 to 12 months, you could either just select a smaller amount by modifying the SQL query or if the data changes to just 7 months it would be data driven so you have some options to make it dynamic.

 

/*Make fake data*/
data volumes;
	call streaminit(12345);
	do transaction_type_code = "CHECK","CASH","CARD";
		DO YEAR=2021 TO 2022;
			do Month=1 TO 12;
				month_2 = input(CATS(YEAR,PUT(month,Z2.),"01"),yymmdd8.);
				Volume=rand('INTEGER',10,1000);
				IF month_2 >= '01DEC2021'd then output;
			end;
		end;
	end;
format month_2 monyy.; 
run;

proc sort data=volumes (drop=month year);
	by transaction_type_code month_2;
run;

proc transpose data=volumes out=PIVOT (drop=_:);
   by transaction_type_code;
   id month_2; 
   var volume;
run;

proc sql noprint;
select distinct month_2 into :date_list separated by " ," from volumes
order by month_2;
quit;

proc sql;
create table volumes_report as 
select transaction_type_code, &date_list
from pivot;
quit;

 

 

 

 

ballardw
Super User

@hnb_matt_d wrote:

Can't really share the specific transaction_type _code.  I can tell you there are ~400 of them with ~2.5 billion occurrences across those 400 in the last 13 months.  

 

Also, this will be an automated code execution and output, so I'd rather not hard-code dates/years.  Need this to be as self-sustainingly dynamic as possible.  


Have you verified that each transaction code has a record in each month?

I suspect you may be leaving out some details. If you have duplicate month values for any given transaction code using the shown code would fail with an error similar to this:

ERROR: The ID value "DEC22" occurs twice in the same BY group.

Which means that the maximum number of records you could be dealing with is 400 codes * 13 months and expect the transpose to work which is WAY short of 2.5 billion.

hnb_matt_d
Obsidian | Level 7

hnb_matt_d_0-1674762449769.png

 

Working with this version...  

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 14 replies
  • 925 views
  • 2 likes
  • 6 in conversation