BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
RTao
Calcite | Level 5

Hi, basically what I need is based on where the Key falls in the interval for each quarter, a column indicating the proper group it belongs should be created from have1. Left join won't work since I don't really have a unique key, this is simply a lookup based on a range. Any help is appreciated!

 

Example: 

 

have1: 

QuarterKey
131.39715
153.59118
184.20986
181.27638
258.62042
212.07567
291.42742
369.06109
453.8079
433.30056

 

have 2: 

QuarterGroupCutoff
1A20
1B50
1C90
1D100
2A10
2B20
2C30
2D100
3A60
3B70
3C80
3D100
4A10
4B60
4C99
4D100

 

want: 

QuarterKeyGroup
131.39715B
153.59118C
184.20986C
181.27638C
258.62042D
212.07567B
291.42742D
369.06109A
453.8079A
433.30056A
1 ACCEPTED SOLUTION

Accepted Solutions
SASJedi
SAS Super FREQ

Here is a possible approach using formats. First, create the data HAVE sets:

data have1;
infile datalines dsd dlm='|';
input Quarter Key;
datalines;
1|31.39715
1|53.59118
1|84.20986
1|81.27638
2|58.62042
2|12.07567
2|91.42742
3|69.06109
4|53.8079
4|33.30056
;


data have2;
infile datalines dsd dlm='|';
input Quarter Group:$1. Cutoff;
datalines;
1|A|20
1|B|50
1|C|90
1|D|100
2|A|10
2|B|20
2|C|30
2|D|100
3|A|60
3|B|70
3|C|80
3|D|100
4|A|10
4|B|60
4|C|99
4|D|100
;

Next, use the HAVE2 dataset to create a series of formats, one for each Quarter:

data cntlin;
	set have2 (rename=(Group=Label));
	by quarter ;
	drop quarter cutoff;
	fmtname=cats('qtr',quarter,'_');
    Start=put(lag1(Cutoff),32.);
	End=put(Cutoff,32.);
	if first.quarter then Start='LOW';
run;

proc format cntlin=cntlin fmtlib;
run;

Finally, apply the appropriate format to the Key values using PUTN, which allows assigning the format at execution time:

data want;
	set have1;
	Group=putn(Key,cats('qtr',quarter,'_'));
run;

This produces the desired results:

Obs Quarter Key Group
1 1 31.3972 B
2 1 53.5912 C
3 1 84.2099 C
4 1 81.2764 C
5 2 58.6204 D
6 2 12.0757 B
7 2 91.4274 D
8 3 69.0611 B
9 4 53.8079 B
10 4 33.3006 B

 

Note that the values for Quarter 4 different from your "Want" data, but these values are correct based on the cutoff values provided in your HAVE2 dataset.

May the SAS be with you!
Mark

 

Check out my Jedi SAS Tricks for SAS Users

View solution in original post

2 REPLIES 2
SASJedi
SAS Super FREQ

Here is a possible approach using formats. First, create the data HAVE sets:

data have1;
infile datalines dsd dlm='|';
input Quarter Key;
datalines;
1|31.39715
1|53.59118
1|84.20986
1|81.27638
2|58.62042
2|12.07567
2|91.42742
3|69.06109
4|53.8079
4|33.30056
;


data have2;
infile datalines dsd dlm='|';
input Quarter Group:$1. Cutoff;
datalines;
1|A|20
1|B|50
1|C|90
1|D|100
2|A|10
2|B|20
2|C|30
2|D|100
3|A|60
3|B|70
3|C|80
3|D|100
4|A|10
4|B|60
4|C|99
4|D|100
;

Next, use the HAVE2 dataset to create a series of formats, one for each Quarter:

data cntlin;
	set have2 (rename=(Group=Label));
	by quarter ;
	drop quarter cutoff;
	fmtname=cats('qtr',quarter,'_');
    Start=put(lag1(Cutoff),32.);
	End=put(Cutoff,32.);
	if first.quarter then Start='LOW';
run;

proc format cntlin=cntlin fmtlib;
run;

Finally, apply the appropriate format to the Key values using PUTN, which allows assigning the format at execution time:

data want;
	set have1;
	Group=putn(Key,cats('qtr',quarter,'_'));
run;

This produces the desired results:

Obs Quarter Key Group
1 1 31.3972 B
2 1 53.5912 C
3 1 84.2099 C
4 1 81.2764 C
5 2 58.6204 D
6 2 12.0757 B
7 2 91.4274 D
8 3 69.0611 B
9 4 53.8079 B
10 4 33.3006 B

 

Note that the values for Quarter 4 different from your "Want" data, but these values are correct based on the cutoff values provided in your HAVE2 dataset.

May the SAS be with you!
Mark

 

Check out my Jedi SAS Tricks for SAS Users
Patrick
Opal | Level 21

And here a SQL way using the source data created by @SASJedi 

The proposed data step with formats will perform better so I'd certainly be using this approach for any bigger data volumes.

The advantage of pure SQL is that it could execute in-database (with a few amendments to the current SAS SQL flavor) if both of your source tables are in a database.

proc sql;
  select 
    h1.quarter
    ,h1.key
    ,h2.group
  from have1 h1 left join have2 h2
    on h1.quarter=h2.quarter
      and h1.key <= h2.Cutoff
  group by h1.quarter, h1.key
  having h2.Cutoff-h1.key = min(h2.Cutoff-h1.key)
  ;
quit;

 

Patrick_0-1699925712287.png

 

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 2 replies
  • 542 views
  • 5 likes
  • 3 in conversation