Accepted Solutions
SubbuPaz
SAS Employee

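Try this code. It numbers the rows within each ID, transposes the states into State1, State2, ... columns, and then flags whether all states (want1) or all scores (want2) within an ID are the same:
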
data have;
input ID state $ city $ score @@;
datalines;
1 A A 100
1 A B 100
1 A C 101
1 B D 102
2 B E 99
2 B F 99
2 B G 99
3 A C 88
4 C H 120
4 D J 110
4 E H 111
4 E I 121
;
run;

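data tmp_want_state;
set have;
by id state city; /* HAVE must already be sorted by id state city */
retain state_num;
/* Number the rows within each ID: 1, 2, 3, ... */
if first.id then state_num = 1;
else state_num = state_num + 1;
/* CATS strips blanks and avoids the numeric-to-character conversion note */
state_id = cats('State', state_num);
run;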

proc transpose data=tmp_want_state out=tmp_want;
by ID;
id state_id;
var state;
run;

proc sql;
create table want1 as
select
a.*, b.same_state
from tmp_want a
left join
(select
ID, count(distinct state) as num_distinct_states,
case when count(distinct state) = 1 then 'yes'
else 'no'
end as same_state
from have
group by ID)b
on a.ID = b.ID;

create table want2 as
select
a.*, b.same_score
from tmp_want a
left join
(select
ID, count(distinct score) as num_distinct_scores,
case when count(distinct score) = 1 then 'yes'
else 'no'
end as same_score
from have
group by ID)b
on a.ID = b.ID;

quit;

 


3 REPLIES
ballardw
Super User

In my opinion, creating those data sets isn't a particularly helpful way to clean the data.

 

I would start with something like:

Proc freq data=have;
   tables id*state*score / list;
run;

This gives counts of each combination and lists the differences next to each other.

An output data set could also be made with the counts and filtered to only those where the count indicates a problem.
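
A minimal sketch of that output data set idea, assuming the only question is whether an ID has more than one state. The names state_counts, problem_ids and n_states are made up for illustration:

```sas
/* One row per observed ID*STATE combination, with its COUNT */
proc freq data=have noprint;
   tables id*state / out=state_counts (drop=percent);
run;

/* Keep only the IDs that appear with more than one distinct state */
proc sql;
   create table problem_ids as
   select id, count(*) as n_states
   from state_counts
   group by id
   having count(*) > 1;
quit;
```

With the sample data in this thread, that should leave IDs 1 and 4. The same pattern with id*score in the TABLES statement would cover the scores.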

Or consider REPORTS instead of data sets:

 

Proc tabulate data=have;
   class id state ;
   table id,
           state
           /misstext=' '
  ;
run;
Quentin
Super User

One way to approach this would be to count the number of unique values for STATE for each subject.  

 

With a data step, you could do this using BY-group processing, like:

 

data have;
input ID state $ city $ score @@;
datalines;
1 A A 100
1 A B 100
1 A C 101
1 B D 102
2 B E 99
2 B F 99
2 B G 99
3 A C 88
4 C H 120
4 D J 110
4 E H 111
4 E I 121
;
run;

data want (keep=id statecount);
  set have (keep=id state);
  by id state ;

  if first.id then statecount=0 ;      *If this is a new ID, set a counter variable to 0 ;
  if first.state then statecount + 1 ; *If this is a new state, increment the counter (sum statement, so statecount is retained) ;
  if last.id ;                         *If this is the last record for an ID, use a subsetting IF to select it ;

  put (id statecount)(=) ;
run ;
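
If you want the count back on every row rather than one row per ID, a possible follow-up is a one-to-many merge. This is only a sketch: it assumes HAVE is sorted by ID and that WANT is the data set created by the step above. FLAGGED and same_state are hypothetical names:

```sas
/* Attach the per-ID state count to every row of HAVE */
data flagged;
  merge have want;           /* WANT has one row per ID, HAVE has many */
  by id;
  length same_state $3;
  /* An ID with exactly one distinct state is consistent */
  if statecount = 1 then same_state = 'yes';
  else same_state = 'no';
run;
```

The same approach would work for SCORE, but the data would need to be sorted by id score first so that first.score marks each new value.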