Hi,

Could you please help me select a few observations based on certain conditions?

I have a dataset with 2 variables like below.

Variable 1   Variable 2
AA           S
AA           L
AA           W
AB           S
AC           S
AC           L
AD           L
AD           W
AE           S
AF           S
AG           L
AH           W

I want to select the observations for a Variable 1 value only if its Variable 2 values are a combination of S & L, L & W, S & W, or S & L & W. If a Variable 1 value has only S, only L, or only W, I don't want it.

The output I am expecting is shown below:

Variable 1   Variable 2
AA           S
AA           L
AA           W
AC           S
AC           L
AD           L
AD           W

Could you please help me in this regard?

Accepted Solution
Ksharp
Super User
data temp;
input Var1 $     Var2 $ ;
cards;
AA     S
AA     L
AA     W
AB     S
AC     S
AC     L
AD     L
AD     W
AE     S
AF     S
AG     L
AH     W
;
run;
proc sql;
 select * 
  from temp
   group by var1
    having count(distinct var2) gt 1;
quit;

Ksharp

Replies
art297
Opal | Level 21

I think that proc sql offers the easiest solution to implement. How about:

proc sql noprint;
  create table want as
    select *
      from have
        group by variable1
          having min(variable2) ne max(variable2)
  ;
quit;
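
Both SQL answers in this thread select detail rows while filtering on a group-level summary, so PROC SQL has to remerge the summary statistics with the original rows, and it writes a note about that to the log. A minimal sketch of an equivalent query that makes the grouping explicit through a subquery, assuming the input data set is named have and the columns are variable1 and variable2 as in the question:

```sas
proc sql;
  create table want as
    select *
      from have
      /* keep only rows whose variable1 belongs to a group */
      /* with more than one distinct variable2 value       */
      where variable1 in
        (select variable1
           from have
           group by variable1
           having count(distinct variable2) > 1)
  ;
quit;
```

The result should match the remerging versions; the subquery form is also closer to what other SQL dialects accept.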

Narasimha_Kulkarni
Calcite | Level 5

Hi Art297,

Thank you for the help, it works.

Great!

Narasimha_Kulkarni
Calcite | Level 5

Hi Ksharp,

The answer works correctly.

Thank you.

Tom
Super User

Did you mean that you want to delete the groups that have only one observation?
If your data is sorted by VARIABLE1 (is it perhaps an ID variable?) and there are no duplicate rows, then you can just use FIRST./LAST. processing.

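data want;
   set have;
   /* BY variable is the grouping variable from the question (VARIABLE1) */
   by variable1;
   /* a row that is both first and last in its group is the only row there */
   if first.variable1 and last.variable1 then delete;
run;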
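
The step above depends on the data being sorted and free of duplicate rows. A minimal sketch for the case where neither is guaranteed, assuming the input data set is named have with variables variable1 and variable2 as in the question (have_sorted is a hypothetical name used only for illustration):

```sas
/* Sort by group and value; NODUPKEY keeps one row per      */
/* (variable1, variable2) pair, so repeated values collapse */
proc sort data=have out=have_sorted nodupkey;
   by variable1 variable2;
run;

data want;
   set have_sorted;
   by variable1;
   /* after de-duplication, a one-row group has a single distinct value */
   if first.variable1 and last.variable1 then delete;
run;
```

Note that this variant also drops repeated (variable1, variable2) rows from the output, which may or may not be wanted.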

SAS_Niels
Calcite | Level 5

Hi,

I found a way using proc transpose. You can check each step individually.

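data test;
  /* variable2 holds character values (S, L, W), so it needs the $ modifier */
  input variable1 $ variable2 $;
datalines;
AA S
AA L
AA W
AB S
AC S
AC L
AD L
AD W
AE S
AF S
AG L
;
run;

proc sort data=test;
  by variable1 variable2;
run;

/* one set of value1-value3 columns per variable1 group */
proc transpose data=test out=turn prefix=value;
  by variable1;
  var variable1 variable2;
run;

/* groups with a single row have nothing in value2 and value3 */
data turn;
  set turn;
   if value2= '' and value3= '' then delete;
run;

proc transpose data=turn out=turnback (drop=_name_);
  by variable1;
  var value1 value2 value3;
run;

/* drop the empty rows left over from groups with fewer than three values */
data testend;
  set turnback;
   if variable1= '' or variable2= '' then delete;
run;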
