SAS Data Integration Studio, DataFlux Data Management Studio, SAS/ACCESS, SAS Data Loader for Hadoop and others

Deleting an ID with single row from a dataset containing multiple rows for each ID

Reply
Occasional Contributor
Posts: 14

Deleting an ID with single row from a dataset containing multiple rows for each ID

Hi,

I AM WORKING ON A SAS DATA SET  CONTAINING 1999 IDs. MOST OF THEM CONTAINS MULTIPLE ROWS (APPROXIMATELY 100000 ROWS IN TOTAL ). THERE IS ANOTHER VARIABLE IN THE DATA SET NAMED "HISTORY" HAVING VALUES "Y" AND "N".

I WANT TO DELETE THE IDs WHO HAVE ONLY ONE ROW AND HISTORY VALUE "Y". 

I AM NEW SAS USER AND COULDN'T FIGURE OUT HOW TO DO THIS. PLEASE HELP ME. ATTACHED IS A SAMPLE FILE.  

PROC Star
Posts: 1,591

Re: Deleting an ID with single row from a dataset containing multiple rows for each ID

Please don't post the same question multiple times. I answered earlier 

Occasional Learner
Posts: 1

Re: Deleting an ID with single row from a dataset containing multiple rows for each ID

data temp;

   input id $ history $;

   datalines;

T101 Y

T101 Y

T101 Y

T102 Y

T103 Y

T103 Y

;

run;

 

proc sql;

    create table count as

   select id, count(id) as count, history

  from temp

  group by id

;

quit;

 

data temp2;

   set count (where=(history='Y' and count > 1));

run;

Ask a Question
Discussion stats
  • 2 replies
  • 117 views
  • 0 likes
  • 3 in conversation