First. And last .

Akshaya_1397 · Posted 06-30-2023 05:16 AM

Hi,

I have a datset final in which there are columns like

Id,name,function,status,reason, department and has values

123,flowers-gift,nw,onboard,blank SLS,human resource

123,flowers-gifts,nw,onboard,missing country,Human resource

123,onboard gift,nw,onboard,missing country,

I need to take unique rows so I performed first. In the dataset

Data new;

Set final;

If first.id and first.name and first.department then output;

Run;

The output I'm getting is only the first row that is with the reason blank SSL but I also need the second row with missing country.

Could anyone please help me on this

PaigeMiller · Posted 06-30-2023 05:44 AM

I have a feeling that you are not representing the data in your SAS data set properly. Please provide the data (in this thread, and from now on in all future threads) as working SAS data step code (examples and instructions).

--
Paige Miller

Akshaya_1397 · Posted 06-30-2023 05:56 AM

Sorry for that.
I have now edited the it properly

PaigeMiller · Posted 06-30-2023 05:58 AM

@Akshaya_1397 wrote:
Sorry for that.
I have now edited the it properly

This is not the format we requested.

--
Paige Miller

Kurt_Bremser · Posted 06-30-2023 07:31 AM

So you have the texts „blank SLS“ and „missing country“ in place of missing values?

I will not answer further posts until you post your data as requested.

Maxims of Maximally Efficient SAS Programmers
How to convert datasets to data steps
The macro for direct download as ZIP
How to post code
Please vote for Provide Sequential Search Capability for Hash Objects
How to deal with locked files on UNIX

Kurt_Bremser · Posted 06-30-2023 05:50 AM

You show six variables, but only five values in the data.

Please do ALWAYS post example data in a working DATA step with DATALINES, so we don‘t have to ask.

Maxims of Maximally Efficient SAS Programmers
How to convert datasets to data steps
The macro for direct download as ZIP
How to post code
Please vote for Provide Sequential Search Capability for Hash Objects
How to deal with locked files on UNIX

Akshaya_1397 · Posted 06-30-2023 06:49 AM

i have dataset name merged file as shown below:

i wrote a code to obtain unique rows:

DATA fin;
set Merged_file;
by ID dep status reason;
if first.id and first.dep and first.status and first.reason then output;
run;

the output which im getiing is only the first row.but the output im expecting is the first two rows as the reason is different

can anyone please help me on this

PaigeMiller · Posted 06-30-2023 07:11 AM

Please provide the data in the requested form (we provided a link with examples and instructions). Please don't start new threads on the same topic.

--
Paige Miller

Kurt_Bremser · Posted 06-30-2023 07:34 AM

I merged the posts.

Maxims of Maximally Efficient SAS Programmers
How to convert datasets to data steps
The macro for direct download as ZIP
How to post code
Please vote for Provide Sequential Search Capability for Hash Objects
How to deal with locked files on UNIX

Tom · Posted 06-30-2023 08:24 AM

@Akshaya_1397 wrote:

i have dataset name merged file as shown below:

i wrote a code to obtain unique rows:

DATA fin;
set Merged_file;
by ID dep status reason;
if first.id and first.dep and first.status and first.reason then output;
run;

the output which im getiing is only the first row.but the output im expecting is the first two rows as the reason is different

can anyone please help me on this

This test

if first.id and first.dep and first.status and first.reason then output;

indicates a misunderstanding of how FIRST. flags are created.

That test is the same as

if first.id then output;

because the time when all of the FIRST. flags will be true is when you start a new value of the first BY variable, in this case the ID variable.

If you want to find the distinct observations then you should just test the value of the last BY variable, in this case the REASON variable.

if first.reason then output;

Akshaya_1397 · Posted 06-30-2023 08:34 AM

Thanks for ur reply tom.

But we have exact same reasons for other I'd too that's why I used first.id and first.reason.

Is there any other way to do it?

Tom · Posted 06-30-2023 08:37 AM

@Akshaya_1397 wrote:

Thanks for ur reply tom.

But we have exact same reasons for other I'd too that's why I used first.id and first.reason.

Is there any other way to do it?

Do WHAT?

Are you look to get one observation per distinct values of REASON?

If so then sort by REASON.

proc sort data=have out=one_per_reason nodupkey ;
  by reason;
run;

Tom · Posted 06-30-2023 08:54 AM

All of the rows in your example dataset are unique already. Please provide example data that has one or more duplicates that you would like to remove. Explain which ones should be removed (or which ones kept) and why.

data have ;
  infile cards dsd truncover ;
  input Id $ name :$20. function :$5. status :$20. reason :$20. department :$20.;
cards;
123,flowers-gift,nw,onboard,blank SLS,human resource
123,flowers-gifts,nw,onboard,missing country,Human resource
123,onboard gift,nw,onboard,missing country, 
;

Obs    Id         name         function    status     reason               department

 1     123    flowers-gift        nw       onboard    blank SLS          human resource
 2     123    flowers-gifts       nw       onboard    missing country    Human resource
 3     123    onboard gift        nw       onboard    missing country

Akshaya_1397 · Posted 06-30-2023 09:08 AM

data have ;
infile cards dsd truncover ;
input Id $ name :$20. function :$5. status :$20. reason :$20. department :$20.;
cards;
123,flowers-gift,nw,onboard,blank SLS,human resource
123,flowers-gifts,nw,onboard,missing country,Human resource
123,onboard gift,nw,onboard,missing country,

124,flowers-gift,nw,onboard,blank SLS,human resource
124,flowers-gifts,nw,onboard,missing country,Human resource
124,onboard gift,nw,onboard,missing country,

125,flowers-gift,nw,onboard,blank SLS,human resource

126,flowers-gifts,nw,onboard,missing country,Human resource
126,onboard gift,nw,onboard,missing country,

;

proc sort data= have; by reason; run;

data haveuniq;
set have;
by reason;
if first.reason then output;
run;

the output which im expecting is shown below:

123,flowers-gift,nw,onboard,blank SLS,human resource
123,flowers-gifts,nw,onboard,missing country,Human resource

124,flowers-gift,nw,onboard,blank SLS,human resource
124,flowers-gifts,nw,onboard,missing country,Human resource

125,flowers-gift,nw,onboard,blank SLS,human resource
126,flowers-gifts,nw,onboard,missing country,Human resource

the output im getting is

this is what is confusing me.

Tom · Posted 06-30-2023 09:48 AM

Why does the output confuse you? There are only two distinct values of REASON so you should only get two observations out when selecting based on distinct values of REASON.

What is the reason that you want to remove the three observations you want removed?

The only thing I can see that would produce that output would be to just delete the observations with blank values of DEPARTMENT.

data have ;
  infile cards dsd truncover ;
  input Id $ name :$20. function :$5. status :$20. reason :$20. department :$20.;
cards;
123,flowers-gift,nw,onboard,blank SLS,human resource
123,flowers-gifts,nw,onboard,missing country,Human resource
123,onboard gift,nw,onboard,missing country,
124,flowers-gift,nw,onboard,blank SLS,human resource
124,flowers-gifts,nw,onboard,missing country,Human resource
124,onboard gift,nw,onboard,missing country,
125,flowers-gift,nw,onboard,blank SLS,human resource
126,flowers-gifts,nw,onboard,missing country,Human resource
126,onboard gift,nw,onboard,missing country,
;


data expect ;
  infile cards dsd truncover ;
  input Id $ name :$20. function :$5. status :$20. reason :$20. department :$20.;
cards;
123,flowers-gift,nw,onboard,blank SLS,human resource
123,flowers-gifts,nw,onboard,missing country,Human resource
124,flowers-gift,nw,onboard,blank SLS,human resource
124,flowers-gifts,nw,onboard,missing country,Human resource
125,flowers-gift,nw,onboard,blank SLS,human resource
126,flowers-gifts,nw,onboard,missing country,Human resource
;


data no_miss_department;
  set have;
  if not missing(department) then output;
run;

proc compare data=no_miss_department compare=expect;
run;

First. And last .

Re: First. And last .

Re: First. And last .

Re: First. And last .

Re: First. And last .

Re: First. And last .

first dot in base sas

Re: first dot in base sas

Re: first dot in base sas

Re: first dot in base sas

Re: first dot in base sas

Re: first dot in base sas

Re: First. And last .

Re: First. And last .

Re: First. And last .

Catch up on SAS Innovate 2026

Catch up on SAS Innovate 2026

SAS Training: Just a Click Away