BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Babloo
Rhodochrosite | Level 12

I have a data as follows and I need to find the top 10 and last 10 values from a variable. Appreciate if someone of you help me to achieve the desired output.

 

Id
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24

 

Desired output:

 

top_10 last_10
1 15
2 16
3 17
4 18
5 19
6 20
7 21
8 22
9 23
10 24
1 ACCEPTED SOLUTION

Accepted Solutions
Ksharp
Super User

Assuming the table has been sorted.

 

 

data have;
input Id;
cards;
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
;

data first_10 last_10;
 set have nobs=nobs;
 if _n_ lt 11 then output first_10;
 if _n_ gt nobs-10 then output last_10;
run;
data want;
 merge first_10(rename=(id=first_10)) last_10(rename=(id=last_10));
run;

 

View solution in original post

10 REPLIES 10
Kurt_Bremser
Super User

Read into an array, and use the nobs=option to help in determining the last 10 items:

data want (keep=first_10 last_10);
array first {10};
array last {10};
n = 0;
do until (eof);
  n + 1;
  set have nobs=nobs end=eof;
  if n <= 10 then first{n} = id;
  if n >= nobs - 9 then last{n+10-nobs} = id;
end;
do n = 1 to 10;
  first_10 = first{n};
  last_10 = last{n};
  output;
end;
run;
Babloo
Rhodochrosite | Level 12

I ran your code and it is not producing the desired results. It is generating missing values for the variables to first_10 and last_10.

Kurt_Bremser
Super User

@Babloo wrote:

I ran your code and it is not producing the desired results. It is generating missing values for the variables to first_10 and last_10.


Maybe that's because you once again failed to provide proper example data?

data have;
do id = 1 to 24;
output;
end;
run;

data want (keep=first_10 last_10);
array first {10};
array last {10};
n = 0;
do until (eof);
  n + 1;
  set have nobs=nobs end=eof;
  if n <= 10 then first{n} = id;
  if n >= nobs - 9 then last{n+10-nobs} = id;
end;
do n = 1 to 10;
  first_10 = first{n};
  last_10 = last{n};
  output;
end;
run;

proc print data=want noobs;
run;

Result:

first_10    last_10

    1          15  
    2          16  
    3          17  
    4          18  
    5          19  
    6          20  
    7          21  
    8          22  
    9          23  
   10          24  

Just what you wanted.

PaigeMiller
Diamond | Level 26

 

PROC RANK can do this, and can also handle ties if that is a problem. You could also use PROC SORT to find the top 10 and bottom 10, but it will not handle ties. The code provided by @Kurt_Bremser does not seem to handle ties either.

--
Paige Miller
Babloo
Rhodochrosite | Level 12

Could you please help me understand how will you do in proc sort?

PaigeMiller
Diamond | Level 26

@Babloo wrote:

Could you please help me understand how will you do in proc sort?


If you run PROC SORT then the first 10 values are the lowest numbers and the last 10 values are the biggest numbers. You extract the top ten and bottom ten from the list and merge them together to get your desired result.

--
Paige Miller
Ksharp
Super User

Assuming the table has been sorted.

 

 

data have;
input Id;
cards;
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
;

data first_10 last_10;
 set have nobs=nobs;
 if _n_ lt 11 then output first_10;
 if _n_ gt nobs-10 then output last_10;
run;
data want;
 merge first_10(rename=(id=first_10)) last_10(rename=(id=last_10));
run;

 

FreelanceReinh
Jade | Level 19

@Babloo: From your sample data it is not clear whether you mean the first and last ten or the smallest and largest ten values. Hence the confusion and different approaches.

PaigeMiller
Diamond | Level 26

@FreelanceReinh wrote:

@Babloo: From your sample data it is not clear whether you mean the first and last ten or the smallest and largest ten values. Hence the confusion and different approaches.


Yes, @FreelanceReinh, very good point. This is a very poor example, or the words used to explain the example need to be clearer.

--
Paige Miller
novinosrin
Tourmaline | Level 20
data have;
input Id;
cards;
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
;

data want;
first_10=1;
last_10=nobs-9;
output;
do _n_=2 to nobs;
set have point=_n_ nobs=nobs;
first_10=_n_;
last_10=last_10+1;
if first_10>10 then stop;
output;
end;
drop id;
run;

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 10 replies
  • 3011 views
  • 0 likes
  • 6 in conversation