BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Lida
Obsidian | Level 7

Hello All, I am wondering whether it is possible to calculate weighted percentiles using proc means.

When I use this syntax, it seems only mean is weighed.

 

proc means data= sodiumdata2009 min max mean median p10 q1 q3 noprint;

by Level_1;

var sodp100g;

weight RKVol;

run;

 

Any suggestions?

Thank you!

 

1 ACCEPTED SOLUTION

Accepted Solutions
Reeza
Super User

The percentiles are affected as far as I can tell. 

Perhaps your data isn't affected by the weights for some reason?

If the weights are small?

 

data check;
set sashelp.class;
x=floor(rand('uniform')*8)+1;
run;

proc means data=check n p5 p10 p90 p95;
title 'no weights';
var weight;
run;

proc means data=check n p5 p10 p90 p95;
title 'weights';
var weight;
weight x;
run;

View solution in original post

14 REPLIES 14
overmar
Obsidian | Level 7

Why not use proc univariate? But I'm not quite sure what you are looking for because you included a noprint option. Are you trying to output this to another dataset, or look at it in the Results viewer, or something else entirely?

Lida
Obsidian | Level 7

In my code I have an output statement that I deleted when posted the code. Sorry for the confusion. of course I need to see the results. 🙂

 

Reeza
Super User

The percentiles are affected as far as I can tell. 

Perhaps your data isn't affected by the weights for some reason?

If the weights are small?

 

data check;
set sashelp.class;
x=floor(rand('uniform')*8)+1;
run;

proc means data=check n p5 p10 p90 p95;
title 'no weights';
var weight;
run;

proc means data=check n p5 p10 p90 p95;
title 'weights';
var weight;
weight x;
run;
Lida
Obsidian | Level 7

 

I compared weighted and unweighted stats and saw that only mean is affected. The other stats such as min to max did not change, so I thought maybe I am missing some option that would allow to obtain weighed min---max stats.

Lida
Obsidian | Level 7

Hi Reeza,

 

When I ran your code I saw the difference in only one value, so I went back to my data and did more checking. Same as in your example I now see the difference in a very few values, so I guess the weighting is working. I kind of expected to see all weighted values to be different from unweighted one. Problem solved! Thank you All for contribution!

 

 

ballardw
Super User

By any chance is your RKVol variable a COUNT type variable? If so then you may want to use FREQ instead of Weight.

That will in effect replicate each value of your variable RKVol times.

Lida
Obsidian | Level 7

It is not an integer

 

overmar
Obsidian | Level 7

So how about:

 


proc univariate data=sodiumdata2009 noprint;

    by Level_1
    var sodp100g;
    weight RKVol;
   output out=want min=min max=max mean=mean p10=p10 q1=q1 q3=q3;
run;

Lida
Obsidian | Level 7

I have the same  looking code:

proc means data=data2009.sodiumdata2009 min max mean median p10 q1 q3 noprint;

by Level_1;

var sodp100g;

weight RKVol;

output out=Weighted_2009data mean=mean_per100gr_sodium_mg2009 min=Min_per100gr_sodium_mg2009 median=median_per100gr_sodium_mg2009 p10=_10th_2009 q1=_25th_2009 q3=_75th_2009 max=Max_per100gr_sodium_mg2009;

run;

 

overmar
Obsidian | Level 7

So it doesn't look like the weight has an effect on the min and max, but that it does on everyother value.

 

proc univariate data=sashelp.bweight noprint;
    var Weight;
    weight MomWtGain;
    output out=goofy1 min=min max=max mean=mean p10=p10 q1=q1 q3=q3;
run;
proc univariate data=sashelp.bweight noprint;
    var Weight;
    output out=goofy2 min=min max=max mean=mean p10=p10 q1=q1 q3=q3;
run;

proc means data=sashelp.bweight noprint;
    var Weight;
    weight MomWtGain;
    output out=goofy3 min=min max=max mean=mean p10=p10 q1=q1 q3=q3;
run;
proc univariate data=sashelp.bweight noprint;
    var Weight;
    output out=goofy4 min=min max=max mean=mean p10=p10 q1=q1 q3=q3;
run;

proc sql;
create table goofy as
select distinct min, max, mean, p10, q1, q3, "Univariate no weight" as type from goofy2
union corr
select distinct min, max, mean, p10, q1, q3, "Univariate weighted" as type from goofy1
union corr
select distinct min, max, mean, p10, q1, q3, "Means no weight" as type from goofy4
union corr
select distinct min, max, mean, p10, q1, q3, "Means weighted" as type from goofy3;

 

However thats because a weight statement only makes that observation more valuable to the statistic, it doesn't actually change the real value of the variable. Hence the reason that the min and max never change, but the computed statistics do change. Oh and there is no difference between proc means and proc univariate outputs, thats what the above shows.

Lida
Obsidian | Level 7

thank you!

 

ballardw
Super User

Percentiles generally will not be effected by a weight value, though would by a freq, since they are ORDER statistics. The smallest stays the smallest, the largest the largest and the sort order of the variable does not change. 

 

overmar
Obsidian | Level 7

I wanted to look a bit farther into this, and this is what I found. https://communities.sas.com/t5/SAS-Procedures/Weighted-v-Unweighted-data-in-proc-means-and-proc-univ... so maybe don't use the standard deviation, but all of the other statistics that you have shouldn't really be affected too much.

overmar
Obsidian | Level 7

Oh and apparently include QMETHOD=OS in the proc statement.

http://support.sas.com/documentation/cdl/en/proc/61895/HTML/default/viewer.htm#a000146736.htm

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 14 replies
  • 5304 views
  • 1 like
  • 4 in conversation