I am somewhat new to survey analysis. I am running code on a survey sample and want to report the standard deviation, but I am getting very large numbers compared with the unweighted code. I am pretty sure it is something simple that I am overlooking. Can anyone help?
proc means data=ms19.ms19 MEAN STD STDERR ALPHA=0.05 CLM;
VAR AGE;
RUN ;
SAS Output
Mean | Std Dev | Std Error | Lower 95% CL for Mean | Upper 95% CL for Mean |
45.8475175 | 13.9318244 | 0.1087396 | 45.6343762 | 46.0606588 |
proc surveymeans data=ms19.ms19 MEAN RMSSTD STDERR ALPHA=0.05 CLM;
strata strat; cluster PSU_ID; weight WEIGHT_PA_IPW_OVERALL;
var AGE;
run;
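SAS Output
Data Summary
Number of Strata | 20 |
Number of Clusters | 664 |
Number of Observations | 16415 |
Sum of Weights | 23413.2548 |
Statistics
Variable | Mean | Std Error of Mean | Lower 95% CL for Mean | Upper 95% CL for Mean | Std Dev |
Age | 39.867382 | 0.369889 | 39.1410467 | 40.5937167 | 40229 |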
My initial guess is that you have one or more AGE values that are impossibly high and that also have a very high value in the variable you are using to weight the analysis.
Art, CEO, AnalystFinder.com
Hi Art297,
Thank you for your help. I have read up on it, and it seems my code is not appropriate. The ages I have range from 45 to 75 years with no outliers, and my weight also adjusts for age.
It seems the code I need is something similar to this:
proc means data=xxx n MIN MAX MEAN STD STDERR ALPHA=0.05 CLM;
weight xxx;
BY xxx;
var AGE;
run;
SAS Output
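N | Minimum | Maximum | Mean | Std Dev | Std Error | Lower 95% CL for Mean | Upper 95% CL for Mean |
7675 | 45 | 75 | 56.50 | 6.63 | 0.09 | 56.31 | 56.68 |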
PROC SURVEYMEANS does not directly compute the standard deviation that goes with the mean; the standard deviation it reports is for the SUM (the estimated total).
You might find this SAS Note useful: http://support.sas.com/kb/45/701.html.
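As a rough sketch of one way to separate the two quantities (this is not taken from the SAS Note, and the choice of divisor is an assumption): PROC MEANS with a WEIGHT statement and VARDEF=WDF gives a weighted standard deviation of AGE itself, while PROC SURVEYMEANS supplies the design-based mean, standard error, and confidence limits. The data set and variable names are the ones from the original post. VARDEF=WDF divides by the sum of the weights minus one, which is only sensible if the weights are scaled the way you intend; VARDEF=WEIGHT is the other common choice.

```sas
/* Weighted standard deviation of AGE itself, not of the estimated total. */
/* VARDEF=WDF uses (sum of weights - 1) as the variance divisor.          */
/* STDERR and CLM are left out here because PROC MEANS only computes      */
/* them with the default VARDEF=DF.                                       */
proc means data=ms19.ms19 n mean std vardef=wdf;
   weight WEIGHT_PA_IPW_OVERALL;
   var AGE;
run;

/* Design-based mean, standard error, and 95% confidence limits. */
proc surveymeans data=ms19.ms19 mean stderr clm alpha=0.05;
   strata strat;
   cluster PSU_ID;
   weight WEIGHT_PA_IPW_OVERALL;
   var AGE;
run;
```

The standard deviation from the first step describes the spread of ages in the weighted sample, and the standard error from the second step describes the precision of the estimated mean under the survey design, so the two should be reported as different things.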