About SharonZS

SharonZS · ‎03-20-2019

Thank you both for your responses! Tremendously appreciated! Plots complete!

SharonZS · ‎03-19-2019

Sorry! I was just showing you what I'd already tried (tried plot (only) before posting...)! I just output the data set to read into PROC SGPLOT (which I've most unfortunately never used...). The two variables that I need to plot curves for have very different ranges (0 - 440,000) and (-200 - 60,000). Would you suggest customizing the bins somehow using PROC SURVEYMEANS before reading that data into PROC SGPLOT? I will next read chapters & white papers on PROC SGPLOT. Thank you again for your response!

SharonZS · ‎03-19-2019

Thank you for your response. When I run the syntax below, I do get only a histogram but the outlines for the kernel and normal densities are still retained and overlaid. I tried to remove them using "ods exclude.... " but haven't been able to get rid of them. Thank you for the tip about outputting my data into Proc SGPLOT. Didn't think I could use it b/c of complex sample design. Thank you! ods graphics on ; proc surveymeans data=fullsamp.Costs_All09 plots(only) = histogram; var acsum; weight final_wgt; strata strat; cluster psu; where _mult_=1; run; ods graphics off;

SharonZS · ‎03-19-2019

Hi, In order to create and save a permanent SAS data file in a library you'll want to use the following syntax: libname pdata 'your-data-library'; /*Path for where you want to store data set/where your data is located*/ data pdata.ds1; /* New permanent SAS data file stored in pdata library. */ set ds1; /* temporary SAS data file*/ run;

SharonZS · ‎03-19-2019

Hi All, I'm using Proc Surveymeans to plot several curves. I'd like to eliminate the kernel and normal density plots that overlay the histogram by default. Does anyone have an example of syntax for doing this? I have to use proc surveymeans b/c it's complex survey data. Many thanks in advance for any suggestions, syntax examples, or references you may have on how to do this! Shar

SharonZS · ‎01-11-2019

Thank you for your response! I'm going to study this and give it a try. PIPE was the first option I found when I was searching and trying to figure out how to do this.

SharonZS · ‎01-11-2019

Thank you!

SharonZS · ‎01-10-2019

Hi Reeza, Thank you for your response. I've been trying to do exactly that and can't seem to get the syntax right. Still trying to master working with data set names rather than variables in data sets. I'm trying to create a variable for the original rep number from the name of the data file and then sort by that. I think that I'm almost there! The data set of filenames is obviously being sorted, just not in in the order that I'd like. When I sort using the syntax below, I'm now getting: 101 for orep= 1; 102 for orep= 14; 103 for orep= 18; 104 for 2; 105 for 22. . It's sorting by cycling the first digits by the order of the second digit (cycling through all of the 1s in first position and then all the 2s...) rather than treating the number as a whole I want: 101 for orep=1; 102 for orep=2; 103 for orep=14; 104 for 22.... Here's the syntax that I've been using: Filename DIRLIST1 pipe 'dir "E:\AC estimates\causal_tables1_78c\Rep2Files\causaladj00rep2_*.sas7bdat" /b /O:N'; option noxwait noXSYNC; data dirlist1; infile dirlist1 length=len; input file_name $varying256. len; run; data dirlist1; set dirlist1; orep=substr(file_name,17,2); n=100+_n_; run; proc sort data=dirlist1; by orep; run; data _null_; set dirlist1; call system ('rename "E:\AC estimates\causal_tables1_78c\Rep2Files\'||trim(file_name)||'" causal3adj00_'||strip(n)||'.sas7bdat'); run; Filename DIRLIST1 clear;

SharonZS · ‎01-10-2019

Hi Learsaas, I've been using this SAS pipe syntax to renumber my data sets. Thank you! Works beautifully. The one detail that I haven't figured out is how to specify the sequence for the original numbers so that order is preserved in the new index. For example, in 1999 I might have original numbers: 3,5,7,27, 63 which I would like to retain in sequence so that my new data set names will be 101 for 3; 102 for 5; 103 for 7.... Is there a way to do this? Current seems to renumber them randomly. Will appreciate any suggestions that you or any one else may have. Thank you!

SharonZS · ‎11-30-2018

Thank you so much for the replies! So appreciated! I'm going to be studying all three but am going to go with Learsaas' use of the SAS pipe because I have to run this syntax 42+ times for multiple years of data and sets of outcomes. Very succinct and gets the job done. Many thanks to all, Sharon

SharonZS · ‎11-29-2018

Hi All, I'm renaming several sets of permanent SAS data sets using a SAS pipe -- which I'm still learning how to use... Each set of of data sets has a non-continuous index (e.g causal3adj99rep2_1, causal3adj99rep2_9, causal3adj99rep2_14...). I'm renaming the data sets with a continuous index that goes from 101 to 121 (causal3adj99_101 - causal3adj99_121). I used a SAS pipe to create a list of all of the permanent SAS data sets that I need to rename (outputs a data set list with all of the names of my specified permanent data sets). I then created my new names & index for each of the data sets. I now have no idea how to link the new dat set names to the permanent SAS data sets in my dirrectory. Does anyone hafve any suggestions or example syntax? Here's my syntax thus far: Filename DIRLIST1 pipe 'dir "E:\AC estimates\causal_matrix_rep2\causal_matrix_output7_Rep2\causal3adj99rep2_*.sas7bdat" /b '; /* Creates list of permanent data sets to be renamed*/ data dirlist1 ; infile dirlist1; input file_name $19.; run; data dirlist1; set dirlist1; /* Breaks names of data sets into 2 parts-- retaining first part and replacing second*/ firstpart=substr(file_name,1,12); strat=substr(file_name,18,2); stratnum=strat+0; run; proc sort data=dirlist1; by stratnum; run; data dirlist1; set dirlist1; num=_n_; index=100+ num; file_new=catx('_', firstpart, index); run; I'll be very appreciative of any suggestions anyone may have for how to best attach the new series of names to my permanent SAS data sets. Also wondering if this question might be better suited for one of the other forums? Many thanks!

SharonZS · ‎11-01-2018

The replicates are definitely the problem! I have over 1600+ of them (other secret detail that I didn't even mention is that in 2012 they changed the PSU variable so I have 525 replicates for that year alone!!). I may end up hard coding this...Ugh.. Thank you so much for your responses! Seriously the most useful feedback I've gotten in weeks! Couple thoughts I had after reading your responses. Is there a way to use the %GOTO or %RETURN to do this? For example, I've created variables for the number of PSUs in each stratum by year: psuYR equal to 2 or 3 for each stratum. %macro combdat (psuYR); %if &psuYR ne 3 %then %return; /*Only creates data set for strata having 3 PSUs*/ data final_estimatesYR; set estYR.final_estimates_rep2_&j; run; %mend combdat; Does this look like a direction worth pursuing? Would run for each year within each stratum number. The other piece that I thought might be useful that I do have is a list of strata with 3 PSUs created for each year: data stratlist; set strats; by psu strat; if psu = 3 then list = '&j='||trim(left(strat))||' or '; if last.psu and psu=3 then list= '&j='||trim(left(strat)); run; *creates macro variable; %global list1; %let list1=; data replist; Set stratlist; if psu = 3 then call symput("list1", trim(resolve('&list1'))||' '||trim(list)); run; %put list1=&list1; /* Need to redo this and add in a suffix for year */ I then use this list of strata having 3 PSUs to run the next set of computations for these replicates: %do j = 1 %to 100; %if &list1 %then %do; %let num = %eval(&num+1); Any useful way to use this?

SharonZS · ‎10-31-2018

Thank you so much for your responses! My apologies for any lack of clarity -- trying to be concise and missed! This is a long-standing project (10 year grant) and I'm revising and extending macros written by another statistician... Looking at changes in cost for 78 medical conditions from 1999-2012. The data sets that I'm trying to identify and merge are actually separate data sets containing cost estimates, prevalence, and coefficient estimates for each replicate (1-100 for reps and the non-continuous index for strata which contain a third PSU) for each year of observation. I'm merging the cost, prevalence and beta estimates to create a merged set of data sets to examine the extent to which changes in medical spending are attributable to changes in the cost of treatment versus changes in the prevalence of the health condition over time. All of the replicates for each year are in separate SAS libraries by year: est99.replicate2_&j est00.replicate2_&j The problem I have is that the second replicate for the strata having 3 PSUs isn't consistent for all years. I somehow need to create indices for both year and strata number. I've used indices for variables and imputed data sets in the same library (er.g imputed data sets) but don't know how to do this for data sets across libraries. I also don't know how to do this for a non-continuous index that changes across years.... First set of replicates was very straightforward (do j =1 to 100). My existing syntax is run by strata number (other statistician didn't make it this far to deal with this...). For example, I want to identify and merge together all of the cost estimates for strata 3 for the years when this strata had 3 PSUs. I've done the easy case where the strata has 3 PSUs for all years. I'm now trying to figure out how to do it for the remaining strata without doing incredibly laborious and error prone hard coding.... Existing code looked like this: The names here are the macro call are the data sets containing means for each replicate (final_est_replicate2_&j for all 13 years) I use another macro to run this for all years and strata... %macro cgar_from_year_a (year=,rep=,master=,coeff=,cost=, name1=,name2=,name3=,name4=,name5=,name6=,name7=,......name13= ); data final_estimates99; set est99.&name1; year = 1999; run; data final_estimates00; set est00.&name2; year = 2000; run; data final_estimates01; set est01.&name3; year = 2001; run; Goes up to 2012. Problem now is that SAS of course stops when it encounters a year/library that doesn't contain a data set for that replicate (strata that doesn't have a third PSU for that year). The existing syntax concatenates all of the cost data sets: Data attrib_cost_long; set final_estimates99 final_estimates00 final_estimates01 final_estimates02 final_estimates03 final_estimates04 final_estimates05 final_estimates06 final_estimates07 final_estimates08 final_estimates09 final_estimates10 final_estimates11 final_estimates12; run; I need to figure out how to get something similar when the replicate data sets only exist for a subset of years of observation. At this point the only variable for number of PSUs for strata per year exists in another data set that I made. The names of the data sets are really the only indicator as to whether there was a third PSU in that strata for the year (if rep2_&j exists in libraryYR). Thank you so much for your responses! I've been asking around at my institution and no one has any idea of what to do (I'm more of a modeler...).

SharonZS · ‎10-31-2018

I need to write a program to identify and selectively merge SAS data sets. I’m working with 13 years of survey data from 1999-2012 – each is a separate data set. Complex survey data – each year of data has 100 strata containing 2 or 3 PSUs (grouped PSUs). I’ve created separate replicate data sets for each PSU by strata (replicates for jackknife standard error estimation): have one for each of the strata having 2 PSUs and 2 for all strata having 3 PSUs. The number of strata having 3 PSUs varies by year of data collection (range: 13-31 strata/year have 3 PSUs). The strata ID numbers (1-100) for those having 3 PSUs aren’t a continuous series. For example, for one year strata numbers 3, 7, 9, 13, 21, 22 and 37 might have 3 PSUs. The replicate data sets are indexed by the strata number. So for example rep2_3, rep2_7, rep2_9…. would identify the replicate data sets for my example. I now need to write syntax to identify and merge these replicate data sets for strata having 3 PSUs across years of observation by strata index number. For example, Strata # 5 (Rep2_5) might have had 3 PSUs and now corresponding replicate data sets for 1999, 2001, 2002, 2007 and 2011. I want to find the rep2_5 data sets in folders for their respective years and concatenate them. Is this a situation for %GOTO? Any suggestions as to how I can use some sort of conditional programming and indices to identify and concatenate replicate data sets for each strata ID across years? Thank you so much for reading this! I’m truly flummoxed!

SharonZS · ‎08-23-2018

Thank you both! Both solutions give me the continuous index that I'm looking for --- but first solution is strictly referenced with respect to the calendar (e.g. month 2 begins when next calendar month after month of hire begins rather than 4 weeks after hire date) rather than a straight count index of number of weeks/months worked since date of hire (solution #2). Thank you for your help!

Online Status	Offline
Date Last Visited	‎03-19-2019 10:09 PM

Re: Proc Surveymeans: Suppressing the kernel & normal density plot ove...

Re: Proc Surveymeans: Suppressing the kernel & normal density plot ove...

Re: Proc Surveymeans: Suppressing the kernel & normal density plot ove...

Re: How to assign permanent lib and use SET statement..

Proc Surveymeans: Suppressing the kernel & normal density plot overlay...

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Conditional programming (macro?) to identify & merge data sets?

Re: Proc Surveymeans: Suppressing the kernel & normal density plot ove...

Re: Proc Surveymeans: Suppressing the kernel & normal density plot ove...

Re: Proc Surveymeans: Suppressing the kernel & normal density plot ove...

Re: How to assign permanent lib and use SET statement..

Proc Surveymeans: Suppressing the kernel & normal density plot overlay...

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Re: Renaming permanent data sets using SAS PIPE

Renaming permanent data sets using SAS PIPE

Re: Conditional programming (macro?) to identify & merge data sets?

Re: Conditional programming (macro?) to identify & merge data sets?

Conditional programming (macro?) to identify & merge data sets?

Re: INTCK? Incremental count variables for # of weeks/months since hir...