- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Hi
I need to do simulation analysis for survival study. I have one real clinical dataset with only 40 patients. need to create 1000 simulation samples.
Thanks
Accepted Solutions
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
I'm not aware of any quick user friendly references. I believe there's a few answered question on here about this and the topic is covered in Rick Wicklin's Simulating data book.
It also depends on if you're simulating via or just randomly repeating data. You could expand your 40 to 1000 using PROC SURVEYSELECT for example, obviously sample with repetition.
@HappySASUE wrote:
Hi
I need to do simulation analysis for survival study. I have one real clinical dataset with only 40 patients. need to create 1000 simulation samples.
Thanks
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
@HappySASUE wrote:
Hi
I need to do simulation analysis for survival study. I have one real clinical dataset with only 40 patients. need to create 1000 simulation samples.
Thanks
How would you create one simulation sample? And BTW, what sample size do you intend to generate?
I presume by "simulation" that you intend to start out with a sequence of hazard rates, then randomly apply those hazard rates to get period-by-period survivals, yes? What is the relevance of your clinical dataset? Is it the source of your hazard rates?
The hash OUTPUT method will overwrite a SAS data set, but not append. That can be costly. Consider voting for Add a HASH object method which would append a hash object to an existing SAS data set
Would enabling PROC SORT to simultaneously output multiple datasets be useful? Then vote for
Allow PROC SORT to output multiple datasets
--------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Thank you for your reply.
Yes the clinical dataset is the source of my hazard rates, also with independent variables, such as age, gender.... "Simulation" here I mean is to create samples which "mimic" the real clinical dataset. The sample size could vary, 30, 40, 400...etc.
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
I'm not aware of any quick user friendly references. I believe there's a few answered question on here about this and the topic is covered in Rick Wicklin's Simulating data book.
It also depends on if you're simulating via or just randomly repeating data. You could expand your 40 to 1000 using PROC SURVEYSELECT for example, obviously sample with repetition.
@HappySASUE wrote:
Hi
I need to do simulation analysis for survival study. I have one real clinical dataset with only 40 patients. need to create 1000 simulation samples.
Thanks
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Thank you. I will take a look of this book. This is a very useful book!