BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.

Good day my friends:

 

Based in laboratory research at 1, 12, 24 36 and 72 hours it has been collected 3 observations each hour:

 

data have;
input hour repetition concentration;
cards;
1         1       11.21
1         2       12.15
1         3       9.91
12       1       11.21
12       2       10.28
12       3       11.82
24       1       10.28
24       2       10.28
24       3       12.61
36       1       11.21
36       2       11.21
36       3       11.71
72       1       10.28
72       2       11.21
72       3       12.73

;

 

as you can see, in each hour can be obtained the mean ans standar deviation. For teaching purposes i need to increase the sample size to 10, 50, 100, 1000 and 10 000 repetitions, each sample size in diferent data sets. Thus, i need to create a new data set using aleatory data with the mean ans standar deviation from the original data set.

 

Thankyou very much.

1 ACCEPTED SOLUTION

Accepted Solutions
Reeza
Super User

Then if I understand your question correclty you're looking for something like the following:

 

data have;
input hour repetition concentration;
cards;
1         1       11.21 
1         2       12.15 
1         3       9.91 
12       1       11.21 
12       2       10.28 
12       3       11.82 
24       1       10.28 
24       2       10.28 
24       3       12.61 
36       1       11.21 
36       2       11.21 
36       3       11.71 
72       1       10.28 
72       2       11.21 
72       3       12.73
;

proc means data=have noprint nway;
class hour;
var concentration;
output out=stats mean=p1 std=p2;
run;

%let n_records = 1000;

data simulated;
call streaminit(24);
set stats;
do rep = 1 to &n_records;
conc_sim = rand('normal', p1, p2);
conc_sim = round(conc_sim, 0.01);
output;
end;
run;

proc means data=simulated noprint nway;
class hour p1 p2;
var conc_sim;
output out=check mean=p1_sim std=p2_sim;
run;


I used the old RNG though, you can look into the new ones and see if you want to use one of those instead:

https://blogs.sas.com/content/iml/2018/01/29/random-number-generators-sas.html

 

 

View solution in original post

3 REPLIES 3
Reeza
Super User

Do you know what the distribution of the data should follow? Because it's concentration it likely cannot go below zero, so a normal distribution may not be appropriate. Once you know the distribution you can use the mean/std to simulate your data. 

 

If you don't have enough data to determine the distribution, sometimes literature in your field will have the type of distribution and you can assume that. 

 

 

jonatan_velarde
Pyrite | Level 9
Thank you for the answer:

The data we need to simulate is normally distributed

thank you
Reeza
Super User

Then if I understand your question correclty you're looking for something like the following:

 

data have;
input hour repetition concentration;
cards;
1         1       11.21 
1         2       12.15 
1         3       9.91 
12       1       11.21 
12       2       10.28 
12       3       11.82 
24       1       10.28 
24       2       10.28 
24       3       12.61 
36       1       11.21 
36       2       11.21 
36       3       11.71 
72       1       10.28 
72       2       11.21 
72       3       12.73
;

proc means data=have noprint nway;
class hour;
var concentration;
output out=stats mean=p1 std=p2;
run;

%let n_records = 1000;

data simulated;
call streaminit(24);
set stats;
do rep = 1 to &n_records;
conc_sim = rand('normal', p1, p2);
conc_sim = round(conc_sim, 0.01);
output;
end;
run;

proc means data=simulated noprint nway;
class hour p1 p2;
var conc_sim;
output out=check mean=p1_sim std=p2_sim;
run;


I used the old RNG though, you can look into the new ones and see if you want to use one of those instead:

https://blogs.sas.com/content/iml/2018/01/29/random-number-generators-sas.html

 

 

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 3 replies
  • 879 views
  • 0 likes
  • 2 in conversation