About MetinBulus

MetinBulus · ‎03-25-2016

thanks a lot Xia!

MetinBulus · ‎03-25-2016

Xia, yes, I forgot to put a summation sign. weighted_mean_for_group_0 = (1/0.86 )*6/8 + (1/0.87)*7/8 + (1/0.75)*3/8 +.....+(1/0.30)*7/8

MetinBulus · ‎03-25-2016

Hi everyone, I have a table similiar to this: g p1 p2 p3 p4 p5 x 0 0.80 0.86 0.19 0.01 0.96 6 0 0.15 0.57 0.87 0.36 0.03 7 0 0.41 0.75 0.33 0.39 0.27 3 0 0.38 0.88 0.53 0.49 0.17 5 0 0.14 0.32 0.09 0.35 0.63 9 0 0.22 0.49 0.32 0.58 0.96 2 0 0.48 0.33 0.98 0.19 0.42 8 0 0.30 0.88 0.87 0.84 0.31 7 1 0.32 0.31 0.10 0.08 0.36 10 1 0.85 0.83 0.30 0.40 0.97 8 1 0.45 0.85 0.98 0.43 0.16 7 1 0.29 0.47 0.53 0.10 0.13 11 1 0.84 0.58 0.95 0.43 0.81 5 1 0.28 0.18 0.06 0.74 0.95 24 1 0.08 0.37 0.97 0.17 0.88 9 1 0.92 1.00 0.47 0.27 0.82 4 For each group I will have a weighted mean (1/pi)*x/8. I would like to minimize weighted mean difference between group 1 and 0, by selecting one pi from each row. Any idea, help will appreciated. Note: Bold numbers are to show the idea and were not obtained using a minimization technique.

MetinBulus · ‎02-29-2016

Thanks Rick! Modeling the data based on (event='0') or (event='1') shouldn't make a difference, as it only changes the sign. I also could not induce mean difference on x1 or x2, between t=1 and t=0 groups. But the code suprisingly works using SAS university edition on my laptop, whereas it consistently underestimated coefficients almost by half with no mean difference induced on my office PC. That's weird, SAS on the PC may need some update, I believe it was v9.2!

MetinBulus · ‎02-29-2016

Hi all, in the code below I would like to simulate a data based on logistic regression model. My primary goal is to create the simluated data and retreive the coefficinets on the log scale, making sure predictor means by the binary outcome variable differ by, say ~ 0.5 standard deviation. When I employ proc logistic on the simulated data coefficients are off on the log scale. In addition trying many samples, I found the mean of the predictor by the binary outcome are very close to each other regardless of the magnitude of beta coefficients. Something is wrong, and I can't seem to pinpoint the problem. Please help. %let N=200; proc iml; t = J(&N, 1); X = J(&N, 2); call randseed(4321); call RANDGEN(X, "NORMAL", 0, 1); beta = {1.40, -0.60, -0.40}; Xb = J(&N,1,1)||X; eta = Xb*beta; mu = LOGISTIC(eta); call RANDGEN(t, "BERNOULLI", mu); tempdata = t||x; create logdata from tempdata[colname={'t' 'x1' 'x2'}]; append from tempdata; close logdata; quit; proc logistic data=logdata; model t = x1 x2; run; proc means data = logdata; class t; var x1 x2; run;

MetinBulus · ‎12-04-2015

Hi everyone, I have a question regarding randomly subsetting the data based on predetermined unequal probabilites. Is there any technique or SAS procude that would help me accopmplish this? For example I have a data with 1000 subjects, and subjects are assigned probabilties that varies across individuals. Based on these probailities I would like to randomly select 100 subjects. How would I do this?

MetinBulus · ‎10-26-2015

Any similar graph will do if it works. The data may not include the arbitrary x axis value and the predicted value on y axis.

MetinBulus · ‎10-26-2015

Hi all, I need your help to overcome the obstacle below. The graph is from PROC LOESS procedure. I want to find and plot the predicted value on x axis for a given value on x axis. Is there a way to do that in PROC LOESS procedure or using other plotting techniques?

MetinBulus · ‎10-10-2015

I have the regression coefficient estimated ten times for example, each estimation has a standard error associated with it. Using Proc means or Univariate won't take into account standard errors, as far as I know. I don't know how to come up with a single value for ten coefficients, taking into account their standard errors.

MetinBulus · ‎10-09-2015

Hi everyone, I need your help to analyze a GLIMMIX parameter estimates output using PROC MIANALYZE. The output is created by group, and for each group treatment effect is estimated with its standard error. Basically I want to pool estimates from multiple simulated datasets. Unlike PROC MI procedure imputed values are not variables but they are estimates. Thanks.

MetinBulus · ‎09-30-2015

Thank you Rick, DO WHILE did the trick!

MetinBulus · ‎09-30-2015

Hi Ryan please see my message below.

MetinBulus · ‎09-30-2015

Hi Paige, please see my message below.

MetinBulus · ‎09-30-2015

Here is the code to generate data. The problem arises during multinomial logistic regression data generation process. Code snippet below is to generate the the categorical outcome from multinomial logistics regression model. This is to be used for treatment status. ** generate treatment status; call RANDGEN(t, "TABLE", p); However, simulating 1000 samples most often result in some categories with no values which result in error during dummy code transformation. I was trying to find a way to skip these samples which stall the program, and begin next cycle. If there are 950 samples out 1000 it is fine. But for now, sometimes the error happen to be with the 10th sample and cannot go beyond that. %let S=1211; %let NumSamples = 100; %let N=1200; %let beta11 = log(3); %let beta12 = log(4); %let beta13 = 0; %let beta21 = log(9/10); %let beta22 = log(10/9); %let beta23 = 0; %let alpha0 = 0; %let alpha1 = 0.2; %let alpha2 = 0.4; %let alphaX = 0.2; %let alphaX3 = 0.1; ** simulate data; proc iml; ** assign variable names and allocate space for the data and parameters; varNamesData={SampleID x x3 t t1 t2 y}; TempSimData = J(&N, NCOL(varNamesData)); x = J(&N, 1); t = J(&N, 1); p = J(&N, 3); t1 = J(&N, 1, 0); t2 = J(&N, 1, 0); t3 = J(&N, 1, 0); y = J(&N,1); epsilon = J(&N, 1); TempSimData = J(&N, NCOL(varNamesData)); create SimData from TempSimData[c=varNamesData]; ** simulation loop; do SampleID = 1 to &NumSamples; call RANDSEED(0); call RANDGEN(x, "NORMAL", 1, 1); ** calculate the qubic term; x3 = x##3; beta01 = -(&beta11 * 1 + &beta21 * 4); beta02 = -(&beta12 * 1 + &beta22 * 4); ** define linear equations; eta13 = beta01 + &beta11 * x + &beta21 * x3; *T=1 vs T=3; eta23 = beta02 + &beta12 * x + &beta22 * x3; *T=2 vs T=3; *eta33 = 0 + 0 * x + 0*x3;; *T=3 vs T=3; ** find actual probabilities for subjects to be in each treatment level; pi1 = exp(eta13) / (exp(eta13) + exp(eta23) + 1); pi2 = exp(eta23) / (exp(eta13) + exp(eta23) + 1); pi3 = 1 / (exp(eta13) + exp(eta23) + 1); ** fill the probability matrix from pi1, pi2, and pi3; p[,1] = pi1; p[,2] = pi2; p[,3] = pi3; ** generate treatment status; call RANDGEN(t, "TABLE", p); idx1 = LOC(t=1); idx2 = LOC(t=2); idx3 = LOC(t=3); * create dummy variables for treatment levels; if NCOL(idx1)>0 then t1[idx1]=1; else print "No observations in level 1"; if NCOL(idx2)>0 then t2[idx2]=1; else print "No observations in level 2"; if NCOL(idx3)>0 then t3[idx3]=1; else print "No observations in level 3"; ** generate residuals; call RANDGEN(epsilon, "NORMAL", 0, .5); ** generate y; y = &alpha0 + &alpha1*t1 + &alpha2*t2 + &alphaX*x + &alphaX3*x3 + epsilon; ** create a temporary simulated data for each simulation loop; TempSimData[,1] = SampleID; TempSimData[,2] = x; TempSimData[,3] = x3; TempSimData[,4] = t; TempSimData[,5] = t1; TempSimData[,6] = t2; TempSimData[,7] = y; setout SimData; append from TempSimData; end; close SimData; quit;

MetinBulus · ‎09-30-2015

I am trying to accomplish the following in a simulation study in IML: Some of the samples drawn are bad samples that generates error and aborts the program. How can I stop where there is a possibility of error and return to the begining of loop and start the next iteration? Thanks.

Online Status	Offline
Date Last Visited	‎03-13-2022 04:52 PM

Re: Estimating random effect when there exist none (simulation)

Re: Estimating random effect when there exist none (simulation)

Estimating random effect when there exist none (simulation)

Re: Compile a macro from GitHub gist

Re: Compile a macro from GitHub gist

Compile a macro from GitHub gist

Re: Automatic generation of all possible interaction terms (no duplica...

Re: Automatic generation of all possible interaction terms (no duplica...

Re: Automatic generation of all possible interaction terms (no duplica...

Re: Automatic generation of all possible interaction terms (no duplica...

Re: Compile a macro from GitHub gist

Re: Automatic generation of all possible interaction terms (no duplica...

Re: Automatic generation of all possible interaction terms (no duplica...

Re: Automatic generation of all possible interaction terms (no duplica...

Re: Minimization challenge

Pooling results from multiple GLIMMIX output

Re: Minimization challenge

Re: Minimization challenge

Minimization challenge

Re: Logistic regression simulation - inducing mean difference

Logistic regression simulation - inducing mean difference

Random selection with unequal probabilities

Re: Plotting predicted values on y axis given a value on x axis

Plotting predicted values on y axis given a value on x axis

Re: Pooling results from multiple GLIMMIX output

Pooling results from multiple GLIMMIX output

Re: Skipping to the next loop in IML do loop

Re: Skipping to the next loop in IML do loop

Re: Skipping to the next loop in IML do loop

Re: Skipping to the next loop in IML do loop

Skipping to the next loop in IML do loop