Solved: Repeated measures with no subject

palolix · Posted 08-14-2024 07:57 PM

Dear SAS Community,

I have some data that I want to anlalyze as repeated measures. Since I dont have blocks I didnt include the random intercept/subject= line in my model so I just want to make sure that the model Im using is correct. I would greatly appreciate your input.

Harvest is the date the fruits were harvested (2 dates), and what I want to use as repeated measures. Variety is the fruit variety (3 different varieties), and weeks are the number of weeks the fruits were storaged in a coldroom before the measurements like FirmAvg were assesed (after 1 week, 3 weeks, and 6 weeks).

proc glimmix data=one;

class harvest variety weeks;

model FirmAvg=harvest|variety|weeks/dist=lognormal ddfm=km;

random harvest/ residual type=cs;

run;

I would greatly appreciate your feedback.

Thank you very much

Caroline

Ksharp · Posted 08-20-2024 10:46 PM

Yes. You could use PROC CATMOD to do ANOVA.
Use PROC CATMOD or PROC LOGISTIC to do logistic model (either ordinal or non-ordinal data.).

View solution in original post

jiltao · Posted 08-15-2024 09:33 AM

The syntax looks reasonable to me.

Thanks,

Jill

jiltao · Posted 08-15-2024 09:36 AM

Actually, I just noticed that you do not have subject. I wonder how your data looks like. Can you post some sample data? I am not sure why there is no subject...

palolix · Posted 08-15-2024 11:57 AM

Hi Jiltao,

Thank you very much for your reply. Since I dont have blocks, that was my concern, not having the intercept/subject line in my model. I dont know what my subject should be and if I have one. I attached an excel sheet with the data that shows the independent var: harvest date, variety, and weeks in storage, and the dep var storage firm avg. To give you some context, the fruits (different varieties) were harvested at two different dates (day 1 and day 55), and stored in a coldroom during 1, 3, and 6 weeks. After that, they were assesed for firmness (firm avg). So I want to use harvest date as repeated measures, to see if the harvest date has an effect on the fruit firmness. I also want to know the effect of the variety and weeks in storage on the firmness.

Thank you very much!

Caroline

jiltao · Posted 08-15-2024 03:38 PM

If you cannot identify subjects, that is, for the first 5 observations for Variety BL516 at wk1 --

They might not be the same subjects for the 5 observations appearing later for BL516 for wk 3 -- then you might not have repeated measures data. Just fit an ANOVA model.

If you can say that the first observation in the data (BL516 for wk1) is the same subject as the first BL516 for wk3, and so on so forth for other observations, then you have repeated measures data, and you can add a variable such as ID to identify these subjects. Then you would have a repeated measures data.

Hope this helps,

Jill

palolix · Posted 08-15-2024 11:03 PM

Thank you so much for your reply! Strictly speaking, the first 5 fruits for variety BL516 at week 1 are not the same fruits I used to do the measures at week 3, and so forth, but since the variety is the same for week1, 3, and 6, what if I treat variety as the subject?

Thanks

Caroline

Ksharp · Posted 08-15-2024 09:53 PM

If you don't have SUBJECT= cluster variable,that means you don't have RANDOM effect(a.k.a it is not MIXED model).
I guess you can fix GLM model instead.

proc genmod data=one;
class harvest variety weeks;
..............
or
proc glm data=one;
class harvest variety weeks;
..........

palolix · Posted 08-16-2024 12:04 AM

Thank you very much for your reply. Im thinking that maybe I could treat variety as subject? If not, I guess it woud be best to use genmod if my data has only fixed effects, and especially if some of my dependent variables can only take values that go on a scale from 1 to 5. That way I could use a beta or binomial distribution, right?

Thanks!

Caroline

Ksharp · Posted 08-16-2024 02:53 AM

If your dependent variable Y is always greater than zero, you could try GAMMA distribution.

palolix · Posted 08-16-2024 01:40 PM

Thank you Ksharp! So do think the gamma distribution will be more appropiate for values of 1 to 5 with half values like 1.5?

Thank you!

Caroline

Ksharp · Posted 08-17-2024 02:38 AM

Nope. Gamma distribution is for any positive Y value,no matter it is 1 or 1.5 .
And as Jiltao said if your Y has only 1-5 five value,you could also try fit a logistic/multinomial / ordinal regression.

proc logistic ....
model y=...../ link=clogit;
run;

palolix · Posted 08-19-2024 12:38 PM

Ok, thank you so much for your reply! What if my dependent var is an ordinal scale that goes from 0 to 5 but where I also have half values of 1.5 for example? Can I still use the multinomial distribution?

Thank you!

Ksharp · Posted 08-19-2024 09:43 PM

Yes. you can.

/*y has value 1 ,1.5, 2,3*/
proc logistic ....
model y=...../ link=clogit;
run;

I noticed that your data could be formed by contingency table, you also could try PROC CATMOD to do Ordianl Logtisc Model.

palolix · Posted 08-20-2024 02:19 PM

Thank you so much for your great suggestions!

So since I have different types of response variables, the most appropiate procedures will be as follow:

For continuous (measures) I would use proc genmod (normal or gamma dist).

For binary (0/1 or 0/100) I would use proc logistic.

For multinomial (scales; 0, 1, 1.5, 2, 2.5, 3) I would use proc logistic.

For ordinal (fruit browning; 0=no browning, 1=mild, 2=severe) I would use proc catmod (is this procedure only for ordinal data?).

Does it sounds reasonably to you? Also, how can I specify in proc logistic if my response var is binary or multinomial?

Thank you very much!

Caroline

Ksharp · Posted 08-20-2024 09:16 PM

"For multinomial (scales; 0, 1, 1.5, 2, 2.5, 3) I would use proc logistic.
For ordinal (fruit browning; 0=no browning, 1=mild, 2=severe) I would use proc catmod (is this procedure only for ordinal data?)."

Multinomial logistic model has two type of model.
One is for non-ordianl variable like : color: red,green,blue , you need to use link function GLOGIT.
/*y is non-ordinal*/
proc logistic ....
model color=...../ link=glogit;
run;

Another is for ordianl variable like : scale: 1,2,3, you need to use link function CLOGIT.
/*y is ordinal*/
proc logistic ....
model scale=...../ link=clogit;
run;

" I would use proc catmod (is this procedure only for ordinal data?)."
Nope. PROC CATMOD is specially designed for contingency table.
Sine your data can be formed by contingency table,you can use PROC CATMOD to model your data for LOGISTIC model or ANOVA model (check the code I showed above).

"how can I specify in proc logistic if my response var is binary or multinomial?"
That is easy ,just don't specify LINK function,SAS would automatically check Y and suite the right model for you. Like:
proc logistic ....
model y=.....;
run;

You will find out which model you are using in OUTPUT.

BTW,

If Y is binary variable like: 0,1 you need specify LINK=logit :

/*y is binary variable*/
proc logistic ....
model scale=...../ link=logit;
run;

Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

Re: Repeated measures with no subject

SAS Innovate 2025: Call for Content