BookmarkSubscribeRSS Feed
yuhhwa
Calcite | Level 5

in case of  multiple variables(+100 variables),the following statement is OK?

 

proc fmm data=XXX;
class XXX;←think separately about model?
model A=,B=C=,・・・・・・ = / k=X;
run;

2 REPLIES 2
Rick_SAS
SAS Super FREQ

Your statements are not valid syntax.

1. How does the classification variable enter the models?

2. The K= option requires a constant value, such as K=2 or K=3. 

3. You cannot use commas to separate models. You cannot put multiple models on one MODEL statement.

 

It is not clear what you are trying to accomplish, but here is a guess.

A. It looks like you might have many response variables named Y1, Y2, Y3, ..., Y100

B. It looks like you are trying to fit them all with a mixture of k Gaussian distributions.

 

If those assumptions are correct then you should convert your data from wide to long format. Let Y be the name of the long variable that contains the values for variables Y1, ..., Y100 and let B be an indicator variable that indicates which observations correspond to which variables. Then you can use syntax such as the following:

 

proc fmm data=LongData plots=none;
   by B;
   model Y= / k=2;
run;

Almost surely you do not want 100 pages of output, so you should suppress ODS output and write the results (parameter estimates?) to a data set. You should study the techniques in the following articles:

Simulation in SAS: The slow way or the BY way

Turn off ODS when running simulations in SAS

An easy way to run thousands of regressions in SAS

 

 

yuhhwa
Calcite | Level 5

thank you for replying.

 

>>It is not clear what you are trying to accomplish, but here is a guess.

A. It looks like you might have many response variables named Y1, Y2, Y3, ..., Y100

B. It looks like you are trying to fit them all with a mixture of k Gaussian distributions.

 

A and B is  right, but I want to cluster combination of multiple variables as well as k-means.

 

   proc fastclus data=stan out=clust maxclusters=3;
      var v1 v2 v3;
   run;

I think the way you taught me is to cluster variables separately. Is it right?

 

if you are OK,please tell me  how shoud I wright the statement Gaussian mixture model  compared with k-means.

 

thank you in advance.

 

 

 

 

 

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1143 views
  • 0 likes
  • 2 in conversation