BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
neilxu
Calcite | Level 5

If, for example, I have monthly sales number for different department (total 10 departments). Now I want to look at the trend of these sales based on departments. However 10 is too much so naturally clustering/grouping comes to my mind.

I saw the example of proc similarity to cluster time series and followed it to create the clusters. Now my question is I can use proc corr to group these departments, right? what is the benefit to do proc similarity?

Thanks

1 ACCEPTED SOLUTION

Accepted Solutions
ets_kps
SAS Employee

Hello,

I assume you are following this example SAS/ETS User's Guide Example Programs.

As you can see, after SIMILARITY gives you the similarity matrix, then you can cluster in the same way you would use cross sectional clustering routines.  Use PROC CORR, CLUSTER, whatever you wish.

Similarity has a number of utilities but all are related to temporal ordering.  Typical methods of clustering ignore the ordering.  In the time series version of this clustering we are looking for variables(series) that we can treat as a group, rather than observations that we treat as a group.  The SIMILARITY procedure effectively transposes this information (with some other tweaks) so the clustering can be done on the variables.  If you were to use clustering directly (a perfectly sensible practice for some uses) then you would effectively be looking for intervals that behave similarly.  This might be perfectly reasonable for some sort of time series segmentation but that is not what we are showing in this example. 

Hope this helps-Ken

View solution in original post

2 REPLIES 2
ets_kps
SAS Employee

Hello,

I assume you are following this example SAS/ETS User's Guide Example Programs.

As you can see, after SIMILARITY gives you the similarity matrix, then you can cluster in the same way you would use cross sectional clustering routines.  Use PROC CORR, CLUSTER, whatever you wish.

Similarity has a number of utilities but all are related to temporal ordering.  Typical methods of clustering ignore the ordering.  In the time series version of this clustering we are looking for variables(series) that we can treat as a group, rather than observations that we treat as a group.  The SIMILARITY procedure effectively transposes this information (with some other tweaks) so the clustering can be done on the variables.  If you were to use clustering directly (a perfectly sensible practice for some uses) then you would effectively be looking for intervals that behave similarly.  This might be perfectly reasonable for some sort of time series segmentation but that is not what we are showing in this example. 

Hope this helps-Ken

neilxu
Calcite | Level 5

Thank you so much.

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 3179 views
  • 1 like
  • 2 in conversation