<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Time Series Similarity Clustering in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19068#M97</link>
    <description>could u detail the clustering algorithm and process you used in your analysis?&lt;BR /&gt;
plain k-mean clustering on raw/simply stdized data?&lt;BR /&gt;
&lt;BR /&gt;
&amp;gt; By no means am I an expert on the matter.  I haven't&lt;BR /&gt;
&amp;gt; used PROC Similarity, so I can't comment about the&lt;BR /&gt;
&amp;gt; proc.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; A friend of mine told me once he was clustering a&lt;BR /&gt;
&amp;gt; number of series according to their measurements&lt;BR /&gt;
&amp;gt; across time.  That didn't sit right with me, so I&lt;BR /&gt;
&amp;gt; created some data to see how it would work.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; I started by creating several series of data with no&lt;BR /&gt;
&amp;gt; pattern across time, but with different averages.&lt;BR /&gt;
&amp;gt; Even with telling the software to standardize the&lt;BR /&gt;
&amp;gt; data, the series that had similar means were&lt;BR /&gt;
&amp;gt;  clustered together.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; I then created data with patterns across time, with&lt;BR /&gt;
&amp;gt; some series having the same pattern.  And those&lt;BR /&gt;
&amp;gt; series that had different patterns across time were&lt;BR /&gt;
&amp;gt; given similar means.  The clustering grouped the&lt;BR /&gt;
&amp;gt; series that had similar means, even though the&lt;BR /&gt;
&amp;gt; patterns across time were different.  Also, the&lt;BR /&gt;
&amp;gt; series that had similar patterns were not grouped,&lt;BR /&gt;
&amp;gt; because the means were different.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; I then took that same data and made all the means&lt;BR /&gt;
&amp;gt; equal.  When I did the clustering, the groups with&lt;BR /&gt;
&amp;gt; similar patterns were put together.  &lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; So, to make along story short, how do you quantify a&lt;BR /&gt;
&amp;gt; pattern across time, so you can cluster series that&lt;BR /&gt;
&amp;gt; are similar.  That's a hard question.  The patterns&lt;BR /&gt;
&amp;gt; have to be more obvious than differences in series&lt;BR /&gt;
&amp;gt; averages, or other things you don't want to cluster&lt;BR /&gt;
&amp;gt; on.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; I'm sure there's more to this, but I'm not educated&lt;BR /&gt;
&amp;gt; enough on the subject to comment futher.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; Message was edited by: jonathan@jmp</description>
    <pubDate>Sat, 10 Apr 2010 00:54:41 GMT</pubDate>
    <dc:creator>oloolo</dc:creator>
    <dc:date>2010-04-10T00:54:41Z</dc:date>
    <item>
      <title>Time Series Similarity Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19064#M93</link>
      <description>Hello Folks,&lt;BR /&gt;
&lt;BR /&gt;
Have any one tried using Proc Similarity or any other technique for clustering similar time series patterns.&lt;BR /&gt;
&lt;BR /&gt;
Thanks,&lt;BR /&gt;
Karteek</description>
      <pubDate>Wed, 18 Nov 2009 19:24:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19064#M93</guid>
      <dc:creator>Karteek</dc:creator>
      <dc:date>2009-11-18T19:24:09Z</dc:date>
    </item>
    <item>
      <title>Re: Time Series Similarity Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19065#M94</link>
      <description>In the M2009 conference, a paper was presented for Proc Similarity and Time Series Clustering, you may look at it: Mining Transactional and Time Series Data (Michael Leonard &amp;amp; Meredith John, SAS),</description>
      <pubDate>Fri, 20 Nov 2009 17:11:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19065#M94</guid>
      <dc:creator>tailee</dc:creator>
      <dc:date>2009-11-20T17:11:53Z</dc:date>
    </item>
    <item>
      <title>Re: Time Series Similarity Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19066#M95</link>
      <description>check my SCSUG 2009 paper&lt;BR /&gt;
SVD Filtered Temporal Usage Pattern Analysis and Clustering&lt;BR /&gt;
&lt;BR /&gt;
the main idea is SVD on a toeplitz matrix like the SSA technique used in Meteorology&lt;BR /&gt;
&lt;BR /&gt;
you can obtain a copy either at SCSUG.org or at my blog&lt;BR /&gt;
&lt;BR /&gt;
&lt;A href="http://www.sas-programming.com/2009/09/svd-filtered-temporal-usage-pattern.html" target="_blank"&gt;http://www.sas-programming.com/2009/09/svd-filtered-temporal-usage-pattern.html&lt;/A&gt;&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
&amp;gt; Hello Folks,&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; Have any one tried using Proc Similarity or any other&lt;BR /&gt;
&amp;gt; technique for clustering similar time series&lt;BR /&gt;
&amp;gt; patterns.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; Thanks,&lt;BR /&gt;
&amp;gt; Karteek

Message was edited by: oloolo</description>
      <pubDate>Wed, 07 Apr 2010 00:52:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19066#M95</guid>
      <dc:creator>oloolo</dc:creator>
      <dc:date>2010-04-07T00:52:50Z</dc:date>
    </item>
    <item>
      <title>Re: Time Series Similarity Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19067#M96</link>
      <description>By no means am I an expert on the matter.  I haven't used PROC Similarity, so I can't comment about the proc.&lt;BR /&gt;
&lt;BR /&gt;
A friend of mine told me once he was clustering a number of series according to their measurements across time.  That didn't sit right with me, so I created some data to see how it would work.&lt;BR /&gt;
&lt;BR /&gt;
I started by creating several series of data with no pattern across time, but with different averages.  Even with telling the software to standardize the data, the series that had similar means were clustered together.&lt;BR /&gt;
&lt;BR /&gt;
I then created data with patterns across time, with some series having the same pattern.  And those series that had different patterns across time were given similar means.  The clustering grouped the series that had similar means, even though the patterns across time were different.  Also, the series that had similar patterns were not grouped, because the means were different.&lt;BR /&gt;
&lt;BR /&gt;
I then took that same data and made all the means equal.  When I did the clustering, the groups with similar patterns were put together.  &lt;BR /&gt;
&lt;BR /&gt;
So, to make along story short, how do you quantify a pattern across time, so you can cluster series that are similar.  That's a hard question.  The patterns have to be more obvious than differences in series averages, or other things you don't want to cluster on.&lt;BR /&gt;
&lt;BR /&gt;
I'm sure there's more to this, but I'm not educated enough on the subject to comment futher.

Message was edited by: jonathan@jmp</description>
      <pubDate>Thu, 08 Apr 2010 15:29:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19067#M96</guid>
      <dc:creator>jonathan_jmp</dc:creator>
      <dc:date>2010-04-08T15:29:26Z</dc:date>
    </item>
    <item>
      <title>Re: Time Series Similarity Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19068#M97</link>
      <description>could u detail the clustering algorithm and process you used in your analysis?&lt;BR /&gt;
plain k-mean clustering on raw/simply stdized data?&lt;BR /&gt;
&lt;BR /&gt;
&amp;gt; By no means am I an expert on the matter.  I haven't&lt;BR /&gt;
&amp;gt; used PROC Similarity, so I can't comment about the&lt;BR /&gt;
&amp;gt; proc.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; A friend of mine told me once he was clustering a&lt;BR /&gt;
&amp;gt; number of series according to their measurements&lt;BR /&gt;
&amp;gt; across time.  That didn't sit right with me, so I&lt;BR /&gt;
&amp;gt; created some data to see how it would work.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; I started by creating several series of data with no&lt;BR /&gt;
&amp;gt; pattern across time, but with different averages.&lt;BR /&gt;
&amp;gt; Even with telling the software to standardize the&lt;BR /&gt;
&amp;gt; data, the series that had similar means were&lt;BR /&gt;
&amp;gt;  clustered together.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; I then created data with patterns across time, with&lt;BR /&gt;
&amp;gt; some series having the same pattern.  And those&lt;BR /&gt;
&amp;gt; series that had different patterns across time were&lt;BR /&gt;
&amp;gt; given similar means.  The clustering grouped the&lt;BR /&gt;
&amp;gt; series that had similar means, even though the&lt;BR /&gt;
&amp;gt; patterns across time were different.  Also, the&lt;BR /&gt;
&amp;gt; series that had similar patterns were not grouped,&lt;BR /&gt;
&amp;gt; because the means were different.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; I then took that same data and made all the means&lt;BR /&gt;
&amp;gt; equal.  When I did the clustering, the groups with&lt;BR /&gt;
&amp;gt; similar patterns were put together.  &lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; So, to make along story short, how do you quantify a&lt;BR /&gt;
&amp;gt; pattern across time, so you can cluster series that&lt;BR /&gt;
&amp;gt; are similar.  That's a hard question.  The patterns&lt;BR /&gt;
&amp;gt; have to be more obvious than differences in series&lt;BR /&gt;
&amp;gt; averages, or other things you don't want to cluster&lt;BR /&gt;
&amp;gt; on.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; I'm sure there's more to this, but I'm not educated&lt;BR /&gt;
&amp;gt; enough on the subject to comment futher.&lt;BR /&gt;
&amp;gt; &lt;BR /&gt;
&amp;gt; Message was edited by: jonathan@jmp</description>
      <pubDate>Sat, 10 Apr 2010 00:54:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19068#M97</guid>
      <dc:creator>oloolo</dc:creator>
      <dc:date>2010-04-10T00:54:41Z</dc:date>
    </item>
    <item>
      <title>Re: Time Series Similarity Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19069#M98</link>
      <description>It was nothing special.  I don't want to make more out of it than it was.  &lt;BR /&gt;
&lt;BR /&gt;
I just created several series of data, and tried to cluster the series, with their values across time used as the clustering variables.  I used JMP, and used the default Ward clustering.  I know there are other clustering methods, but I didn't try them.  JMP's clustering platform has a Standardize Data option.  I tried it with and without that option.</description>
      <pubDate>Mon, 12 Apr 2010 19:18:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Time-Series-Similarity-Clustering/m-p/19069#M98</guid>
      <dc:creator>jonathan_jmp</dc:creator>
      <dc:date>2010-04-12T19:18:21Z</dc:date>
    </item>
  </channel>
</rss>

