<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Euclidean length option for Standardization method in Standardize Data Task (SAS Studio 3.5) in SAS Studio</title>
    <link>https://communities.sas.com/t5/SAS-Studio/Euclidean-length-option-for-Standardization-method-in/m-p/291024#M950</link>
    <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;While working with the Standardize Data Task in SAS Studio 3.5 I've come across 'Euclidean length' as a standardisation method. I understand how z-scores are obtained by subtracting the mean from each observation and dividing the result by the standard deviation. What is Euclidean length and how does it help in standardisation?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For example, if we are using the sashelp.baseball dataset, what would using the 'Euclidean length' method of standardisation for the 'nhits' (number of hits) variable do for us?&lt;/P&gt;</description>
    <pubDate>Thu, 11 Aug 2016 17:09:49 GMT</pubDate>
    <dc:creator>DataScientist</dc:creator>
    <dc:date>2016-08-11T17:09:49Z</dc:date>
    <item>
      <title>Euclidean length option for Standardization method in Standardize Data Task (SAS Studio 3.5)</title>
      <link>https://communities.sas.com/t5/SAS-Studio/Euclidean-length-option-for-Standardization-method-in/m-p/291024#M950</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;While working with the Standardize Data Task in SAS Studio 3.5 I've come across 'Euclidean length' as a standardisation method. I understand how z-scores are obtained by subtracting the mean from each observation and dividing the result by the standard deviation. What is Euclidean length and how does it help in standardisation?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For example, if we are using the sashelp.baseball dataset, what would using the 'Euclidean length' method of standardisation for the 'nhits' (number of hits) variable do for us?&lt;/P&gt;</description>
      <pubDate>Thu, 11 Aug 2016 17:09:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Studio/Euclidean-length-option-for-Standardization-method-in/m-p/291024#M950</guid>
      <dc:creator>DataScientist</dc:creator>
      <dc:date>2016-08-11T17:09:49Z</dc:date>
    </item>
    <item>
      <title>Re: Euclidean length option for Standardization method in Standardize Data Task (SAS Studio 3.5)</title>
      <link>https://communities.sas.com/t5/SAS-Studio/Euclidean-length-option-for-Standardization-method-in/m-p/291253#M954</link>
      <description>&lt;P&gt;A SAS Studio task generates SAS code, usually in the form of a call to a SAS procedure. &amp;nbsp;If you click on the&amp;nbsp;&lt;STRONG&gt;Code&lt;/STRONG&gt; tab, you can see the program. &amp;nbsp;In this case, the call is to PROC STDIZE and the METHOD=EUCLEN option is specified.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So the general way to answer the question "What does a task do?" is to&lt;/P&gt;
&lt;P&gt;1. Go to the &lt;A href="http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_whatsnew_toc.htm" target="_self"&gt;SAS/STAT User's Guide documentation.&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;2.&amp;nbsp;Scroll down and click on the&amp;nbsp;doc for the relevant&amp;nbsp;procedure.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For this question here is a link to&amp;nbsp;&lt;A href="http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_stdize_details01.htm" target="_self"&gt;the&amp;nbsp;formulas that are applied for each method.&lt;/A&gt;&amp;nbsp;For the EUCLEN option, the location is 0 and the scale is the Euclidean length of the variable:&lt;/P&gt;
&lt;P&gt;scale = sqrt(ssq(x)) = sqrt( x1**2 + x2**2 + ... + xN**2 ),&lt;/P&gt;
&lt;P&gt;where N is the number of observations in the sample.&lt;/P&gt;
&lt;P&gt;The new variable is therefore:&lt;/P&gt;
&lt;P&gt;X_New[i] = (X[i] - 0) / scale&lt;/P&gt;
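&lt;P&gt;As a rough numerical sketch of the formula above (plain Python rather than SAS; the three sample values are made up for illustration and are not taken from sashelp.baseball, and the names x, scale, x_new and new_length are hypothetical):&lt;/P&gt;
&lt;PRE&gt;
```python
import math

# Hypothetical sample values, chosen so the arithmetic is easy to follow
x = [3.0, 4.0, 12.0]

# Scale = Euclidean length of the variable: sqrt(x1**2 + x2**2 + ... + xN**2)
scale = math.sqrt(sum(v ** 2 for v in x))

# The location is 0, so each value is simply divided by the scale
x_new = [(v - 0) / scale for v in x]

# Euclidean length of the rescaled variable, which should be 1 up to rounding
new_length = math.sqrt(sum(v ** 2 for v in x_new))

print(scale)
print(x_new)
print(new_length)
```
&lt;/PRE&gt;
&lt;P&gt;With these values the sum of squares is 9 + 16 + 144 = 169, so the scale is 13 and the rescaled values are 3/13, 4/13 and 12/13, whose squares sum to 1.&lt;/P&gt;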
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The transformation has the property that the new variable has unit Euclidean length. Geometrically, you can think of the transformation as a projection onto the surface of the unit N-dimensional sphere. This transformation might be useful for spherically symmetric problems in which the angle that the observation makes with the origin is important.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Aug 2016 13:54:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Studio/Euclidean-length-option-for-Standardization-method-in/m-p/291253#M954</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2016-08-12T13:54:44Z</dc:date>
    </item>
    <item>
      <title>Re: Euclidean length option for Standardization method in Standardize Data Task (SAS Studio 3.5)</title>
      <link>https://communities.sas.com/t5/SAS-Studio/Euclidean-length-option-for-Standardization-method-in/m-p/291262#M955</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;, thanks very much for the detailed explanation. I will read through the documentation for PROC STDIZE for a better understanding.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Am I correct in assuming then that a transformation&amp;nbsp;using the Euclidean length would only be used for scientific / mathematical data and cannot be used in domains like marketing? If this is incorrect, would there be an example from the marketing / business domain that you can point me to in which this transformation is used to analyse data and&amp;nbsp;generate insight?&lt;/P&gt;</description>
      <pubDate>Fri, 12 Aug 2016 14:19:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Studio/Euclidean-length-option-for-Standardization-method-in/m-p/291262#M955</guid>
      <dc:creator>DataScientist</dc:creator>
      <dc:date>2016-08-12T14:19:45Z</dc:date>
    </item>
    <item>
      <title>Re: Euclidean length option for Standardization method in Standardize Data Task (SAS Studio 3.5)</title>
      <link>https://communities.sas.com/t5/SAS-Studio/Euclidean-length-option-for-Standardization-method-in/m-p/291269#M956</link>
      <description>&lt;P&gt;I am not familiar with&amp;nbsp;marketing, so I can't answer your question. However, I will say that METHOD=EUCLEN is more geeky/scientific than the more intuitive scaling by the standard deviation.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It's not that strange, though. If your data are centered, then the formula for the standard deviation is closely related to the Euclidean length. For centered data the sample standard deviation is sqrt( ssq(x) / (N-1) ), so the Euclidean length is sqrt(N-1) times the standard deviation.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Aug 2016 14:30:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Studio/Euclidean-length-option-for-Standardization-method-in/m-p/291269#M956</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2016-08-12T14:30:26Z</dc:date>
    </item>
  </channel>
</rss>

