<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Correlations between Input and Targetvariables in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27177#M139</link>
    <description>Hi there,&lt;BR /&gt;
&lt;BR /&gt;
First of all, hello to the SAS Community. I'm new to the data mining business and am working with SAS Enterprise Miner 6.1.&lt;BR /&gt;
&lt;BR /&gt;
I want to find out how strongly the monthly temperature (degrees Celsius) in Germany correlates with the monthly sales output (in tons) of a specific food product.&lt;BR /&gt;
I know that many more parameters influence the sales output, but to begin with I only want to find out how strong the influence of temperature is.&lt;BR /&gt;
&lt;BR /&gt;
The result I want looks like this: if the monthly temperature rises by 1 degree Celsius, the sales output rises by between 1.3 and 1.9 tons.&lt;BR /&gt;
&lt;BR /&gt;
My intention is to have a "temperature adjustment" for forecasting the monthly sales output.&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
My current approach is to take historical sales output and temperature data (in an Excel file) and import it into Enterprise Miner with the File Import node. Then I correct some "critical" sales output values (using the Replacement and Impute nodes) and run the Regression node. But the result isn't very satisfying.&lt;BR /&gt;
&lt;BR /&gt;
So do you have any tips and tricks for me to get a better result?&lt;BR /&gt;
&lt;BR /&gt;
PS: I know that there is a correlation between the temperature and the sales output, but I don't know how strong it is.&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
Thanks&lt;BR /&gt;
BlackCan</description>
    <pubDate>Wed, 03 Nov 2010 08:39:53 GMT</pubDate>
    <dc:creator>BlackCan</dc:creator>
    <dc:date>2010-11-03T08:39:53Z</dc:date>
    <item>
      <title>Correlations between Input and Targetvariables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27177#M139</link>
      <description>Hi there,&lt;BR /&gt;
&lt;BR /&gt;
First of all, hello to the SAS Community. I'm new to the data mining business and am working with SAS Enterprise Miner 6.1.&lt;BR /&gt;
&lt;BR /&gt;
I want to find out how strongly the monthly temperature (degrees Celsius) in Germany correlates with the monthly sales output (in tons) of a specific food product.&lt;BR /&gt;
I know that many more parameters influence the sales output, but to begin with I only want to find out how strong the influence of temperature is.&lt;BR /&gt;
&lt;BR /&gt;
The result I want looks like this: if the monthly temperature rises by 1 degree Celsius, the sales output rises by between 1.3 and 1.9 tons.&lt;BR /&gt;
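&lt;BR /&gt;
A statement of this form corresponds to the slope of a straight-line fit together with a confidence interval. A minimal Python sketch, assuming one temperature and one sales value per month; the function name, the sample numbers, and the normal approximation used for the interval are illustrative assumptions, not Enterprise Miner output:&lt;BR /&gt;

```python
import math
from statistics import NormalDist, mean


def slope_with_interval(temps, sales, level=0.95):
    """Fit sales = a + b * temp by least squares.

    Returns the slope b and an approximate confidence interval for it,
    using a normal quantile (an assumption; a t quantile is more exact).
    """
    n = len(temps)
    t_bar, s_bar = mean(temps), mean(sales)
    sxx = sum((t - t_bar) ** 2 for t in temps)
    sxy = sum((t - t_bar) * (s - s_bar) for t, s in zip(temps, sales))
    b = sxy / sxx                      # tons per degree Celsius
    a = s_bar - b * t_bar              # intercept
    sse = sum((s - (a + b * t)) ** 2 for t, s in zip(temps, sales))
    se_b = math.sqrt(sse / (n - 2) / sxx)   # standard error of the slope
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return b, b - z * se_b, b + z * se_b


# Made-up monthly values, for illustration only.
temps = [1.0, 3.0, 6.0, 10.0, 14.0, 17.0, 19.0, 18.0, 15.0, 10.0, 5.0, 2.0]
sales = [92.0, 97.5, 99.0, 108.0, 113.5, 117.0, 122.5, 119.0, 114.0, 106.5, 98.0, 95.5]

b, low, high = slope_with_interval(temps, sales)
print(f"slope {b:.2f} tons per degree, interval {low:.2f} to {high:.2f}")
```

With only a few years of monthly data, a t-based interval would be somewhat wider than this normal approximation.&lt;BR /&gt;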
&lt;BR /&gt;
My intention is to have a "temperature adjustment" for forecasting the monthly sales output.&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
My current approach is to take historical sales output and temperature data (in an Excel file) and import it into Enterprise Miner with the File Import node. Then I correct some "critical" sales output values (using the Replacement and Impute nodes) and run the Regression node. But the result isn't very satisfying.&lt;BR /&gt;
&lt;BR /&gt;
So do you have any tips and tricks for me to get a better result?&lt;BR /&gt;
&lt;BR /&gt;
PS: I know that there is a correlation between the temperature and the sales output, but I don't know how strong it is.&lt;BR /&gt;
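&lt;BR /&gt;
One common way to put a number on that strength is the Pearson correlation coefficient. A small Python sketch, with a hypothetical helper name and made-up values:&lt;BR /&gt;

```python
import math
from statistics import mean


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Made-up monthly values, for illustration only.
temps = [1.0, 3.0, 6.0, 10.0, 14.0, 17.0, 19.0, 18.0, 15.0, 10.0, 5.0, 2.0]
sales = [92.0, 97.5, 99.0, 108.0, 113.5, 117.0, 122.5, 119.0, 114.0, 106.5, 98.0, 95.5]

r = pearson_r(temps, sales)
print(f"r = {r:.3f}, r squared = {r * r:.3f}")
```

The squared value is the share of the variance in sales that a straight-line relationship with temperature accounts for.&lt;BR /&gt;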
&lt;BR /&gt;
&lt;BR /&gt;
Thanks&lt;BR /&gt;
BlackCan</description>
      <pubDate>Wed, 03 Nov 2010 08:39:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27177#M139</guid>
      <dc:creator>BlackCan</dc:creator>
      <dc:date>2010-11-03T08:39:53Z</dc:date>
    </item>
    <item>
      <title>Re: Correlations between Input and Targetvariables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27178#M140</link>
      <description>Hi there. Can you tell us more about what you mean by the result not being satisfactory: poor model fit, an unacceptable interpretation, a bad prediction for a 1-degree change, or something else?</description>
      <pubDate>Wed, 03 Nov 2010 13:27:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27178#M140</guid>
      <dc:creator>David_Duling</dc:creator>
      <dc:date>2010-11-03T13:27:26Z</dc:date>
    </item>
    <item>
      <title>Re: Correlations between Input and Targetvariables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27179#M141</link>
      <description>The result is not satisfactory because my Average Squared Error is 10996.45 for the training data and 12450.12 for the validation data.&lt;BR /&gt;
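&lt;BR /&gt;
Average Squared Error is expressed in squared units of the target, so its square root is easier to judge. A small Python sketch, assuming the reported values are means of squared residuals on a sales target measured in tons (the helper name is hypothetical):&lt;BR /&gt;

```python
import math


def average_squared_error(actual, predicted):
    """Mean of the squared differences between actual and predicted values."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


# Values quoted in the post.
train_ase = 10996.45
valid_ase = 12450.12

print(round(math.sqrt(train_ase), 1))  # 104.9
print(round(math.sqrt(valid_ase), 1))  # 111.6
```

Whether a typical error of roughly 105 tons is large depends on the usual monthly sales volume, which is not stated here.&lt;BR /&gt;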
&lt;BR /&gt;
My properties for the regression node are:&lt;BR /&gt;
Regression Type: Logistic Regression&lt;BR /&gt;
Link Function: Logit&lt;BR /&gt;
...&lt;BR /&gt;
Selection Model: Stepwise&lt;BR /&gt;
Selection Criterion: Validation Error&lt;BR /&gt;
&lt;BR /&gt;
By the way: do I have to partition my data? At the moment I split it into 50% training and 50% validation data. The more I think about it, the more I believe I don't have to...&lt;BR /&gt;
&lt;BR /&gt;
PS: I'm missing information such as Odds Ratio Estimates in the regression output. How do I get Enterprise Miner to produce it? Is it missing because I import the data with the File Import node rather than through a data source?&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
I'm sorry for asking so many questions 😕</description>
      <pubDate>Wed, 03 Nov 2010 14:36:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27179#M141</guid>
      <dc:creator>BlackCan</dc:creator>
      <dc:date>2010-11-03T14:36:43Z</dc:date>
    </item>
    <item>
      <title>Re: Correlations between Input and Targetvariables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27180#M142</link>
      <description>Regarding the data source and the odds ratios: the answer is no. Enterprise Miner uses PROC DMREG rather than PROC LOGISTIC or PROC REG, and DMREG produces a somewhat different set of output statistics, chosen for data mining needs such as scalability.&lt;BR /&gt;
&lt;BR /&gt;
Regarding the model fit, it will of course depend on your data and your function. I suggest you call tech support and ask for assistance; they would probably enjoy that interaction.</description>
      <pubDate>Mon, 08 Nov 2010 15:52:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27180#M142</guid>
      <dc:creator>David_Duling</dc:creator>
      <dc:date>2010-11-08T15:52:11Z</dc:date>
    </item>
    <item>
      <title>Re: Correlations between Input and Targetvariables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27181#M143</link>
      <description>Hello BlackCan,&lt;BR /&gt;
&lt;BR /&gt;
You can use PROC NPAR1WAY to test the correlation between your target variable and each predictor variable, and to find the best predictor before moving to EM.&lt;BR /&gt;
&lt;BR /&gt;
Sincerely,&lt;BR /&gt;
SPR</description>
      <pubDate>Fri, 19 Nov 2010 14:37:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27181#M143</guid>
      <dc:creator>SPR</dc:creator>
      <dc:date>2010-11-19T14:37:48Z</dc:date>
    </item>
    <item>
      <title>Re: Correlations between Input and Targetvariables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27182#M144</link>
      <description>The StatExplore node will compute correlations between interval inputs and interval targets, and the varclus node will compute variable-to-variable correlations. The StatExplore node will also choose predictors.</description>
      <pubDate>Fri, 19 Nov 2010 15:14:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27182#M144</guid>
      <dc:creator>David_Duling</dc:creator>
      <dc:date>2010-11-19T15:14:06Z</dc:date>
    </item>
    <item>
      <title>Re: Correlations between Input and Targetvariables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27183#M145</link>
      <description>Hello David Duling,&lt;BR /&gt;
&lt;BR /&gt;
PROC NPAR1WAY measures differences in the empirical distributions of predictors between the Event and Not Event classes of a binary target and provides several statistics to estimate these differences. This approach can also be applied to continuous targets by using a suitable format to split the target into several intervals. I have found this approach very convenient for preliminary analysis of predictors.&lt;BR /&gt;
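&lt;BR /&gt;
The idea of splitting a continuous target into intervals and comparing a predictor across them can be sketched outside SAS as well. The Python below is only an illustration: it uses the Kruskal-Wallis H statistic (without tie correction) as one example of a rank-based measure, and the cut points, helper names, and data are assumptions:&lt;BR /&gt;

```python
import bisect
from collections import defaultdict


def average_ranks(values):
    """Rank values from 1 to N, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start != len(order):
        end = start
        while end + 1 != len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1   # mean of the 1-based positions start..end
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def kruskal_wallis_h(predictor, group_labels):
    """Kruskal-Wallis H statistic of the predictor across groups (no tie correction)."""
    ranks = average_ranks(predictor)
    rank_sums = defaultdict(float)
    counts = defaultdict(int)
    for rank, label in zip(ranks, group_labels):
        rank_sums[label] += rank
        counts[label] += 1
    n = len(predictor)
    total = sum(rank_sums[g] ** 2 / counts[g] for g in counts)
    return 12.0 / (n * (n + 1)) * total - 3 * (n + 1)


def bin_target(value, cut_points):
    """Index of the interval a target value falls into, given sorted cut points."""
    return bisect.bisect_right(cut_points, value)


# Made-up monthly values, for illustration only.
temps = [1.0, 3.0, 6.0, 10.0, 14.0, 17.0, 19.0, 18.0, 15.0, 10.0, 5.0, 2.0]
sales = [92.0, 97.5, 99.0, 108.0, 113.5, 117.0, 122.5, 119.0, 114.0, 106.5, 98.0, 95.5]

labels = [bin_target(s, [100.0, 115.0]) for s in sales]   # low, medium, high sales
print(f"H = {kruskal_wallis_h(temps, labels):.3f}")
```

A larger H means the temperature distributions differ more between the sales intervals; comparing H across candidate predictors gives a rough ranking.&lt;BR /&gt;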
&lt;BR /&gt;
Sincerely,&lt;BR /&gt;
SPR</description>
      <pubDate>Fri, 19 Nov 2010 20:01:02 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27183#M145</guid>
      <dc:creator>SPR</dc:creator>
      <dc:date>2010-11-19T20:01:02Z</dc:date>
    </item>
    <item>
      <title>Re: Correlations between Input and Targetvariables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27184#M146</link>
      <description>SPR - it sounds like you should write an extension tool that we all can use.  &lt;BR /&gt;
&lt;BR /&gt;
&lt;A href="http://support.sas.com/documentation/onlinedoc/miner/em61/ext_nodes.pdf" target="_blank"&gt;http://support.sas.com/documentation/onlinedoc/miner/em61/ext_nodes.pdf&lt;/A&gt;&lt;BR /&gt;
&lt;BR /&gt;
Cheers.</description>
      <pubDate>Mon, 22 Nov 2010 13:37:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlations-between-Input-and-Targetvariables/m-p/27184#M146</guid>
      <dc:creator>David_Duling</dc:creator>
      <dc:date>2010-11-22T13:37:46Z</dc:date>
    </item>
  </channel>
</rss>

