<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Analysis Question: Which SAS/Stat  proc to use in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32744#M1327</link>
    <description>We have a very large sample of observations of a process that is believed to depend on a four real variables (w, x, y, and z), some of which are correlated. The process is expected to be “close to” a known function of w, call it g(w), with fluctuations around g that are the effect of the other three variables.&lt;BR /&gt;
&lt;BR /&gt;
Question: How can SAS/Stat be used to derive a predictive formula of the form of:&lt;BR /&gt;
&lt;BR /&gt;
F(w,x,y,z) = (a1 * g(w)) + (a2 * x) + (a3 * y) + (a4 * z),&lt;BR /&gt;
&lt;BR /&gt;
where the ai’s are the standard “least squares” coefficients based on the given sample?&lt;BR /&gt;
&lt;BR /&gt;
Thanks in advance...</description>
    <pubDate>Fri, 18 Dec 2009 19:42:43 GMT</pubDate>
    <dc:creator>deleted_user</dc:creator>
    <dc:date>2009-12-18T19:42:43Z</dc:date>
    <item>
      <title>Analysis Question: Which SAS/Stat  proc to use</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32744#M1327</link>
      <description>We have a very large sample of observations of a process that is believed to depend on a four real variables (w, x, y, and z), some of which are correlated. The process is expected to be “close to” a known function of w, call it g(w), with fluctuations around g that are the effect of the other three variables.&lt;BR /&gt;
&lt;BR /&gt;
Question: How can SAS/Stat be used to derive a predictive formula of the form of:&lt;BR /&gt;
&lt;BR /&gt;
F(w,x,y,z) = (a1 * g(w)) + (a2 * x) + (a3 * y) + (a4 * z),&lt;BR /&gt;
&lt;BR /&gt;
where the ai’s are the standard “least squares” coefficients based on the given sample?&lt;BR /&gt;
&lt;BR /&gt;
Thanks in advance...</description>
      <pubDate>Fri, 18 Dec 2009 19:42:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32744#M1327</guid>
      <dc:creator>deleted_user</dc:creator>
      <dc:date>2009-12-18T19:42:43Z</dc:date>
    </item>
    <item>
      <title>Re: Analysis Question: Which SAS/Stat  proc to use</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32745#M1328</link>
      <description>You don't provide all relevant information to answer your question.&lt;BR /&gt;
&lt;BR /&gt;
For instance, are there any parameters in F(w,x,y,z) or g(w) which need to be estimated?  Or are F(w,x,y,z) and g(w) specified a priori so that all you need to do is estimate a1, a2, a3, and a4?&lt;BR /&gt;
&lt;BR /&gt;
Also, would it be prudent to include an intercept in your model so that you estimate&lt;BR /&gt;
&lt;BR /&gt;
F(w,x,y,z) = a0  +   (a1*g(w))  +  (a2*x)  +  (a3*y)  +  (a4*z)&lt;BR /&gt;
&lt;BR /&gt;
And too, can we assume that the residuals are normally distributed?  I would think that would be a reasonable assumption, but you know what happens when you assume too much.&lt;BR /&gt;
&lt;BR /&gt;
Those are a few questions which come to mind immediately.  If there is anything else which is important to know, please share that as well.</description>
      <pubDate>Fri, 18 Dec 2009 22:58:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32745#M1328</guid>
      <dc:creator>Dale</dc:creator>
      <dc:date>2009-12-18T22:58:36Z</dc:date>
    </item>
    <item>
      <title>Re: Analysis Question: Which SAS/Stat  proc to use</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32746#M1329</link>
      <description>1. All values of g(w) will be known a priori, and the F(w,x,y,z) will be known (for a very large number of specific {w, x, y, z} combinations) as they are the results of the "observations" we're making.   &lt;BR /&gt;
&lt;BR /&gt;
2. Yes, we should probably calculate the intercept value, a0, as well.&lt;BR /&gt;
&lt;BR /&gt;
3. Yes, I have been assuming the residuals are normally distributed.&lt;BR /&gt;
&lt;BR /&gt;
Thanks!</description>
      <pubDate>Tue, 22 Dec 2009 13:18:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32746#M1329</guid>
      <dc:creator>deleted_user</dc:creator>
      <dc:date>2009-12-22T13:18:54Z</dc:date>
    </item>
    <item>
      <title>Re: Analysis Question: Which SAS/Stat  proc to use</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32747#M1330</link>
      <description>I don't know what you mean when you state that "the F(w,x,y,z) will be known ... as they are the results of 'observations' we're making."  Is there a variable which you are collecting which represents F(w,x,y,z)?  Your response is a little cryptic here.&lt;BR /&gt;
&lt;BR /&gt;
If you have a variable RESULT and you know the parameters of the function g(w) (and can therefore construct in a data set a variable g_w=g(w)), then you could simply use PROC REG to obtain the least squares solution to the equation&lt;BR /&gt;
&lt;BR /&gt;
  RESULT = a0 + (a1 * g_w) + (a2 * x) + (a3 * y) + (a4 * z)&lt;BR /&gt;
&lt;BR /&gt;
You would simply write:&lt;BR /&gt;
&lt;BR /&gt;
proc reg data=mydata;&lt;BR /&gt;
  model RESULT = g_2 x y z;&lt;BR /&gt;
run;&lt;BR /&gt;
&lt;BR /&gt;
But I am not certain that this is what you are looking for since I still don't know how F(w,x,y,z) is obtained.</description>
      <pubDate>Wed, 30 Dec 2009 20:13:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32747#M1330</guid>
      <dc:creator>Dale</dc:creator>
      <dc:date>2009-12-30T20:13:21Z</dc:date>
    </item>
    <item>
      <title>Re: Analysis Question: Which SAS/Stat  proc to use</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32748#M1331</link>
      <description>I think the problem is that I'm being sloppy in the way I'm using the function F. What we really known are the &lt;B&gt;values of the observations&lt;/B&gt; based on a large number of specific combinations of parameters.  Maybe this will help:&lt;BR /&gt;
&lt;BR /&gt;
Let's assume I have N observations based on experiments performed with N sets of the four parameters. Let's say that for j = 1, 2...,N, O(j) is the value observed from experiment j, which had inputs of w(j), x(j), y(j), and z(j). What I'm really looking for is a formula, F(w,x,y,z), that lets me predict the outcome of an experiment run with a (potentially new) combination of the four parameters. I.e., given 4 new parameters, {w0, x0, y0, z0}, the (least squares) &lt;B&gt;predicted&lt;/B&gt; value of the experiment would equal F(w0, x0, y0, z0).    &lt;BR /&gt;
&lt;BR /&gt;
I apologize for the confusion -- and hope this helps. Thanks.</description>
      <pubDate>Wed, 30 Dec 2009 21:07:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32748#M1331</guid>
      <dc:creator>deleted_user</dc:creator>
      <dc:date>2009-12-30T21:07:16Z</dc:date>
    </item>
    <item>
      <title>Re: Analysis Question: Which SAS/Stat  proc to use</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32749#M1332</link>
      <description>So, F(w,x,y,z) is an observed value placed into the variable O.  As previously stated, the REG procedure would be appropriate for constructing a predictor of O=F(w, x, y, z) which you could apply to F(w0, x0, y0, z0).  With variables O, w, x, y, z, and g_w=g(w) in a data set named MYDATA, you can obtain parameters a0 through a4 of the equation&lt;BR /&gt;
&lt;BR /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;O = a0 + (a1 * g_w) + (a2 * x) + (a3 * y) + (a4 * z)&lt;BR /&gt;
&lt;BR /&gt;
employing the code&lt;BR /&gt;
&lt;BR /&gt;
&amp;nbsp;&amp;nbsp;proc reg data=mydata;&lt;BR /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;model O = g_w x y z;&lt;BR /&gt;
&amp;nbsp;&amp;nbsp;run;&lt;BR /&gt;
&lt;BR /&gt;
You could also use any of a number of other SAS procedures to obtain the same results: procs GLM, GENMOD, MIXED, GLIMMIX, ORTHOREG to name a few.  The ORTHOREG procedure is of some special interest in that it works well with what are referred to as ill-conditioned data.  Ill-conditioned data arises when there are very strong correlations among the predictor variables.  But I presume that for your experimental setting where you are manipulating w, x, y, and z, poorly conditioned data should not really be a problem.</description>
      <pubDate>Thu, 31 Dec 2009 17:19:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analysis-Question-Which-SAS-Stat-proc-to-use/m-p/32749#M1332</guid>
      <dc:creator>Dale</dc:creator>
      <dc:date>2009-12-31T17:19:04Z</dc:date>
    </item>
  </channel>
</rss>

