<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Random Forests in Enterprise Miner in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8889#M16</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;If you have access to the EM product or its documentation, gradient boosting details are under the Gradient Boosting node. The PROC version of GB is PROC TREEBOOST. I think the details of all the PROCs behind EM are now public on the SAS support site, although the official policy remains 'as is', meaning not supported by SAS Technical Support. I know the high-performance version of GB is under construction. No info on when it will be ready.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Jason Xin&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Sat, 28 Sep 2013 22:11:55 GMT</pubDate>
    <dc:creator>JasonXin</dc:creator>
    <dc:date>2013-09-28T22:11:55Z</dc:date>
    <item>
      <title>Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8881#M8</link>
      <description>Via the settings in the decision tree node, is it possible to mimic random forests? I've read SAS help, Applied Analytics Using SAS Enterprise Miner, and done a Google search, but I'm not getting very far.&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
(I think I can do k-fold cross validation using the cross validation settings for the decision tree, but I'm not sure I'm doing it correctly.)&lt;BR /&gt;
&lt;BR /&gt;
Any suggestions or references? I feel like I've got a good basic feel for Enterprise Miner, and a decent theoretical background in various machine learning techniques (I've read a lot of Elements of Statistical Learning: Data Mining, Inference and Prediction (http://www-stat.stanford.edu/~tibs/ElemStatLearn/) and watched Andrew Ng's Machine Learning lectures (http://www.youtube.com/view_play_list?p=A89DCFA6ADACE599)).&lt;BR /&gt;
&lt;BR /&gt;
I need some more advanced references using Enterprise Miner. &lt;BR /&gt;
&lt;BR /&gt;
Thanks.</description>
      <pubDate>Sat, 12 Feb 2011 02:04:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8881#M8</guid>
      <dc:creator>SlutskyFan</dc:creator>
      <dc:date>2011-02-12T02:04:11Z</dc:date>
    </item>
    <item>
      <title>Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8883#M10</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Does the new version of Miner perform random forests?&lt;/P&gt;&lt;P&gt;&lt;A href="http://support.sas.com/documentation/cdl/en/whatsnew/64209/HTML/default/viewer.htm#emwhatsnew71m1.htm"&gt;http://support.sas.com/documentation/cdl/en/whatsnew/64209/HTML/default/viewer.htm#emwhatsnew71m1.htm&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #000000; font-family: arial, 'Arial Unicode MS', geneva, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;"New procedures cover data binning, imputation, sampling, decisions, logistic and linear regressions, neural networks, &lt;EM&gt;&lt;STRONG&gt;random forests&lt;/STRONG&gt;&lt;/EM&gt;&lt;/SPAN&gt;"&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 13 Mar 2012 18:10:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8883#M10</guid>
      <dc:creator>Ajay</dc:creator>
      <dc:date>2012-03-13T18:10:54Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8884#M11</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It has been a while since I inquired about this, but I found that gradient boosting was very useful! Thanks. &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 14 Nov 2012 21:13:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8884#M11</guid>
      <dc:creator>SlutskyFan</dc:creator>
      <dc:date>2012-11-14T21:13:56Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8885#M12</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;In EM 7.1, try PROC FOREST, which builds random forests in SAS EM. Unfortunately, SAS doesn't release the syntax or detailed documentation. If you have EM 7.1, you need to use the code-generating function to peek at the secrets.&lt;/P&gt;&lt;P&gt;One advantage of random forests is that they are very easy for the user to parallelize. I can build a random forest of 2000 small trees by firing 4 sessions simultaneously, each building 500 small ones.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 15 Nov 2012 15:19:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8885#M12</guid>
      <dc:creator>oloolo</dc:creator>
      <dc:date>2012-11-15T15:19:07Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8886#M13</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;In the major release SAS had in August 2012, EM has a random forest node. The latest version of EM is 12.2, with an HPFOREST node that essentially runs PROC HPFOREST from the High Performance Analytics offerings.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I posted on my blog &lt;A href="http://analytics-in-writing.blogspot.com/" title="http://analytics-in-writing.blogspot.com/"&gt;Analytics in Writing&lt;/A&gt; several usage examples for the HPFOREST node and PROC HPFOREST.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Random forest modeling typically requires a lot of memory. In the large-scale predictive learning world, there are people who invest in building in-memory models and modes of modeling, vs. others who invest in 'smart finesses' such as MapReduce. In in-memory applications, for example, for the sake of building a random forest, 1.5 TB of RAM distributed across parallel worker nodes is often not considered LARGE or MUCH.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I once saw SAS programmers writing Base SAS code to build random forests. Over 10 years ago, I first saw Salford Systems' offering, which typically ran on smaller data sets. Complexity naturally comes with big data sets. This is where random forests are supposed to 'shine', but learning algorithms from papers is one thing. Industrializing them on a large scale is an entirely different game. I have used the SAS HPFOREST capabilities for a while. I believe it is still generation ONE, but it has crossed the critical threshold into industrialization.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 14 Mar 2013 00:32:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8886#M13</guid>
      <dc:creator>JasonXin</dc:creator>
      <dc:date>2013-03-14T00:32:56Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8887#M14</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi David,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is there any reference about &lt;SPAN style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;Gradient Boosting&lt;/SPAN&gt;? Thanks&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 25 Jul 2013 10:04:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8887#M14</guid>
      <dc:creator>TomiKong</dc:creator>
      <dc:date>2013-07-25T10:04:09Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8888#M15</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I just came across this animated video on HPFOREST showing an example of how it may work in the academic space. While not getting into detail, it's a quick and artful watch.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.youtube.com/watch?v=p7cTLS7xQYw&amp;amp;feature=em-uploademail" title="http://www.youtube.com/watch?v=p7cTLS7xQYw&amp;amp;feature=em-uploademail"&gt;Animating Analytics: PROC HPFOREST - YouTube&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Anna&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 24 Sep 2013 13:54:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8888#M15</guid>
      <dc:creator>AnnaBrown</dc:creator>
      <dc:date>2013-09-24T13:54:23Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8889#M16</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;If you have access to the EM product or its documentation, gradient boosting details are under the Gradient Boosting node. The PROC version of GB is PROC TREEBOOST. I think the details of all the PROCs behind EM are now public on the SAS support site, although the official policy remains 'as is', meaning not supported by SAS Technical Support. I know the high-performance version of GB is under construction. No info on when it will be ready.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Jason Xin&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 28 Sep 2013 22:11:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8889#M16</guid>
      <dc:creator>JasonXin</dc:creator>
      <dc:date>2013-09-28T22:11:55Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8890#M17</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Jason,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I found the articles on your website really helpful. Do you have any documentation relating to PROC HPFOREST which you can email me?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 11 Nov 2014 23:10:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8890#M17</guid>
      <dc:creator>AarushIssar</dc:creator>
      <dc:date>2014-11-11T23:10:43Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8891#M18</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Please contact Tech Support (&lt;A href="http://support.sas.com/ctx/supportform/createForm" title="http://support.sas.com/ctx/supportform/createForm"&gt;Technical Support Form&lt;/A&gt;) to get access to the secure HP procedure documentation that is available from the link:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://support.sas.com/documentation/onlinedoc/miner/" title="http://support.sas.com/documentation/onlinedoc/miner/"&gt;http://support.sas.com/documentation/onlinedoc/miner/&lt;/A&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 12 Nov 2014 14:37:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/8891#M18</guid>
      <dc:creator>WendyCzika</dc:creator>
      <dc:date>2014-11-12T14:37:32Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/286972#M4265</link>
      <description>&lt;P&gt;Since the time of this original post (over 5 years!), SAS Enterprise Miner has added deep support for random forests, including an HP Forest node.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;See &lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/Tip-Getting-the-Most-from-your-Random-Forest/ta-p/223949" target="_self"&gt;Getting the most from your Random Forests in SAS Enterprise Miner&lt;/A&gt;. &amp;nbsp;Also, &lt;A href="https://www.youtube.com/watch?v=EOxwpnbFqIU" target="_self"&gt;watch this YouTube video&lt;/A&gt;&amp;nbsp;about Random Forest and Support Vector Machines.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You might also want to read &lt;A href="https://support.sas.com/resources/papers/proceedings14/SAS133-2014.pdf" target="_self"&gt;this paper about ensemble models in SAS Enterprise Miner.&lt;/A&gt;&amp;nbsp; From the abstract:&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Ensemble models combine two or more models to enable a more robust prediction, classification, or variable selection. &lt;A href="https://support.sas.com/resources/papers/proceedings14/SAS133-2014.pdf" target="_self"&gt;This paper&lt;/A&gt; describes three types of ensemble models: boosting, bagging, and model averaging. It discusses go-to methods, such as gradient boosting and random forest, and newer methods, such as rotational forest and fuzzy clustering.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 25 Jul 2016 18:19:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/286972#M4265</guid>
      <dc:creator>ChrisHemedinger</dc:creator>
      <dc:date>2016-07-25T18:19:49Z</dc:date>
    </item>
    <item>
      <title>Re: Random Forests in Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/405331#M6169</link>
      <description>&lt;P&gt;Is there a way to set the HP Forest node to choose the best model based on prediction performance on validation data?&amp;nbsp; I only see in-sample options, no out-of-sample evaluation methods.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Maria&lt;/P&gt;</description>
      <pubDate>Wed, 18 Oct 2017 20:10:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Random-Forests-in-Enterprise-Miner/m-p/405331#M6169</guid>
      <dc:creator>weeseml</dc:creator>
      <dc:date>2017-10-18T20:10:08Z</dc:date>
    </item>
  </channel>
</rss>

