<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Social Network Analysis - How to combine Hadoop + Spark + SAS Enteprise Miner in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Social-Network-Analysis-How-to-combine-Hadoop-Spark-SAS/m-p/278250#M4134</link>
    <description>&lt;P&gt;teste&lt;/P&gt;</description>
    <pubDate>Fri, 15 Jul 2016 07:59:56 GMT</pubDate>
    <dc:creator>Rodgers_125</dc:creator>
    <dc:date>2016-07-15T07:59:56Z</dc:date>
    <item>
      <title>Social Network Analysis - How to combine Hadoop + Spark + SAS Enteprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Social-Network-Analysis-How-to-combine-Hadoop-Spark-SAS/m-p/277231#M4127</link>
      <description>&lt;P&gt;teste&lt;/P&gt;</description>
      <pubDate>Fri, 15 Jul 2016 07:59:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Social-Network-Analysis-How-to-combine-Hadoop-Spark-SAS/m-p/277231#M4127</guid>
      <dc:creator>Rodgers_125</dc:creator>
      <dc:date>2016-07-15T07:59:39Z</dc:date>
    </item>
    <item>
      <title>Re: Social Network Analysis - How to combine Hadoop + Spark + SAS Enteprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Social-Network-Analysis-How-to-combine-Hadoop-Spark-SAS/m-p/278201#M4133</link>
      <description>&lt;P&gt;Hi Rodgers,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If I understand correctly, you have one table/dataset that is 80GB in Hadoop stored as 1000 csv files. You want to use Spark to do data cleaning and Enterprise Miner to mine patterns from this data. I have not worked with Spark much but it distributes and works with data in-memory so your data manipulations should be fast.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Once the data is ready for the modeling, you can bring it into SAS Enterprise Miner via SAS/ACCESS for Hadoop or Hive.&amp;nbsp;SAS Enterprise Miner has HPA (High-Performance Analytics) nodes under HPDM tab to handle big data (as is your case). You will need SAS High-Performance Data Mining license to add this capability when running in distributed mode (where the data is distributed on multiple machines and computations performed in parallel fashion).&lt;/P&gt;
&lt;P&gt;For more details about HPA and how it works in SAS Enterprise Miner, read the following tips:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;A href="http://communities.sas.com/t5/SAS-Communities-Library/SAS-High-Performance-Analytics-tip-1-How-it-differs-from-SAS/ta-p/244538" target="_blank"&gt;SAS High-Performance Analytics tip #1: How it differs from SAS Grid and SAS In-Memory Analytics&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="http://communities.sas.com/t5/SAS-Communities-Library/SAS-High-Performance-Analytics-tip-2-HPDM-nodes-in-SAS/ta-p/247513" target="_blank"&gt;SAS High-Performance Analytics tip #2: HPDM nodes in SAS Enterprise Miner&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="http://communities.sas.com/t5/SAS-Communities-Library/SAS-High-Performance-Analytics-tip-3-Example-flow-diagram-in-SAS/ta-p/248960" target="_blank"&gt;SAS High-Performance Analytics tip #3: Example flow diagram in SAS Enterprise Miner&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="http://communities.sas.com/t5/SAS-Communities-Library/SAS-High-Performance-Analytics-tip-4-Scoring-with-SAS-Enterprise/ta-p/250139" target="_blank"&gt;SAS High-Performance Analytics tip #4: Scoring with SAS Enterprise Miner&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/SAS-High-Performance-Analytics-tip-5-Scoring-with-Analytic-Store/ta-p/253544" target="_self"&gt;SAS-High Performance Analytics tip #5: Scoring with Analytic Store files&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Hope this helps !&lt;/P&gt;
&lt;P&gt;Radhikha&lt;/P&gt;</description>
      <pubDate>Fri, 17 Jun 2016 14:28:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Social-Network-Analysis-How-to-combine-Hadoop-Spark-SAS/m-p/278201#M4133</guid>
      <dc:creator>RadhikhaMyneni</dc:creator>
      <dc:date>2016-06-17T14:28:34Z</dc:date>
    </item>
    <item>
      <title>Re: Social Network Analysis - How to combine Hadoop + Spark + SAS Enteprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Social-Network-Analysis-How-to-combine-Hadoop-Spark-SAS/m-p/278250#M4134</link>
      <description>&lt;P&gt;teste&lt;/P&gt;</description>
      <pubDate>Fri, 15 Jul 2016 07:59:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Social-Network-Analysis-How-to-combine-Hadoop-Spark-SAS/m-p/278250#M4134</guid>
      <dc:creator>Rodgers_125</dc:creator>
      <dc:date>2016-07-15T07:59:56Z</dc:date>
    </item>
    <item>
      <title>Re: Social Network Analysis - How to combine Hadoop + Spark + SAS Enteprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Social-Network-Analysis-How-to-combine-Hadoop-Spark-SAS/m-p/278297#M4136</link>
      <description>&lt;P&gt;It usually comes down to the amount of resources (cpus, memory, disk space) available on the SAS server/machine. If your SAS server is being shared with other users, it will definitely affect their performance too, not to mention&amp;nbsp;the time&amp;nbsp;to transfer 80GB of data on the network.&amp;nbsp;I would strongly&amp;nbsp;recommend chatting with your SAS admin to make sure if the server can handle this data size and if it does, maybe work during off-hours. Also, use&amp;nbsp;data step whenever possible&amp;nbsp;instead of sql for data manipulations on this sized data.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Jun 2016 19:14:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Social-Network-Analysis-How-to-combine-Hadoop-Spark-SAS/m-p/278297#M4136</guid>
      <dc:creator>RadhikhaMyneni</dc:creator>
      <dc:date>2016-06-17T19:14:20Z</dc:date>
    </item>
  </channel>
</rss>

