<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Methods to the March Madness:  Data Mining in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Methods-to-the-March-Madness-Data-Mining/m-p/144480#M1417</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It's only open to Americans though :smileycry:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Do you happen to have data handy though, or should we compile our own?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 14 Mar 2014 15:47:55 GMT</pubDate>
    <dc:creator>Reeza</dc:creator>
    <dc:date>2014-03-14T15:47:55Z</dc:date>
    <item>
      <title>Methods to the March Madness:  Data Mining</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Methods-to-the-March-Madness-Data-Mining/m-p/144479#M1416</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;A href="http://blogs.sas.com/content/sascom/2014/03/14/who-wants-to-be-a-billionaire/" title="http://blogs.sas.com/content/sascom/2014/03/14/who-wants-to-be-a-billionaire/"&gt;Who wants to be a billionaire? - SAS Voices&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;As you may have heard, billionaire philanthropist Warren Buffett and Cleveland Cavaliers owner Dan Gilbert have teamed up &lt;A href="http://espn.go.com/mens-college-basketball/story/_/id/10582341/rick-reilly-predict-perfect-bracket-warren-buffett-owe-least-500-million"&gt;to offer $1 billion&lt;/A&gt; to anyone who can create a perfect NCAA March Madness bracket. “Wow,” you might say. “How hard can it be to create a perfect bracket? I could really use a billion bucks!” Well, the answer is “really, really, unbelievably hard.” So hard that in the history of March Madness, no one has ever done it. For you math lovers out there, the odds are supposedly 1 in 9.2 quintillion (there are 2&lt;SUP&gt;63&lt;/SUP&gt; ways to fill out a 63-game bracket). And what if someone is actually able to create this magical winning bracket?&lt;/P&gt;&lt;P&gt;"I will invite him or her to be my guest at the final game and be there with a check in my pocket, but I will not be cheering for him or her to win," Buffett said, jokingly. "I may even give them a little investment advice."&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;With all the mania around March Madness, I wanted to share with the Data Mining community how I used SAS Enterprise Miner. I relied heavily on SAS Enterprise Miner and SAS Rapid Predictive Modeler, using March Madness as a way to get customers comfortable with data mining techniques. These are the steps I took to pull in the data to be analyzed.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Through my research, I’ve also compiled a list of some helpful (and some not-so-helpful) factors for selection.
Here’s what’s been successful in the past:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;A href="http://espn.go.com/mens-college-basketball/rpi"&gt;RPI (Rating Percentage Index)&lt;/A&gt;, based upon wins, losses, and strength of schedule&lt;/LI&gt;&lt;LI&gt;&lt;A href="http://usatoday30.usatoday.com/sports/sagarin.htm"&gt;Jeff Sagarin rankings from &lt;EM&gt;USA Today&lt;/EM&gt;&lt;/A&gt;&lt;/LI&gt;&lt;LI&gt;Wins against top 25 teams (per RPI rankings)&lt;/LI&gt;&lt;LI&gt;Wins against teams ranked 26-50&lt;/LI&gt;&lt;LI&gt;Neutral court wins (Note: conference tournaments matter!)&lt;/LI&gt;&lt;LI&gt;Record and rank in-conference (regular season championships matter!)&lt;/LI&gt;&lt;LI&gt;Strength of conference (conferences do matter!)&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;And here’s what doesn’t work:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;A team’s record in the last 10 games, i.e., the "hot team" myth:&lt;UL&gt;&lt;LI&gt;A strong finish is not important compared with a team’s overall performance&lt;/LI&gt;&lt;LI&gt;Those "hot teams" are often doing some of the other things (winning on neutral courts and beating teams in the top 25 or top 50) that do help boost their chances according to the Dance Card&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;A team’s record against teams ranked 50-100:&lt;UL&gt;&lt;LI&gt;Winning against good teams helps, and the Dance Card model shows there's little downside in losing to good teams&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;The lesson for athletic directors?
Schedule fewer cupcakes and more top-50 RPI teams.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In addition to SAS products, here are the other sources I tapped:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Georgia Tech LRMC &lt;A href="http://www2.isye.gatech.edu/~jsokol/lrmc/"&gt;Bayesian results Basketball Rankings&lt;/A&gt; (School of Industrial and Systems Engineering)&lt;/LI&gt;&lt;LI&gt;&lt;A href="http://kenpom.com/"&gt;Kenpom.com&lt;/A&gt;: Advanced Analysis of College Basketball&lt;/LI&gt;&lt;LI&gt;&lt;A href="http://www.unf.edu/~jcoleman/dance.htm"&gt;NCAA Dance Card&lt;/A&gt; – University of North Florida (powered by SAS)&lt;/LI&gt;&lt;LI&gt;&lt;A href="http://usatoday30.usatoday.com/sports/sagarin.htm"&gt;Jeff Sagarin&lt;/A&gt; (USA Today Rankings), NCAA Basketball&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Let's submit a community bracket to prove that SAS has the best modelers around! Entries must be complete prior to the start of the NCAA tournament.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 14 Mar 2014 15:40:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Methods-to-the-March-Madness-Data-Mining/m-p/144479#M1416</guid>
      <dc:creator>kathyball_sas</dc:creator>
      <dc:date>2014-03-14T15:40:08Z</dc:date>
    </item>
    <item>
      <title>Re: Methods to the March Madness:  Data Mining</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Methods-to-the-March-Madness-Data-Mining/m-p/144480#M1417</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It's only open to Americans though :smileycry:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Do you happen to have data handy though, or should we compile our own?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 14 Mar 2014 15:47:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Methods-to-the-March-Madness-Data-Mining/m-p/144480#M1417</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2014-03-14T15:47:55Z</dc:date>
    </item>
    <item>
      <title>Re: Methods to the March Madness:  Data Mining</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Methods-to-the-March-Madness-Data-Mining/m-p/144481#M1418</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi &lt;A _jive_internal="true" href="https://communities.sas.com/people/Reeza"&gt;Reeza&lt;/A&gt;, that's exactly why we wanted to open up this discussion to ALL members! Traditionally, entries must be completed prior to the NCAA tournament, but the community brackets/predictions can be submitted throughout the tournament. I believe Kathy originally used some historical data for each team along with the other factors listed above. This discussion is just for fun, and a great way to use your Data Mining skills.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;-Anna-Marie&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 14 Mar 2014 15:54:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Methods-to-the-March-Madness-Data-Mining/m-p/144481#M1418</guid>
      <dc:creator>anna_holland</dc:creator>
      <dc:date>2014-03-14T15:54:33Z</dc:date>
    </item>
  </channel>
</rss>

