<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Proc Logistic:  Rebuild models when errors in build data are discovered? in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-Rebuild-models-when-errors-in-build-data-are/m-p/263314#M13901</link>
    <description>&lt;P&gt;Thanks! If the issues cannot be corrected, do you have any suggestions on how to take this into account when building new models based on this data?&lt;/P&gt;</description>
    <pubDate>Tue, 12 Apr 2016 19:14:54 GMT</pubDate>
    <dc:creator>RobertNYC</dc:creator>
    <dc:date>2016-04-12T19:14:54Z</dc:date>
    <item>
      <title>Proc Logistic:  Rebuild models when errors in build data are discovered?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-Rebuild-models-when-errors-in-build-data-are/m-p/263301#M13899</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;
&lt;P&gt;I’m using SAS/STAT and PROC LOGISTIC to build some basic product (retail) propensity models. My questions concern a problem that has developed with the data behind some of these “on the shelf” logistic regression models, and this seemed like a good place to get some initial advice on how to handle it.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In a nutshell, the customer IDs on which I base the model build samples are not an accurate representation of customers. A number of customers were assigned more than one customer ID (more than one email, more than one address, and similar issues). So two customer IDs could actually be one “customer”.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Because I built my propensity models on customer ID, I modeled only portions of some customers’ behavior and counted some customers more than once.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here are my questions:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Should I rebuild the models once the data is corrected?&lt;/LI&gt;
&lt;LI&gt;Should I do some validation work on the existing models now, by combining all associated customer IDs, creating a new identifier, and creating gains charts?&lt;/LI&gt;
&lt;LI&gt;If the data issue cannot be corrected, is there a way to take these duplicate customer IDs into account when I rebuild or create new models?&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Any feedback will be greatly appreciated! Thanks!&lt;/P&gt;</description>
      <pubDate>Tue, 12 Apr 2016 18:45:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-Rebuild-models-when-errors-in-build-data-are/m-p/263301#M13899</guid>
      <dc:creator>RobertNYC</dc:creator>
      <dc:date>2016-04-12T18:45:26Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic:  Rebuild models when errors in build data are discovered?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-Rebuild-models-when-errors-in-build-data-are/m-p/263310#M13900</link>
      <description>&lt;P&gt;An assumption of regression is independence between observations.&lt;/P&gt;
&lt;P&gt;The ID issue violates this assumption, so yes, you should fix it if possible.&lt;/P&gt;</description>
      <pubDate>Tue, 12 Apr 2016 19:02:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-Rebuild-models-when-errors-in-build-data-are/m-p/263310#M13900</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2016-04-12T19:02:29Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic:  Rebuild models when errors in build data are discovered?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-Rebuild-models-when-errors-in-build-data-are/m-p/263314#M13901</link>
      <description>&lt;P&gt;Thanks! If the issues cannot be corrected, do you have any suggestions on how to take this into account when building new models based on this data?&lt;/P&gt;</description>
      <pubDate>Tue, 12 Apr 2016 19:14:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-Rebuild-models-when-errors-in-build-data-are/m-p/263314#M13901</guid>
      <dc:creator>RobertNYC</dc:creator>
      <dc:date>2016-04-12T19:14:54Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic:  Rebuild models when errors in build data are discovered?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-Rebuild-models-when-errors-in-build-data-are/m-p/266039#M14016</link>
      <description>&lt;P&gt;Please don't think this is a flippant answer, but if you cannot set up the IDs as independent, then I would strongly suggest that you not build new models from the data, but rather spend your available time and money on collecting usable data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;However, if that really can't be done, then some sort of hierarchical modeling might be attempted to account for the multiple IDs per unique customer. If the unique identifier can be found, then you might treat the multiple records as repeated measures on the individual. From there, it gets considerably murkier, as model selection procedures in the mixed model realm are not easily defined. You will have to depend on subject knowledge more than you may want.&lt;/P&gt;
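&lt;P&gt;As a rough sketch of how a unique identifier might be derived, assuming two customer IDs belong to the same person whenever they share an email or an address, the Python below groups linked IDs with a union-find structure. The function name &lt;CODE&gt;unify_customer_ids&lt;/CODE&gt;, the record layout, and the matching rule are all illustrative assumptions, not anything taken from the actual data.&lt;/P&gt;
&lt;PRE&gt;
```python
def unify_customer_ids(records):
    """Map each customer ID to one representative ID per real customer.

    records: iterable of (customer_id, email, address) tuples.
    This layout is hypothetical. Two IDs are assumed to be the same
    customer if they share an exact email or an exact address.
    """
    parent = {}

    def find(x):
        # Follow parent links to the root, shortening the path as we go.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Keep the smaller ID as representative so results are deterministic.
            keep, drop = sorted((root_a, root_b))
            parent[drop] = keep

    first_seen = {}  # maps (field, value) to the first ID seen with that value
    for cust_id, email, address in records:
        find(cust_id)  # register the ID even if it links to nothing
        for key in (("email", email), ("address", address)):
            if key[1] is None:
                continue  # a missing value must not link customers together
            if key in first_seen:
                union(cust_id, first_seen[key])
            else:
                first_seen[key] = cust_id

    return {cust_id: find(cust_id) for cust_id in parent}


if __name__ == "__main__":
    sample = [
        ("C1", "a@x.com", "1 Main St"),
        ("C2", "a@x.com", "9 Oak Ave"),
        ("C3", "b@y.com", "9 Oak Ave"),
        ("C4", "c@z.com", None),
    ]
    print(unify_customer_ids(sample))
```
&lt;/PRE&gt;
&lt;P&gt;The resulting key could serve as the subject (cluster) identifier in a repeated-measures or mixed model, or be used to collapse records to one row per customer before refitting. Exact matching on email or address is a simplification; real data would likely need normalization and looser matching rules.&lt;/P&gt;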
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Steve Denham&lt;/P&gt;</description>
      <pubDate>Mon, 25 Apr 2016 14:06:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-Rebuild-models-when-errors-in-build-data-are/m-p/266039#M14016</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2016-04-25T14:06:29Z</dc:date>
    </item>
  </channel>
</rss>

