<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Cululative Curves in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232347#M12214</link>
    <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a data set of around 80K customers. I have classified these customers as GOOD, BAD or INDETERMINATE based on their payment history for the Last 12 months.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Each customer is assigned a Clssification od either Good, Bad or Indeterminiate, in the same file I have Application Scores for these customers (i.e. each has a score assigned to them from 0 to 100). I want to test the reliability of these scores in terms of the classification I did for these customers (i.e. to be sure that more bads are at lower scores and goods at higher scores.). Could somebody help me with the code that I could use to get a lift curve and/or K-S Curve, Gini, ROC etc or analysis of cumulative goods vs Bad.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Sample&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Application&amp;nbsp;Score&amp;nbsp;&amp;nbsp; Score Range&amp;nbsp; Classification&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;10&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0-10&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Bad&lt;/P&gt;&lt;P&gt;&amp;nbsp;30&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 21-30&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Bad&lt;/P&gt;&lt;P&gt;&amp;nbsp;68&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;61-70&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Good&lt;/P&gt;&lt;P&gt;&amp;nbsp;12&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 11-20&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Good&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also, is there a way to determinethe cut-off that I can come up with for the&amp;nbsp;application score that I could use to accept or reject cistomer (maybe a reverse cumulative distribution for the bad)?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have tried a lot to try to get codes for SAS but unsuccessful, please HELP!&lt;/P&gt;</description>
    <pubDate>Thu, 29 Oct 2015 22:30:41 GMT</pubDate>
    <dc:creator>mithusaini</dc:creator>
    <dc:date>2015-10-29T22:30:41Z</dc:date>
    <item>
      <title>Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232347#M12214</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a data set of around 80K customers. I have classified these customers as GOOD, BAD or INDETERMINATE based on their payment history for the Last 12 months.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Each customer is assigned a Clssification od either Good, Bad or Indeterminiate, in the same file I have Application Scores for these customers (i.e. each has a score assigned to them from 0 to 100). I want to test the reliability of these scores in terms of the classification I did for these customers (i.e. to be sure that more bads are at lower scores and goods at higher scores.). Could somebody help me with the code that I could use to get a lift curve and/or K-S Curve, Gini, ROC etc or analysis of cumulative goods vs Bad.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Sample&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Application&amp;nbsp;Score&amp;nbsp;&amp;nbsp; Score Range&amp;nbsp; Classification&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;10&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0-10&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Bad&lt;/P&gt;&lt;P&gt;&amp;nbsp;30&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 21-30&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Bad&lt;/P&gt;&lt;P&gt;&amp;nbsp;68&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;61-70&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Good&lt;/P&gt;&lt;P&gt;&amp;nbsp;12&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 11-20&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Good&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also, is there a way to determinethe cut-off that I can come up with for the&amp;nbsp;application score that I could use to accept or reject cistomer (maybe a reverse cumulative distribution for the bad)?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have tried a lot to try to get codes for SAS but unsuccessful, please HELP!&lt;/P&gt;</description>
      <pubDate>Thu, 29 Oct 2015 22:30:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232347#M12214</guid>
      <dc:creator>mithusaini</dc:creator>
      <dc:date>2015-10-29T22:30:41Z</dc:date>
    </item>
    <item>
      <title>Re: Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232365#M12215</link>
      <description>&lt;P&gt;You should try decision tree procedure HPSPLIT. Something like:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc hpsplit data=test;
target class;
input score / level=int;
output nodestats=want;
run;

option linesize=120;
proc print data=want label noobs; 
where depth=1; 
var leaf n predictedvalue insplitvar decision p_: ; 
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You will get optimal cutting scores between your classes as well as classification rates.&lt;/P&gt;</description>
      <pubDate>Fri, 30 Oct 2015 02:41:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232365#M12215</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2015-10-30T02:41:23Z</dc:date>
    </item>
    <item>
      <title>Re: Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232474#M12240</link>
      <description>&lt;P&gt;Hi PG,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for the response I tried the code but SAS log returns an error message of -&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;"ERROR: Procedure HPSPLIT not found."&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This was a similar situation for PROC Reliability as well, would you know why this is happening?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have SAS 9.2 and no Enterprise Miner.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Gavin&lt;/P&gt;</description>
      <pubDate>Fri, 30 Oct 2015 16:38:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232474#M12240</guid>
      <dc:creator>mithusaini</dc:creator>
      <dc:date>2015-10-30T16:38:14Z</dc:date>
    </item>
    <item>
      <title>Re: Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232479#M12241</link>
      <description>&lt;P&gt;HPSPLIT is rather recent. The first mention of HPSPLIT in the documentation is for version 12.3 of SAS/STAT. If you have access to JMP you could do roughly the same thing with the &lt;EM&gt;partition&lt;/EM&gt; platform.&lt;/P&gt;</description>
      <pubDate>Fri, 30 Oct 2015 16:49:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232479#M12241</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2015-10-30T16:49:34Z</dc:date>
    </item>
    <item>
      <title>Re: Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232498#M12242</link>
      <description>&lt;P&gt;I do not have access to JMP, is there a way of doing this on SAS 9.2&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Appreciate the help.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Gavin&lt;/P&gt;</description>
      <pubDate>Fri, 30 Oct 2015 18:55:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232498#M12242</guid>
      <dc:creator>mithusaini</dc:creator>
      <dc:date>2015-10-30T18:55:57Z</dc:date>
    </item>
    <item>
      <title>Re: Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232501#M12243</link>
      <description>&lt;P&gt;Untested, but try these ideas:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Recode the response variable as Bad= -1, indeterminant=0, and good=1.&amp;nbsp; You can fit the response by using the "score" as the explanatory variable for ordinal logistic regression.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/documentation/cdl/en/statug/63347/HTML/default/viewer.htm#statug_logistic_sect019.htm" target="_self"&gt;The ROC statement in PROC LOGISTIC&lt;/A&gt;&amp;nbsp;enables you to construct ROC curves for the response in terms of the scores.&lt;/P&gt;
&lt;P&gt;Use the LINK=CLOGLOG&amp;nbsp;option to fit the ordinal response.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/documentation/cdl/en/statug/63347/HTML/default/viewer.htm#statug_logistic_sect042.htm" target="_self"&gt;The "response profile" table &lt;/A&gt;gives various concordance statistics&amp;nbsp;such as Gini and the area under the ROC curve.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 30 Oct 2015 19:37:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232501#M12243</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2015-10-30T19:37:23Z</dc:date>
    </item>
    <item>
      <title>Re: Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232507#M12244</link>
      <description>&lt;P&gt;In last resort, you could try discriminant analysis, the non-parametric version:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;
/* Scores over the range of possible values */
data testvalues;
do score = 0 to 100 by 0.1;
    output;
    end;
run;

/* non-parametric discriminant analysis */
proc discrim data=test method=npar kernel=normal r=5 testdata=testvalues testout=testScore;
class class;
var score;
run;

/* Get the predicted score range for each class */
proc sql;
select _into_ as class, min(score) as fromScore, max(score) as toScore
from testScore
group by _into_
order by fromScore;
quit;&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Fri, 30 Oct 2015 20:03:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232507#M12244</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2015-10-30T20:03:53Z</dc:date>
    </item>
    <item>
      <title>Re: Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232557#M12248</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/462"&gt;@PGStats﻿&lt;/a&gt;&amp;nbsp;I thought about recommending PROC CANCORR, but discriminant analysis is more appropriate for nominal than ordinal categories. What is your reason for recommending the nonparametrix discriminant analysis over the linear?&lt;/P&gt;</description>
      <pubDate>Sat, 31 Oct 2015 10:26:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232557#M12248</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2015-10-31T10:26:31Z</dc:date>
    </item>
    <item>
      <title>Re: Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232606#M12250</link>
      <description>&lt;P&gt;Hi &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS﻿&lt;/a&gt;, I suggested non parametric discriminant analysis because I didn't want to make strong assumptions about the score distribution in each class. But more importantly, I thought that using a small kernel would yield sharper delineation of the classes, i.e. class border positions would be determined locally. I chose a normal kernel because of its infinite support.&lt;/P&gt;</description>
      <pubDate>Sun, 01 Nov 2015 02:13:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232606#M12250</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2015-11-01T02:13:24Z</dc:date>
    </item>
    <item>
      <title>Re: Cululative Curves</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232931#M12274</link>
      <description>&lt;P&gt;Thanks to both you guys for the quick turnaround. I really approeciate it.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I had another question on Cumulative Accuracy Profile which I will post shortly. Hope you guys can help.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards&lt;/P&gt;
&lt;P&gt;Gavin&lt;/P&gt;</description>
      <pubDate>Tue, 03 Nov 2015 17:34:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cululative-Curves/m-p/232931#M12274</guid>
      <dc:creator>mithusaini</dc:creator>
      <dc:date>2015-11-03T17:34:39Z</dc:date>
    </item>
  </channel>
</rss>

