<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Gini Coefficient - Variable Importance Measure in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212337#M11460</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;There is a whitepaper for selecting important variables in a linear regression model. The URL of the whitepaper is &lt;A href="http://support.sas.com/resources/papers/proceedings15/3242-2015.pdf" title="http://support.sas.com/resources/papers/proceedings15/3242-2015.pdf"&gt;http://support.sas.com/resources/papers/proceedings15/3242-2015.pdf&lt;/A&gt; .&lt;/P&gt;&lt;P&gt;It explains gini coefficient can be used to check linearity in the model. And we can also rank variable based on their GINI coefficient. &lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;A higher Gini coefficient suggests a higher potential for the variable to be useful in a &lt;/SPAN&gt;&lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;linear regression. &lt;/SPAN&gt;&lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;If a numeric variable is high on IV Rank but low on Gini coefficient &lt;/SPAN&gt;&lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;, it usually suggests a lack of linearity.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My Question - Is it the gini coefficient derived from decision tree? Or it is related to Area under Curve (AUC) -- (Gini = 2*AUC- 1)? What is the exact calculation of this Gini Coefficient and how it can be used to check linearity? I googled a lot. What i got is it is used in economics theory to check inequality.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any help would be highly appreciated. Thanks!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 23 Jun 2015 18:37:52 GMT</pubDate>
    <dc:creator>Ujjawal</dc:creator>
    <dc:date>2015-06-23T18:37:52Z</dc:date>
    <item>
      <title>Gini Coefficient - Variable Importance Measure</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212337#M11460</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;There is a whitepaper for selecting important variables in a linear regression model. The URL of the whitepaper is &lt;A href="http://support.sas.com/resources/papers/proceedings15/3242-2015.pdf" title="http://support.sas.com/resources/papers/proceedings15/3242-2015.pdf"&gt;http://support.sas.com/resources/papers/proceedings15/3242-2015.pdf&lt;/A&gt; .&lt;/P&gt;&lt;P&gt;It explains gini coefficient can be used to check linearity in the model. And we can also rank variable based on their GINI coefficient. &lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;A higher Gini coefficient suggests a higher potential for the variable to be useful in a &lt;/SPAN&gt;&lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;linear regression. &lt;/SPAN&gt;&lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;If a numeric variable is high on IV Rank but low on Gini coefficient &lt;/SPAN&gt;&lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;, it usually suggests a lack of linearity.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My Question - Is it the gini coefficient derived from decision tree? Or it is related to Area under Curve (AUC) -- (Gini = 2*AUC- 1)? What is the exact calculation of this Gini Coefficient and how it can be used to check linearity? I googled a lot. What i got is it is used in economics theory to check inequality.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any help would be highly appreciated. Thanks!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 23 Jun 2015 18:37:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212337#M11460</guid>
      <dc:creator>Ujjawal</dc:creator>
      <dc:date>2015-06-23T18:37:52Z</dc:date>
    </item>
    <item>
      <title>Re: Gini Coefficient - Variable Importance Measure</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212338#M11461</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Check proc univariate who can calculate GINI .&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 24 Jun 2015 13:14:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212338#M11461</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2015-06-24T13:14:12Z</dc:date>
    </item>
    <item>
      <title>Re: Gini Coefficient - Variable Importance Measure</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212339#M11462</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks Xia. The Gini that PROC UNIVARIATE produces is a measure of statistical dispersion. Correct me if i am wrong? &lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;A low Gini coefficient indicates a more equal distribution, with 0 corresponding to complete equality. How it can be used to check linearity? How it can be used in modeling process to select important linear variables?&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 24 Jun 2015 18:31:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212339#M11462</guid>
      <dc:creator>Ujjawal</dc:creator>
      <dc:date>2015-06-24T18:31:54Z</dc:date>
    </item>
    <item>
      <title>Re: Gini Coefficient - Variable Importance Measure</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212340#M11463</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Sorry. I will leave it to Steve&amp;nbsp; or lvm .&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 25 Jun 2015 11:46:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212340#M11463</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2015-06-25T11:46:03Z</dc:date>
    </item>
    <item>
      <title>Re: Gini Coefficient - Variable Importance Measure</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212341#M11464</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks Xia for looking into it.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 25 Jun 2015 21:01:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212341#M11464</guid>
      <dc:creator>Ujjawal</dc:creator>
      <dc:date>2015-06-25T21:01:46Z</dc:date>
    </item>
    <item>
      <title>Re: Gini Coefficient - Variable Importance Measure</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212342#M11465</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The Gini coefficient or Somers' D statistic gives a measure of concordance in logistic models.&amp;nbsp; It is a rank based statistic, where all results are paired (all observed with all predicted). In linear regression, it is a transformation of the Pearson correlation coefficient.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Here is a nice paper that covers a lot of what is buried in the SGF paper.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.imperial.ac.uk/nhli/r.newson/miscdocs/intsomd1.pdf"&gt;http://www.imperial.ac.uk/nhli/r.newson/miscdocs/intsomd1.pdf&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Jun 2015 17:35:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212342#M11465</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2015-06-26T17:35:18Z</dc:date>
    </item>
    <item>
      <title>Re: Gini Coefficient - Variable Importance Measure</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212343#M11466</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks a ton Steve for your answer. I know Somer's D and Gini Coefficient. Gini Coefficient = 2 (AUC -1) and AUC = %Concordance + 0.5 (Tied Pairs). It would be great if you share an article of&amp;nbsp;&amp;nbsp; "In &lt;SPAN style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; font-size: 13px; background-color: #ffffff;"&gt;linear regression, it is a transformation of the Pearson correlation coefficient.". I am more intersted about application of Gini Coefficient in linear regression. I did not find a single article to support it.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Am i correct? -&lt;/P&gt;&lt;P&gt;In logistic regression, if gini coefficient is high, logit function is monotonically related to independent variable?&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 13.3333330154419px;"&gt;In linear regression, if gini coefficient is high, y is linearly related to independent variable?&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Jun 2015 20:55:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212343#M11466</guid>
      <dc:creator>Ujjawal</dc:creator>
      <dc:date>2015-06-26T20:55:18Z</dc:date>
    </item>
    <item>
      <title>Re: Gini Coefficient - Variable Importance Measure</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212344#M11467</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Regarding correctness of interpretation, that is the way I would interpret it. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt; The quote is from the Imperial College paper I linked to.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 29 Jun 2015 12:51:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/212344#M11467</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2015-06-29T12:51:35Z</dc:date>
    </item>
    <item>
      <title>Re: Gini Coefficient - Variable Importance Measure</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/284101#M14993</link>
      <description>What Is the best GINI cut off value for selecting the significant variable. I do understand that if gini value is high, it's a good variable in separating goods and bads.</description>
      <pubDate>Wed, 13 Jul 2016 16:00:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Gini-Coefficient-Variable-Importance-Measure/m-p/284101#M14993</guid>
      <dc:creator>Srikanthg</dc:creator>
      <dc:date>2016-07-13T16:00:50Z</dc:date>
    </item>
  </channel>
</rss>

