<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Histogram bins are not helpful or insightful, what do I do in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Histogram-bins-are-not-helpful-or-insightful-what-do-I-do/m-p/388198#M20223</link>
    <description>&lt;P&gt;Have you tried changing the bins?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It's skewed because you have outliers. Can you group them into an "other" category, i.e., everything &amp;gt; 3M in one bucket?&lt;/P&gt;</description>
    <pubDate>Tue, 15 Aug 2017 15:37:12 GMT</pubDate>
    <dc:creator>Reeza</dc:creator>
    <dc:date>2017-08-15T15:37:12Z</dc:date>
    <item>
      <title>Histogram bins are not helpful or insightful, what do I do</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Histogram-bins-are-not-helpful-or-insightful-what-do-I-do/m-p/388197#M20222</link>
      <description>&lt;P&gt;I have a SAS data set that contains data on mortgages in the US. I want to make a histogram that shows the distribution of the loan amounts. I use the code below, but the histogram that comes out has all its bars squeezed against the left edge and is not helpful. Why is there so much blank space, and how can I fix this? I just want to take the bars on the left and spread them out across the chart so people can see which values they correspond to.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="histogram fail.png" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/14491i698E47F5CAE2C65E/image-size/large?v=v2&amp;amp;px=999" role="button" title="histogram fail.png" alt="histogram fail.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;title 'Analysis of Loan Amount';
ods graphics off;
proc univariate data=work.SampleData noprint;
   histogram Amount;
run;&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Tue, 15 Aug 2017 15:33:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Histogram-bins-are-not-helpful-or-insightful-what-do-I-do/m-p/388197#M20222</guid>
      <dc:creator>bzubrick</dc:creator>
      <dc:date>2017-08-15T15:33:53Z</dc:date>
    </item>
    <item>
      <title>Re: Histogram bins are not helpful or insightful, what do I do</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Histogram-bins-are-not-helpful-or-insightful-what-do-I-do/m-p/388198#M20223</link>
      <description>&lt;P&gt;Have you tried changing the bins?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It's skewed because you have outliers. Can you group them into an "other" category, i.e., everything &amp;gt; 3M in one bucket?&lt;/P&gt;</description>
      <pubDate>Tue, 15 Aug 2017 15:37:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Histogram-bins-are-not-helpful-or-insightful-what-do-I-do/m-p/388198#M20223</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2017-08-15T15:37:12Z</dc:date>
    </item>
    <item>
      <title>Re: Histogram bins are not helpful or insightful, what do I do</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Histogram-bins-are-not-helpful-or-insightful-what-do-I-do/m-p/388214#M20225</link>
      <description>&lt;P&gt;One quick and dirty approach would be to just trim out the high end at some "reasonable" value.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="token procnames"&gt;proc&lt;/SPAN&gt; &lt;SPAN class="token procnames"&gt;univariate&lt;/SPAN&gt; &lt;SPAN class="token procnames"&gt;data&lt;/SPAN&gt;&lt;SPAN class="token operator"&gt;=&lt;/SPAN&gt;work&lt;SPAN class="token punctuation"&gt;.&lt;/SPAN&gt;SampleData (where=(amount le 2000000)) noprint;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For instance, this would discard values greater than 2,000,000.&lt;/P&gt;
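&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A minimal sketch of the complete step, assuming the data set and variable names from the original question. The 2,000,000 cutoff and the 100,000 bin width are illustrative values only, not recommendations, and the endpoints= range assumes Amount is never negative.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Sketch only: the cutoff and bin width below are assumed values */
title 'Analysis of Loan Amount';
proc univariate data=work.SampleData(where=(Amount le 2000000)) noprint;
   /* endpoints= sets explicit bin boundaries over the trimmed range */
   histogram Amount / endpoints=(0 to 2000000 by 100000);
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Keep in mind that the where= data set option drops the large loans from the analysis entirely, so the chart title or a footnote should probably mention the cutoff.&lt;/P&gt;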
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Another option would be to use Proc Sgplot with a histogram statement and specify the binwidth you want, such as 10000 or 20000. You may still need to remove the outliers, as the maximum number of bins allowed is 10,000. Since you apparently have some values near 20,000,000, your minimum binwidth would have to be about 2,000 or larger (20,000,000 / 10,000 bins).&lt;/P&gt;</description>
      <pubDate>Tue, 15 Aug 2017 16:34:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Histogram-bins-are-not-helpful-or-insightful-what-do-I-do/m-p/388214#M20225</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2017-08-15T16:34:59Z</dc:date>
    </item>
  </channel>
</rss>

