<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Plot of summary stats of Y against numeric variable X for big dataset in Graphics Programming</title>
    <link>https://communities.sas.com/t5/Graphics-Programming/Plot-of-summary-stats-of-Y-against-numeric-variable-X-for-big/m-p/526966#M17666</link>
    <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/147629"&gt;@BingyiYang&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;I often need to plot out the mean/sum of a continuous&amp;nbsp;variable (Y) against the buckets of another continuous&amp;nbsp;variable (X), by a third grouping variable (Z).&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;Note that the big datasets are typically&amp;nbsp;huge; it is impractical to scatter plot every point. That's why we need to group X into buckets and calculate bucket means first for the plots.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;We can do it by using proc rank on X first then calculate buckets for X and Y. But I am wondering if any SAS plot procs can do this automatically. Thanks. &lt;/SPAN&gt;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Without knowing what type of plot or what your data currently looks like it is hard to provide any explicit example.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The SGPLOT HBAR and VBAR plots will do Freq, Mean, Median and Sum statistics for a given Response variable for levels of a group variable. "Bins" from a variable can often be created using a format.&lt;/P&gt;</description>
    <pubDate>Mon, 14 Jan 2019 15:45:37 GMT</pubDate>
    <dc:creator>ballardw</dc:creator>
    <dc:date>2019-01-14T15:45:37Z</dc:date>
    <item>
      <title>Plot of summary stats of Y against numeric variable X for big dataset</title>
      <link>https://communities.sas.com/t5/Graphics-Programming/Plot-of-summary-stats-of-Y-against-numeric-variable-X-for-big/m-p/526813#M17661</link>
      <description>&lt;P&gt;&lt;SPAN style="font-size: 10.5pt; font-family: 'Helvetica Neue'; color: #333333;"&gt;I often need to plot out the mean/sum of a continuous&amp;nbsp;variable (Y) against the buckets of another continuous&amp;nbsp;variable (X), by a third grouping variable (Z).&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 10.5pt; font-family: 'Helvetica Neue'; color: #333333;"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 10.5pt; font-family: 'Helvetica Neue'; color: #333333;"&gt;Note that the big datasets are typically&amp;nbsp;huge; it is impractical to scatter plot every point. That's why we need to group X into buckets and calculate bucket means first for the plots.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 10.5pt; font-family: 'Helvetica Neue'; color: #333333;"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 10.5pt; font-family: 'Helvetica Neue'; color: #333333;"&gt;We can do it by using proc rank on X first then calculate buckets for X and Y. But I am wondering if any SAS plot procs can do this automatically. Thanks. &lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 01:42:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Graphics-Programming/Plot-of-summary-stats-of-Y-against-numeric-variable-X-for-big/m-p/526813#M17661</guid>
      <dc:creator>BingyiYang</dc:creator>
      <dc:date>2019-01-14T01:42:58Z</dc:date>
    </item>
    <item>
      <title>Re: Plot of summary stats of Y against numeric variable X for big dataset</title>
      <link>https://communities.sas.com/t5/Graphics-Programming/Plot-of-summary-stats-of-Y-against-numeric-variable-X-for-big/m-p/526853#M17662</link>
      <description>&lt;P&gt;Look at proc hpbin as a faster and more flexible alternative to proc rank.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 06:20:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Graphics-Programming/Plot-of-summary-stats-of-Y-against-numeric-variable-X-for-big/m-p/526853#M17662</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2019-01-14T06:20:22Z</dc:date>
    </item>
    <item>
      <title>Re: Plot of summary stats of Y against numeric variable X for big dataset</title>
      <link>https://communities.sas.com/t5/Graphics-Programming/Plot-of-summary-stats-of-Y-against-numeric-variable-X-for-big/m-p/526966#M17666</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/147629"&gt;@BingyiYang&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;I often need to plot out the mean/sum of a continuous&amp;nbsp;variable (Y) against the buckets of another continuous&amp;nbsp;variable (X), by a third grouping variable (Z).&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;Note that the big datasets are typically&amp;nbsp;huge; it is impractical to scatter plot every point. That's why we need to group X into buckets and calculate bucket means first for the plots.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Helvetica Neue&amp;quot;; font-size: 10.5pt;"&gt;We can do it by using proc rank on X first then calculate buckets for X and Y. But I am wondering if any SAS plot procs can do this automatically. Thanks. &lt;/SPAN&gt;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Without knowing what type of plot or what your data currently looks like it is hard to provide any explicit example.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The SGPLOT HBAR and VBAR plots will do Freq, Mean, Median and Sum statistics for a given Response variable for levels of a group variable. "Bins" from a variable can often be created using a format.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 15:45:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Graphics-Programming/Plot-of-summary-stats-of-Y-against-numeric-variable-X-for-big/m-p/526966#M17666</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2019-01-14T15:45:37Z</dc:date>
    </item>
    <item>
      <title>Re: Plot of summary stats of Y against numeric variable X for big dataset</title>
      <link>https://communities.sas.com/t5/Graphics-Programming/Plot-of-summary-stats-of-Y-against-numeric-variable-X-for-big/m-p/527210#M17672</link>
      <description>&lt;P&gt;Thanks all for your response. I was wondering if there is a way that SAS would convert numeric X to buckets and then plot out Freq/Mean/Sum of Y against bucket means of X automatically. This is a typical data visualization&amp;nbsp;of Y ~ X when dataset&amp;nbsp;is huge. Thanks.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 15 Jan 2019 04:43:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Graphics-Programming/Plot-of-summary-stats-of-Y-against-numeric-variable-X-for-big/m-p/527210#M17672</guid>
      <dc:creator>BingyiYang</dc:creator>
      <dc:date>2019-01-15T04:43:41Z</dc:date>
    </item>
  </channel>
</rss>

