<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Count Methodology for imputation? in New SAS User</title>
    <link>https://communities.sas.com/t5/New-SAS-User/Count-Methodology-for-imputation/m-p/555239#M9641</link>
    <description>&lt;P&gt;For numeric variables, you can use proc stdize but I have never seen documentation on character variables.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ie&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;proc stdize data=train&lt;/P&gt;&lt;P&gt;method=median out=traini&lt;/P&gt;&lt;P&gt;var var1&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;</description>
    <pubDate>Tue, 30 Apr 2019 22:29:37 GMT</pubDate>
    <dc:creator>edasdfasdfasdfa</dc:creator>
    <dc:date>2019-04-30T22:29:37Z</dc:date>
    <item>
      <title>Count Methodology for imputation?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Count-Methodology-for-imputation/m-p/555234#M9638</link>
      <description>&lt;P&gt;&lt;SPAN&gt;I read the following (below) in some article on here:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;For categorical variables, the most common methodology is “count” wherein you fill the missing values with the most common level of the categorical&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="lia-search-match-lithium"&gt;variable&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;How is this performed? I can't find any information on it.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 30 Apr 2019 22:14:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Count-Methodology-for-imputation/m-p/555234#M9638</guid>
      <dc:creator>edasdfasdfasdfa</dc:creator>
      <dc:date>2019-04-30T22:14:26Z</dc:date>
    </item>
    <item>
      <title>Re: Count Methodology for imputation?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Count-Methodology-for-imputation/m-p/555238#M9640</link>
      <description>&lt;P&gt;One very crude method: Proc Freq plus a data step. Find the most frequent occurrence using proc freq&amp;nbsp;then something like:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Data want;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp; set have;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp; if missing(var) then var='mostcommonvalue';&lt;/P&gt;
&lt;P&gt;run;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Similar for replacing with a Mean value, proc means/summary to get the mean and replace missing values.&lt;/P&gt;</description>
      <pubDate>Tue, 30 Apr 2019 22:26:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Count-Methodology-for-imputation/m-p/555238#M9640</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2019-04-30T22:26:28Z</dc:date>
    </item>
    <item>
      <title>Re: Count Methodology for imputation?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Count-Methodology-for-imputation/m-p/555239#M9641</link>
      <description>&lt;P&gt;For numeric variables, you can use proc stdize but I have never seen documentation on character variables.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ie&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;proc stdize data=train&lt;/P&gt;&lt;P&gt;method=median out=traini&lt;/P&gt;&lt;P&gt;var var1&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;</description>
      <pubDate>Tue, 30 Apr 2019 22:29:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Count-Methodology-for-imputation/m-p/555239#M9641</guid>
      <dc:creator>edasdfasdfasdfa</dc:creator>
      <dc:date>2019-04-30T22:29:37Z</dc:date>
    </item>
    <item>
      <title>Re: Count Methodology for imputation?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Count-Methodology-for-imputation/m-p/555241#M9643</link>
      <description>&lt;P&gt;You need to first understand how and why the values are missing before you can say what an appropriate method is. Using the largest group isn't a great method. An alternative is to actually model the data to predict the category - using logistic regression or discriminant analysis. These are both covered in PROC MI and both have examples in the documentation, 79.4 &amp;amp; 79.5 Examples&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://documentation.sas.com/?docsetId=statug&amp;amp;docsetTarget=statug_mi_examples04.htm&amp;amp;docsetVersion=15.1&amp;amp;locale=en"&gt;https://documentation.sas.com/?docsetId=statug&amp;amp;docsetTarget=statug_mi_examples04.htm&amp;amp;docsetVersion=15.1&amp;amp;locale=en&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/201537"&gt;@edasdfasdfasdfa&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;&lt;SPAN&gt;I read the following (below) in some article on here:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;For categorical variables, the most common methodology is “count” wherein you fill the missing values with the most common level of the categorical&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="lia-search-match-lithium"&gt;variable&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;How is this performed? I can't find any information on it.&lt;/SPAN&gt;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 30 Apr 2019 22:39:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Count-Methodology-for-imputation/m-p/555241#M9643</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-04-30T22:39:22Z</dc:date>
    </item>
  </channel>
</rss>

