<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Keep rows that are mostly complete in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724221#M19834</link>
    <description>&lt;P&gt;Yes.&lt;/P&gt;</description>
    <pubDate>Sat, 06 Mar 2021 19:14:16 GMT</pubDate>
    <dc:creator>amarikow57</dc:creator>
    <dc:date>2021-03-06T19:14:16Z</dc:date>
    <item>
      <title>Keep rows that are mostly complete</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724206#M19830</link>
      <description>&lt;P&gt;I have a dataset with lots of missing data. Rather than impute, I was informed to build two logistic regression&amp;nbsp;prediction models: (1) include only variables that most participants have observations on and (2) include all variables with participants who have most observations.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Originally I was going to use cmiss(of _ALL_) = 0 to obtain complete cases for the second model, but this left me with an incredibly small sample size. Is there a way to include participants that contain say 90% of the information filled out? That is, if I have 20 variables (columns), then only participants who have information in at least 18 of those columns are kept.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Conversely, is there a way to only keep variables that contain 90% of the information? That is, if I have 50 participants (rows), then variables that contain at least 45 observations will be kept. I suppose I can manually look and drop variable names, but I was hoping for something more efficient.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you in advance!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 06 Mar 2021 17:42:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724206#M19830</guid>
      <dc:creator>amarikow57</dc:creator>
      <dc:date>2021-03-06T17:42:21Z</dc:date>
    </item>
    <item>
      <title>Re: Keep rows that are mostly complete</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724208#M19831</link>
      <description>&lt;P&gt;how about:&amp;nbsp;&lt;SPAN&gt;cmiss(of _ALL_)/20 &amp;lt;= 0.1?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 06 Mar 2021 17:51:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724208#M19831</guid>
      <dc:creator>yabwon</dc:creator>
      <dc:date>2021-03-06T17:51:12Z</dc:date>
    </item>
    <item>
      <title>Re: Keep rows that are mostly complete</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724212#M19832</link>
      <description>&lt;P&gt;Hahaha. The answer was so obvious. Thank you!&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Do you know how to do a similar thing for columns instead of rows? Would transposing be the best option?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 06 Mar 2021 18:23:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724212#M19832</guid>
      <dc:creator>amarikow57</dc:creator>
      <dc:date>2021-03-06T18:23:42Z</dc:date>
    </item>
    <item>
      <title>Re: Keep rows that are mostly complete</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724213#M19833</link>
      <description>&lt;P&gt;What do you mean by "&lt;SPAN&gt;similar thing for columns"? Drop column if it has less then 90% of non-missing?&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;B&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 06 Mar 2021 18:28:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724213#M19833</guid>
      <dc:creator>yabwon</dc:creator>
      <dc:date>2021-03-06T18:28:43Z</dc:date>
    </item>
    <item>
      <title>Re: Keep rows that are mostly complete</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724221#M19834</link>
      <description>&lt;P&gt;Yes.&lt;/P&gt;</description>
      <pubDate>Sat, 06 Mar 2021 19:14:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724221#M19834</guid>
      <dc:creator>amarikow57</dc:creator>
      <dc:date>2021-03-06T19:14:16Z</dc:date>
    </item>
    <item>
      <title>Re: Keep rows that are mostly complete</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724232#M19835</link>
      <description>&lt;P&gt;I have been faced with this problem at some occasions where you have to decide on the combination of variables and observations to retain for an analysis that excludes observations with missing values. I wrote a macro that finds ALL of those combinations, called complete subsets, to help you choose. You might choose, for example, the largest subset (nbVars x nbObs) that includes some key variables.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The macro is described here:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/Finding-a-complete-sub-matrix-aka-finding-maximal-bicliques/ta-p/285383" target="_self"&gt;https://communities.sas.com/t5/SAS-Communities-Library/Finding-a-complete-sub-matrix-aka-finding-maximal-bicliques/ta-p/285383&lt;/A&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Good luck.&lt;/P&gt;</description>
      <pubDate>Sat, 06 Mar 2021 23:06:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724232#M19835</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2021-03-06T23:06:59Z</dc:date>
    </item>
    <item>
      <title>Re: Keep rows that are mostly complete</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724233#M19836</link>
      <description>&lt;P&gt;If my variable is stored as numeric but to be treated as categorical in analysis, does this macro still work okay with it?&lt;/P&gt;</description>
      <pubDate>Sat, 06 Mar 2021 23:23:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724233#M19836</guid>
      <dc:creator>amarikow57</dc:creator>
      <dc:date>2021-03-06T23:23:29Z</dc:date>
    </item>
    <item>
      <title>Re: Keep rows that are mostly complete</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724235#M19837</link>
      <description>&lt;P&gt;Ah. Nevermind. It seems SAS University does not like OPTNET. Thank you though.&lt;/P&gt;</description>
      <pubDate>Sat, 06 Mar 2021 23:33:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724235#M19837</guid>
      <dc:creator>amarikow57</dc:creator>
      <dc:date>2021-03-06T23:33:38Z</dc:date>
    </item>
    <item>
      <title>Re: Keep rows that are mostly complete</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724236#M19838</link>
      <description>&lt;P&gt;Please note that proc optnet was introduced with SAS 9.3TS1M2.&lt;/P&gt;</description>
      <pubDate>Sun, 07 Mar 2021 00:23:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Keep-rows-that-are-mostly-complete/m-p/724236#M19838</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2021-03-07T00:23:43Z</dc:date>
    </item>
  </channel>
</rss>

