<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: PROC MI: Predictive Mean Matching is Running slowly on Large Dataset in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/754336#M36713</link>
    <description>&lt;P&gt;Is this a duplicate of&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://communities.sas.com/t5/Statistical-Procedures/proc-mi-produces-no-log/m-p/754134" target="_blank"&gt;https://communities.sas.com/t5/Statistical-Procedures/proc-mi-produces-no-log/m-p/754134&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;If not, please let us know how this question is different.&lt;/P&gt;</description>
    <pubDate>Thu, 15 Jul 2021 14:11:00 GMT</pubDate>
    <dc:creator>Rick_SAS</dc:creator>
    <dc:date>2021-07-15T14:11:00Z</dc:date>
    <item>
      <title>PROC MI: Predictive Mean Matching is Running slowly on Large Dataset</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/753923#M36685</link>
      <description>&lt;P&gt;I'm trying to impute 6 variables using PROC MI and the fully conditional specification (FCS) method.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Two of these variables are attendance rates bounded by 0 and 100. I have used PROC MI with FCS to impute values, which works fine except that some imputed attendance values end up well above 100. I've tried setting MAX = 100, but then the algorithm stops. I then tried predictive mean matching for the attendance variables. However, the dataset is very large, so this takes a very long time. Is there any way to speed up predictive mean matching?&lt;/P&gt;</description>
      <pubDate>Tue, 13 Jul 2021 20:38:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/753923#M36685</guid>
      <dc:creator>whs278</dc:creator>
      <dc:date>2021-07-13T20:38:32Z</dc:date>
    </item>
    <item>
      <title>Re: PROC MI: Predictive Mean Matching is Running slowly on Large Dataset</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/753925#M36686</link>
      <description>&lt;P&gt;How large is "large"? A large number of observations takes time.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It may help to include the code you are running, so that we don't suggest options you are already using and can give hints that are more likely to apply.&lt;/P&gt;</description>
      <pubDate>Tue, 13 Jul 2021 20:45:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/753925#M36686</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2021-07-13T20:45:13Z</dc:date>
    </item>
    <item>
      <title>Re: PROC MI: Predictive Mean Matching is Running slowly on Large Dataset</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/753934#M36687</link>
      <description>&lt;P&gt;There are 757,400 observations.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The code below takes about a minute with 50,000 observations. However, the procedure seems to take forever (at least 2 hours) once I go beyond about 70,000 observations, which is only 1/10 of the data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* OBS = 50000 limits the run to 50,000 observations for timing */
PROC MI DATA = HSPSTUID(OBS = 50000) NIMPUTE = 1 OUT = HSPSTU_MI1;
	CLASS ETHCAT GENCAT OVERAGEG09 REGMTHP65G08P;
	/* REGPMM for the two attendance rates, REG for the four other continuous variables */
	FCS NBITER = 1 REGPMM (ATTPCTROLG08MI);
	FCS NBITER = 1 REG (ELASSCZG08MI);
	FCS NBITER = 1 REG (MTHSSCZG07MI);
	FCS NBITER = 1 REGPMM (ATTPCTROLG07MI);
	FCS NBITER = 1 REG (ELASSCZG07MI);
	FCS NBITER = 1 REG (MTHSSCZG08MI);
	VAR ETHCAT GENCAT OVERAGEG09 REGMTHP65G08P ATTPCTROLG08MI ELASSCZG08MI MTHSSCZG07MI ATTPCTROLG07MI ELASSCZG07MI MTHSSCZG08MI;
	WHERE IN_MI = 1;
RUN;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;None of the variables in the CLASS statement have any missing data. The two attendance variables use REGPMM and the other four continuous variables use REG. I've ordered the FCS statements by the number of missing observations in each variable.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I purposely set NIMPUTE = 1. I know multiple imputation is superior, but we're currently sticking with single imputation. In any case, I imagine multiple imputation would take even longer.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for your help.&lt;/P&gt;</description>
      <pubDate>Tue, 13 Jul 2021 21:04:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/753934#M36687</guid>
      <dc:creator>whs278</dc:creator>
      <dc:date>2021-07-13T21:04:43Z</dc:date>
    </item>
    <item>
      <title>Re: PROC MI: Predictive Mean Matching is Running slowly on Large Dataset</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/754308#M36711</link>
      <description>&lt;P&gt;Since both of the imputed attendance variables are bounded by 0 and 100, would you feel comfortable imputing with the FCS method and then post-processing the two variables so that they have a minimum of 0 and a maximum of 100, as those are the limits of what could actually be measured?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
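&lt;P&gt;As a rough sketch of that post-processing step, something like the DATA step below could be used. It assumes the imputed data set is HSPSTU_MI1 and that the two attendance variables are ATTPCTROLG08MI and ATTPCTROLG07MI, as in the code posted earlier in this thread; the output name HSPSTU_MI1_CLAMPED and the loop index I are placeholders, not names from the original code. It also assumes the attendance variables are imputed with REG rather than REGPMM, so that out-of-range values can occur in the first place.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Sketch only: clamp imputed attendance rates to the range 0 to 100. */
/* HSPSTU_MI1_CLAMPED is a hypothetical output data set name.          */
DATA HSPSTU_MI1_CLAMPED;
	SET HSPSTU_MI1;
	ARRAY ATT{*} ATTPCTROLG08MI ATTPCTROLG07MI;
	DO I = 1 TO DIM(ATT);
		/* Skip missing values: MAX(., 0) would otherwise turn them into 0 */
		IF NOT MISSING(ATT{I}) THEN ATT{I} = MIN(MAX(ATT{I}, 0), 100);
	END;
	DROP I;
RUN;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Note that clamping piles the out-of-range imputations up at exactly 0 and 100, so it is worth checking how many values are affected before relying on the result.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;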
&lt;P&gt;SteveDenham&lt;/P&gt;</description>
      <pubDate>Thu, 15 Jul 2021 12:48:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/754308#M36711</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2021-07-15T12:48:24Z</dc:date>
    </item>
    <item>
      <title>Re: PROC MI: Predicitive Mean Matching is Running slowly on Large Dataset</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/754336#M36713</link>
      <description>&lt;P&gt;Is this a duplicate of&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://communities.sas.com/t5/Statistical-Procedures/proc-mi-produces-no-log/m-p/754134" target="_blank"&gt;https://communities.sas.com/t5/Statistical-Procedures/proc-mi-produces-no-log/m-p/754134&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;If not, please let us know how this question is different.&lt;/P&gt;</description>
      <pubDate>Thu, 15 Jul 2021 14:11:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-MI-Predicitive-Mean-Matching-is-Running-slowly-on-Large/m-p/754336#M36713</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2021-07-15T14:11:00Z</dc:date>
    </item>
  </channel>
</rss>

