<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic What imputation methods should I use for our study for multiple imputation (Proc MI)? Seizure count in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928066#M46213</link>
    <description>&lt;P&gt;Hi:&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am working on a study. It has been planned to do multiple imputation for missing values related to the primary endpoint by an agency's requirements. I have read many materials online and have some ideas. But, I am still not sure.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Study information: The study's indication is Epilepsy. Each patient is given a diary and they are supposed to record how many seizures they expeirenced each day during the study period (DB period is ~ 85 days).&amp;nbsp; As you can image, some patients may forget to record their seizures on some days and some patients may discontinue from the study before the Day 85. So there are missing values for seizure counts on some days for some subjects.&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My questions:&lt;/P&gt;&lt;P&gt;1.&amp;nbsp;Our missing type is considered as 'missing as random' and so this procedure(Proc MI) can be used, correct?&lt;/P&gt;&lt;P&gt;2. Since our missing data is only in one variable, i.e., seizure count, so I think I should NOT use the methods in SAS documentation (Imputation Methods, Table 5) with 'monotone', correct?&lt;/P&gt;&lt;P&gt;3. Since our data is 'seizure count', which should follow poission distribution (correct?), not normal, I should NOT use methods with 'MCMC' since MCMC method is based on the assumption of multivariate normal distribution (MVN) for variables, correct?&lt;/P&gt;&lt;P&gt;4. Then I thought I should use FCS,&amp;nbsp; fcs reg, or fcs regpmm. I read SAS documentation, it has "&lt;SPAN&gt;The predictive mean matching method ensures that imputed values are plausible; it might be more appropriate than the regression method if the normality assumption is violated (Horton and Lipsitz&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://documentation.sas.com/doc/en/statug/15.2/statug_mi_references.htm#statug_mihort_n01" target="_blank" rel="noopener"&gt;2001&lt;/A&gt;&lt;SPAN&gt;, p. 246)." So I thought I should use 'fcs regpmm'. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I also tried 'fcs reg', the imputed values gives non-integer, a number with decimal. It seems it does not fit my case. Our seizure is an count; so it should be an interger. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;If I use 'fcs regpmm', the imputed values are integers.&amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;5. If using 'fcs regpmm' is correct for my case, what number of 'k' (SAS option with 'fcs regpmm' option) should I pick?&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Here is the code I use.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;proc mi data = post nimpute = 25 out = post_mi&amp;nbsp; seed = 54321 noprint;&amp;nbsp;&lt;BR /&gt;by subjid;&lt;BR /&gt;var qsdy count;'&lt;BR /&gt;fcs regpmm (/k = 5);&lt;BR /&gt;run;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note: 'qsdy' is the study Day variable; it is from Day 1 till Day 85. 'count' is seizure count for each day. There are missings in this variable.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note: since the imputation is by subjid, so covariates such as age, treatment, etc, are not needed (no change for an individual), correct?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If any detailed information is needed for this discussion, please ask me.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks a lot in advance.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Xiaoshu&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sun, 12 May 2024 23:17:17 GMT</pubDate>
    <dc:creator>JanetXu</dc:creator>
    <dc:date>2024-05-12T23:17:17Z</dc:date>
    <item>
      <title>What imputation methods should I use for our study for multiple imputation (Proc MI)? Seizure count</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928066#M46213</link>
      <description>&lt;P&gt;Hi:&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am working on a study. It has been planned to do multiple imputation for missing values related to the primary endpoint by an agency's requirements. I have read many materials online and have some ideas. But, I am still not sure.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Study information: The study's indication is Epilepsy. Each patient is given a diary and they are supposed to record how many seizures they expeirenced each day during the study period (DB period is ~ 85 days).&amp;nbsp; As you can image, some patients may forget to record their seizures on some days and some patients may discontinue from the study before the Day 85. So there are missing values for seizure counts on some days for some subjects.&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My questions:&lt;/P&gt;&lt;P&gt;1.&amp;nbsp;Our missing type is considered as 'missing as random' and so this procedure(Proc MI) can be used, correct?&lt;/P&gt;&lt;P&gt;2. Since our missing data is only in one variable, i.e., seizure count, so I think I should NOT use the methods in SAS documentation (Imputation Methods, Table 5) with 'monotone', correct?&lt;/P&gt;&lt;P&gt;3. Since our data is 'seizure count', which should follow poission distribution (correct?), not normal, I should NOT use methods with 'MCMC' since MCMC method is based on the assumption of multivariate normal distribution (MVN) for variables, correct?&lt;/P&gt;&lt;P&gt;4. Then I thought I should use FCS,&amp;nbsp; fcs reg, or fcs regpmm. I read SAS documentation, it has "&lt;SPAN&gt;The predictive mean matching method ensures that imputed values are plausible; it might be more appropriate than the regression method if the normality assumption is violated (Horton and Lipsitz&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://documentation.sas.com/doc/en/statug/15.2/statug_mi_references.htm#statug_mihort_n01" target="_blank" rel="noopener"&gt;2001&lt;/A&gt;&lt;SPAN&gt;, p. 246)." So I thought I should use 'fcs regpmm'. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I also tried 'fcs reg', the imputed values gives non-integer, a number with decimal. It seems it does not fit my case. Our seizure is an count; so it should be an interger. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;If I use 'fcs regpmm', the imputed values are integers.&amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;5. If using 'fcs regpmm' is correct for my case, what number of 'k' (SAS option with 'fcs regpmm' option) should I pick?&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Here is the code I use.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;proc mi data = post nimpute = 25 out = post_mi&amp;nbsp; seed = 54321 noprint;&amp;nbsp;&lt;BR /&gt;by subjid;&lt;BR /&gt;var qsdy count;'&lt;BR /&gt;fcs regpmm (/k = 5);&lt;BR /&gt;run;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note: 'qsdy' is the study Day variable; it is from Day 1 till Day 85. 'count' is seizure count for each day. There are missings in this variable.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note: since the imputation is by subjid, so covariates such as age, treatment, etc, are not needed (no change for an individual), correct?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If any detailed information is needed for this discussion, please ask me.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks a lot in advance.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Xiaoshu&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 12 May 2024 23:17:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928066#M46213</guid>
      <dc:creator>JanetXu</dc:creator>
      <dc:date>2024-05-12T23:17:17Z</dc:date>
    </item>
    <item>
      <title>Re: What imputation methods should I use for our study for multiple imputation (Proc MI)? Seizure co</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928168#M46216</link>
      <description>&lt;P&gt;Proc MI does not have a specific option for dealing with count (poisson) data.&amp;nbsp; To circumvent this you could use the REGPMM method on a MONOTONE statement.&amp;nbsp; This will lead to only integer values being selected as imputed values.&lt;/P&gt;
&lt;P data-unlink="true"&gt;As far as the value of K= that you should select, there is not really an ideal value, but instead there is a tradeoff of sorts.&amp;nbsp;&amp;nbsp;&lt;SPAN&gt;A smaller&amp;nbsp;K= value&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;tends to increase the correlation among the multiple imputations for the missing observation and results in a higher variability of point estimators in repeated sampling. On the other hand, a larger&amp;nbsp;K= value&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;tends to lessen the effect from the imputation model and results in biased estimators.&amp;nbsp; There is a good discussion in&amp;nbsp; Schenker and Taylor&amp;nbsp;(&lt;/SPAN&gt;1996&amp;nbsp;) on&amp;nbsp;&lt;SPAN&gt;&amp;nbsp;p. 430.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 13 May 2024 16:42:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928168#M46216</guid>
      <dc:creator>SAS_Rob</dc:creator>
      <dc:date>2024-05-13T16:42:50Z</dc:date>
    </item>
    <item>
      <title>Re: What imputation methods should I use for our study for multiple imputation (Proc MI)? Seizure co</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928215#M46221</link>
      <description>&lt;P&gt;Thanks so much for your reply.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a further question. Why would my data be considered as monotone? I have only one varialbe which has missing. Can you please help me out? I should not use 'fcs'?&lt;/P&gt;&lt;P&gt;in SAS code,&amp;nbsp;&lt;/P&gt;&lt;P&gt;not 'fcs regpmm', but 'monotone regpmm'?&lt;/P&gt;&lt;P&gt;Thanks a lot!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Xiaoshu&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 13 May 2024 22:07:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928215#M46221</guid>
      <dc:creator>JanetXu</dc:creator>
      <dc:date>2024-05-13T22:07:45Z</dc:date>
    </item>
    <item>
      <title>Re: What imputation methods should I use for our study for multiple imputation (Proc MI)? Seizure co</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928343#M46229</link>
      <description>&lt;P&gt;With one variable that has missingness you can always create a monotone missing data pattern by listing that variable last on the VAR statement.&lt;/P&gt;
&lt;P&gt;Since you only have one variable with a missing value, it really doesn't matter whether you use the MONOTONE or FCS statement.&amp;nbsp; But in general, you can always use the FCS statement for any missing data pattern (including monotone).&amp;nbsp; It would be preferred however to use the MONOTONE method whenever possible because it is simpler with respect to the number of imputation models that would need to be run.&lt;/P&gt;</description>
      <pubDate>Tue, 14 May 2024 17:11:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928343#M46229</guid>
      <dc:creator>SAS_Rob</dc:creator>
      <dc:date>2024-05-14T17:11:35Z</dc:date>
    </item>
    <item>
      <title>Re: What imputation methods should I use for our study for multiple imputation (Proc MI)? Seizure co</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928399#M46248</link>
      <description>&lt;P&gt;Thank you so much for your reply. I am clear now. One of the key word is 'last'. I should put the variable with missing values (seizure count) in the last if I use monotone.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 14 May 2024 22:09:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-imputation-methods-should-I-use-for-our-study-for-multiple/m-p/928399#M46248</guid>
      <dc:creator>JanetXu</dc:creator>
      <dc:date>2024-05-14T22:09:39Z</dc:date>
    </item>
  </channel>
</rss>

