<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Missing Value Imputation in SAS/IML Software and Matrix Computations</title>
    <link>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/899538#M6093</link>
    <description>Thank you for your kind words, Season. Now that I am retired, I have the&lt;BR /&gt;luxury to investigate topics of interest in detail and I can then describe&lt;BR /&gt;my results for the user community to enjoy.&lt;BR /&gt;&lt;BR /&gt;I have no real experience with MI. I looked through the SAS description of&lt;BR /&gt;PROC MI and saw that the MI algorithm is a very intricate procedure with&lt;BR /&gt;which to perform missing value imputation.&lt;BR /&gt;&lt;BR /&gt;I wrote a paper on fuzzy c-means imputation which I posted to the SAS&lt;BR /&gt;Community:&lt;BR /&gt;&lt;A href="https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/790785" target="_blank"&gt;https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/790785&lt;/A&gt;.&lt;BR /&gt;It deals with missing value imputation quite effectively and, IMHO, much&lt;BR /&gt;more simply than the MI procedure. It is a popular tool and there is much&lt;BR /&gt;support for it in the published literature.&lt;BR /&gt;&lt;BR /&gt;HTH,&lt;BR /&gt;Ross&lt;BR /&gt;</description>
    <pubDate>Sat, 21 Oct 2023 15:20:54 GMT</pubDate>
    <dc:creator>rbettinger</dc:creator>
    <dc:date>2023-10-21T15:20:54Z</dc:date>
    <item>
      <title>Missing Value Imputation</title>
      <link>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/790785#M5749</link>
      <description>&lt;P&gt;I have written a paper, "Missing Value Imputation", that I presented at the October Southeast SAS Users' Group 2021 meeting. It contains a historical summary of attempts to perform imputation of missing values, a detailed description of the mechanisms of missingness, and an application of the fuzzy c-means algorithm (FCM) to Census data to perform missing value imputation.&amp;nbsp; I would like to share it with the SAS community. There are two macros, %FCM and %FCM_IMPUTE, which are included as attachments to this post. If you cannot access them, contact me and I will send them to you.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here is the paper, which I have copied into this post for those of you who cannot access the PDF document of the original paper.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="rbettinger_0-1642550546698.png" style="width: 999px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/67487iDF267AA24ED26567/image-size/large?v=v2&amp;amp;px=999" role="button" title="rbettinger_0-1642550546698.png" alt="rbettinger_0-1642550546698.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Unfortunately, I cannot paste the entirety of the document into this post. Please contact me directly and I will send you the PDF of the paper and the macros.&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jan 2022 00:07:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/790785#M5749</guid>
      <dc:creator>rbettinger</dc:creator>
      <dc:date>2022-01-19T00:07:54Z</dc:date>
    </item>
    <item>
      <title>Re: Missing Value Imputation</title>
      <link>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/790854#M5750</link>
      <description>&lt;P&gt;Thanks for this post. I look forward to reading it. This seems like a good candidate for an article for the&amp;nbsp;&lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/tkb-p/library" target="_self"&gt;SAS Communities Library.&lt;/A&gt;&amp;nbsp;The Library often hosts papers and the associated code for SAS-related content. You might want to look into it.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I took a quick look at the code, and I'd like to suggest that you do not need either of the user-defined functions in your PROC IML step.&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;The DIST_FNC module can be replaced by using &lt;A href="https://go.documentation.sas.com/doc/en/pgmsascdc/v_013/imlug/imlug_langref_sect125.htm" target="_self"&gt;the DISTANCE function in IML&lt;/A&gt;. (This capability of the DISTANCE function was released as part of SAS 9.4M5.) For background and a different "manual" implementation, see &lt;A href="https://blogs.sas.com/content/iml/2016/09/28/distance-between-two-group.html" target="_self"&gt;"Distances between observations in two groups."&lt;/A&gt;&amp;nbsp;&lt;/LI&gt;
&lt;LI&gt;The&amp;nbsp;MATVEC_SS module can be replaced by the one-liner &lt;STRONG&gt;SSQ(X - c)&lt;/STRONG&gt;, where c is the row vector.&lt;/LI&gt;
&lt;/UL&gt;
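&lt;P&gt;To make the arithmetic behind these two suggestions concrete, here is a minimal sketch in plain Python rather than SAS/IML. It assumes the default Euclidean distance, and the helper names pairwise_distance and ssq_about and the data are hypothetical; it illustrates what the calls compute and is not the code from the macros.&lt;/P&gt;
&lt;PRE&gt;
```python
import math

def pairwise_distance(x_rows, y_rows):
    """Euclidean distance from every row of x_rows to every row of y_rows."""
    return [[math.dist(x, y) for y in y_rows] for x in x_rows]

def ssq_about(x_rows, c):
    """Sum of squared differences between every row of x_rows and the vector c."""
    return sum((v - cj) ** 2 for row in x_rows for v, cj in zip(row, c))

# Hypothetical data: three observations and one centre in two dimensions.
X = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
c = [3.0, 4.0]
print(pairwise_distance(X, [c]))  # one row of distances per observation
print(ssq_about(X, c))            # a single scalar, like SSQ(X - c)
```
&lt;/PRE&gt;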
&lt;P&gt;Again, thanks for posting.&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jan 2022 11:40:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/790854#M5750</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2022-01-19T11:40:56Z</dc:date>
    </item>
    <item>
      <title>Re: Missing Value Imputation</title>
      <link>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/898635#M6083</link>
      <description>&lt;P&gt;Hello, and thanks for sharing your work with us! I have some simple yet possibly unanswered questions about multiple imputation (MI). In short: &lt;STRONG&gt;How should we pool the point estimates from the imputed samples when those estimates do not (necessarily) follow a normal distribution&lt;/STRONG&gt;?&lt;/P&gt;
&lt;P&gt;One of the most common problems of this kind in data analysis is pooling the median. Suppose I have a variable that has missing data and does not follow a normal distribution, so the median is the appropriate statistic for describing its central tendency. Having imputed&amp;nbsp;&lt;EM&gt;M&lt;/EM&gt;&amp;nbsp;times, I have&amp;nbsp;&lt;EM&gt;M&lt;/EM&gt;&amp;nbsp;medians. How should I pool them, given that Rubin's rule for pooling estimates is based on asymptotic normality, and that the&amp;nbsp;&lt;STRONG&gt;mean&lt;/STRONG&gt;&amp;nbsp;is asymptotically normal while this is not necessarily the case for the&amp;nbsp;&lt;STRONG&gt;median&lt;/STRONG&gt;?&lt;/P&gt;
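&lt;P&gt;For reference, here is a minimal Python sketch of Rubin's rule for a single scalar estimate. The function name pool_rubin and the numbers are hypothetical and chosen only for illustration; the sketch shows the standard pooling formulas, not any particular SAS procedure.&lt;/P&gt;
&lt;PRE&gt;
```python
import math
import statistics

def pool_rubin(estimates, variances):
    """Pool M point estimates and their squared standard errors with Rubin's rule."""
    m = len(estimates)
    q_bar = statistics.fmean(estimates)   # pooled point estimate
    u_bar = statistics.fmean(variances)   # within-imputation variance
    b = statistics.variance(estimates)    # between-imputation variance (divisor M - 1)
    t = u_bar + (1 + 1 / m) * b           # total variance
    return q_bar, math.sqrt(t)

# Hypothetical estimates and squared standard errors from M = 5 imputed data sets.
est = [10.2, 9.8, 10.5, 10.1, 9.9]
var = [0.25, 0.30, 0.28, 0.26, 0.27]
pooled, se = pool_rubin(est, var)
print(round(pooled, 3), round(se, 3))
```
&lt;/PRE&gt;
&lt;P&gt;The pooled point estimate is just the average of the M estimates. It is the confidence interval built from the pooled standard error that leans on approximate normality of the estimator.&lt;/P&gt;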
&lt;P&gt;Transforming the variable is one possible choice. That is, we bypass the problem by transforming the variable into one that follows a normal distribution, for example with a Box-Cox transformation, and report the pooled mean and standard deviation of the transformed variable rather than the original one. But the Box-Cox transformation is complex for practitioners without professional statistical training (e.g., medical doctors), and it sacrifices "intuitiveness" and "explainability" (e.g., it is difficult for anyone, especially a medical doctor, to say what the mean and standard deviation of a Box-Cox-transformed triglyceride level actually mean), so this method does not seem workable.&lt;/P&gt;
&lt;P&gt;Similar situations occur frequently in medical statistics. Sensitivity, specificity and the Youden index are all examples of this kind of problem.&lt;/P&gt;
&lt;P&gt;So, when pooling the estimates seems to violate the rationale of Rubin's rule, what should we do?&lt;/P&gt;
&lt;P&gt;Many thanks!&lt;/P&gt;</description>
      <pubDate>Sun, 15 Oct 2023 04:44:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/898635#M6083</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-10-15T04:44:56Z</dc:date>
    </item>
    <item>
      <title>Re: Missing Value Imputation</title>
      <link>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/898664#M6084</link>
      <description>Thank you for your interest in my work.&lt;BR /&gt;&lt;BR /&gt;As a general rule, whenever you impute missing values, you are adding&lt;BR /&gt;"value" to the data. This "value" may be helpful in that it solves the&lt;BR /&gt;problem of missingness but it may be a hindrance when, as you have&lt;BR /&gt;described the transformations, the result is to add complications to&lt;BR /&gt;understanding the results. If you, as a trained and literate practitioner&lt;BR /&gt;of statistics, have doubts and uncertainties about the imputation process,&lt;BR /&gt;how much more so someone who may be an expert in a specialized field but&lt;BR /&gt;not in statistical reasoning and practice.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;   - I once tried to explain the concept of compound average growth rate to&lt;BR /&gt;   a manager and he shook his head to indicate that he didn't understand what&lt;BR /&gt;   I had told him. So I simplified the answer to "it's like compound interest"&lt;BR /&gt;   and he nodded.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;   - I once tried to explain the concept of area under the ROC curve to my&lt;BR /&gt;   manager. He barely understood the concept of a 2x2 classification table,&lt;BR /&gt;   but once I said "and you build a new table for different values of cut-off&lt;BR /&gt;   value p and then plot true positive vs false positive as a parametric&lt;BR /&gt;   graph", he shook his head and I stopped trying to explain the AUROC idea&lt;BR /&gt;   because he was innumerate. 
He had a degree in ChemE but he was a yutz&lt;BR /&gt;   (Google on it).&lt;BR /&gt;&lt;BR /&gt;Basically, the answer must be on the same level of sophistication as your&lt;BR /&gt;audience or they will at best ignore your words and at worst say that you&lt;BR /&gt;can't communicate.&lt;BR /&gt;&lt;BR /&gt;So, for all of these words that you have endured reading until now, I would&lt;BR /&gt;compute the median of the medians and report that statistic to the users.&lt;BR /&gt;Hopefully, your results will not be too far off from the "true" but unknown&lt;BR /&gt;value of the missing variable(s). Your results will not be theoretically&lt;BR /&gt;elegant, but your audience will immediately understand what you have done.&lt;BR /&gt;As a test, you might artificially set some values in complete data to&lt;BR /&gt;missing, run the MI procedure to generate medians, and compare the&lt;BR /&gt;estimated medians on the "pseudomissing" data to the medians on the&lt;BR /&gt;complete case data as a sanity check. Be sure to use the same percentage of&lt;BR /&gt;induced missingness in the simulation of missingness as already exists in&lt;BR /&gt;the original data to make the results realistic.&lt;BR /&gt;&lt;BR /&gt;While this "seat of the pants" method is not particularly elegant, you are&lt;BR /&gt;not writing a PhD dissertation. I seriously doubt that any prospective&lt;BR /&gt;users will say "Seriously? Is this the best that you can do?" They will be&lt;BR /&gt;more likely to say, "Thank you for providing us a solution that we can&lt;BR /&gt;understand. Now we can go forward." Optimize when you can, satisfice when&lt;BR /&gt;you cannot. Sometimes, good enough is enough.&lt;BR /&gt;&lt;BR /&gt;Best regards,&lt;BR /&gt;Ross&lt;BR /&gt;</description>
      <pubDate>Sun, 15 Oct 2023 16:00:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/898664#M6084</guid>
      <dc:creator>rbettinger</dc:creator>
      <dc:date>2023-10-15T16:00:54Z</dc:date>
    </item>
    <item>
      <title>Re: Missing Value Imputation</title>
      <link>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/899493#M6089</link>
      <description>&lt;P&gt;Thank you very much for your patient reply! I can feel the passion you conveyed to a stranger who is interested in your research but had never spoken to you before. Thank you for your kindness and passion!&lt;/P&gt;
&lt;P&gt;I agree that the audience's interests matter a great deal in how you present your research results. Before raising my question here, I browsed the literature on multiple imputation (MI) extensively and thought about the problem on my own. The only approach I could think of, when clarity and simplicity were taken into consideration, was to use the median of the per-imputation estimates as the pooled estimate. But I could not find support for this idea in previous research, so I raised my question here to see whether better methods exist.&lt;/P&gt;
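&lt;P&gt;The median-of-medians idea, together with the "pseudomissing" sanity check suggested earlier in this thread, can be sketched in Python as follows. This is an illustration under stated assumptions: the function names, the simulated data, and the simple draw-from-observed imputation are all hypothetical stand-ins, and a real analysis would use PROC MI or another proper imputation model.&lt;/P&gt;
&lt;PRE&gt;
```python
import random
import statistics

def pooled_median(medians):
    """Pool per-imputation medians by taking their median."""
    return statistics.median(medians)

def sanity_check(complete, missing_fraction, m=5, seed=1):
    """Blank out part of complete data, impute M times, and pool the medians."""
    rng = random.Random(seed)
    n_missing = int(len(complete) * missing_fraction)
    hidden = set(rng.sample(range(len(complete)), n_missing))
    observed = [x for i, x in enumerate(complete) if i not in hidden]
    medians = []
    for _ in range(m):
        # Stand-in imputation: draw each missing value from the observed ones.
        imputed = observed + [rng.choice(observed) for _ in range(n_missing)]
        medians.append(statistics.median(imputed))
    # Return the pooled estimate next to the complete-data median for comparison.
    return pooled_median(medians), statistics.median(complete)

# Hypothetical right-skewed sample with 20 percent of values set to missing.
data_rng = random.Random(0)
complete = [data_rng.lognormvariate(0.0, 0.75) for _ in range(200)]
pooled, truth = sanity_check(complete, missing_fraction=0.2)
print(round(pooled, 3), round(truth, 3))
```
&lt;/PRE&gt;
&lt;P&gt;As suggested in that earlier reply, the missing fraction used in such a check should match the fraction actually missing in the original data.&lt;/P&gt;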
&lt;P&gt;Here is a more advanced topic concerning MI.&amp;nbsp;&lt;STRONG&gt;Are there any regression diagnostic methods for use with MI? The word "regression" in "regression diagnostic" refers to the model built on the imputed datasets, not to the imputation models. More specifically, I would like to ask how to assess outliers, collinearity and highly influential observations in the presence of MI.&amp;nbsp;&lt;/STRONG&gt;The multiple samples created by MI enable a more robust point estimate, but they also leave the data analyst unsure which sample to use when computing regression diagnostic statistics.&lt;/P&gt;
&lt;P&gt;Thank you again for your kind help!&lt;/P&gt;</description>
      <pubDate>Sat, 21 Oct 2023 03:48:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/899493#M6089</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-10-21T03:48:55Z</dc:date>
    </item>
    <item>
      <title>Re: Missing Value Imputation</title>
      <link>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/899538#M6093</link>
      <description>Thank you for your kind words, Season. Now that I am retired, I have the&lt;BR /&gt;luxury to investigate topics of interest in detail and I can then describe&lt;BR /&gt;my results for the user community to enjoy.&lt;BR /&gt;&lt;BR /&gt;I have no real experience with MI. I looked through the SAS description of&lt;BR /&gt;PROC MI and saw that the MI algorithm is a very intricate procedure with&lt;BR /&gt;which to perform missing value imputation.&lt;BR /&gt;&lt;BR /&gt;I wrote a paper on fuzzy c-means imputation which I posted to the SAS&lt;BR /&gt;Community:&lt;BR /&gt;&lt;A href="https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/790785" target="_blank"&gt;https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/790785&lt;/A&gt;.&lt;BR /&gt;It deals with missing value imputation quite effectively and, IMHO, much&lt;BR /&gt;more simply than the MI procedure. It is a popular tool and there is much&lt;BR /&gt;support for it in the published literature.&lt;BR /&gt;&lt;BR /&gt;HTH,&lt;BR /&gt;Ross&lt;BR /&gt;</description>
      <pubDate>Sat, 21 Oct 2023 15:20:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/899538#M6093</guid>
      <dc:creator>rbettinger</dc:creator>
      <dc:date>2023-10-21T15:20:54Z</dc:date>
    </item>
    <item>
      <title>Re: Missing Value Imputation</title>
      <link>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/899539#M6094</link>
      <description>All right. Thank you for your attention and for the time you spent on my questions!</description>
      <pubDate>Sat, 21 Oct 2023 15:40:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-IML-Software-and-Matrix/Missing-Value-Imputation/m-p/899539#M6094</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-10-21T15:40:41Z</dc:date>
    </item>
  </channel>
</rss>

