<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: sample size in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764615#M37344</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;On top of my previous reply (see above) ...&lt;BR /&gt;&lt;BR /&gt;I know that &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/27741"&gt;@gcjfernandez&lt;/a&gt;&amp;nbsp;is rather savvy about our survey procedures. Would be good to have his opinion.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
    <pubDate>Sat, 28 Aug 2021 11:53:26 GMT</pubDate>
    <dc:creator>sbxkoenk</dc:creator>
    <dc:date>2021-08-28T11:53:26Z</dc:date>
    <item>
      <title>sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/763960#M37290</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is it OK to use PROC SURVEYLOGISTIC to compare a group of 41 participants with 10144 participants&lt;SPAN style="font-family: inherit;"&gt;&amp;nbsp;using national survey data (NHANES) and weights?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Or is there a special syntax?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Wed, 25 Aug 2021 18:56:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/763960#M37290</guid>
      <dc:creator>lbahanan</dc:creator>
      <dc:date>2021-08-25T18:56:42Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764297#M37321</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Can you elaborate a bit?&lt;/P&gt;
&lt;P&gt;What exactly do you want to do?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Do you want to fit a binary response model where 41 people have a 1-response and&amp;nbsp;&lt;SPAN&gt;10144&amp;nbsp;have a 0-response (or vice versa)?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Where exactly do you have this severe unbalanced-ness (if that is an existing English word)?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Koen&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 26 Aug 2021 18:15:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764297#M37321</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-08-26T18:15:59Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764554#M37338</link>
      <description>Yes, exactly. I want to fit a binary response model where 41 people have a 1-response and 10144 have a 0-response. This variable is the predictor (independent variable), and I have other confounders in the model.&lt;BR /&gt;</description>
      <pubDate>Fri, 27 Aug 2021 19:41:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764554#M37338</guid>
      <dc:creator>lbahanan</dc:creator>
      <dc:date>2021-08-27T19:41:54Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764579#M37339</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am not really familiar with survey research.&lt;/P&gt;
&lt;P&gt;There may be some&amp;nbsp;peculiarities and intricacies when the data at hand are survey data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Anyway, you have very few observations in the rare category (n = 41).&lt;/P&gt;
&lt;P&gt;Thus, what you want to do is known as RARE EVENT MODELLING.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;The problem is that maximum likelihood estimation of the parameters of the logistic model is well-known to suffer from substantial bias when you have such&amp;nbsp;a small number of cases on the rarer of the two outcomes.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;You could use a method, known as "penalized likelihood" (also called the Firth method, after its inventor). Penalized likelihood is a general approach to reducing small-sample bias in maximum likelihood estimation.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;In PROC LOGISTIC there's the "firth" option on the model statement, but PROC SURVEYLOGISTIC does not have this option.&lt;/SPAN&gt;&lt;/P&gt;
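&lt;P&gt;&lt;SPAN&gt;A minimal sketch of what the FIRTH option looks like in PROC LOGISTIC. The dataset and variable names (work.mydata, outcome, exposure, age, sex) are hypothetical placeholders, and this call does not account for the survey design (strata, clusters), so take it as an illustration of the option only, not as a recommended analysis for NHANES data:&lt;/SPAN&gt;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Sketch only: work.mydata, outcome, exposure, age and sex are hypothetical names */
proc logistic data=work.mydata;
  class exposure sex / param=ref;
  /* FIRTH requests penalized-likelihood estimation;
     CLODDS=PL requests profile-likelihood confidence limits for the odds ratios */
  model outcome(event='1') = exposure age sex / firth clodds=pl;
run;&lt;/CODE&gt;&lt;/PRE&gt;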
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;I hope a survey analytics specialist will chime in to help you out.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;See also this paper from SAS Global Forum 2020:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Paper 4654-2020&lt;BR /&gt;Rare Events or Non-Convergence with a Binary Outcome? The Power of Firth Regression in PROC LOGISTIC&lt;BR /&gt;Patrick Karabon, Oakland University William Beaumont School of Medicine &lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-proceedings/2020/4654-2020.pdf" target="_blank"&gt;https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-proceedings/2020/4654-2020.pdf&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Kind regards,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Koen&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 27 Aug 2021 22:00:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764579#M37339</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-08-27T22:00:06Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764588#M37340</link>
      <description>Thank you so much for the helpful comment!&lt;BR /&gt;&lt;BR /&gt;Unfortunately, I’m using PROC SURVEYLOGISTIC.&lt;BR /&gt;&lt;BR /&gt;Can I proceed with it and mention the substantial bias in the limitations? Or it is not acceptable?&lt;BR /&gt;&lt;BR /&gt;Thank you!!&lt;BR /&gt;Lina&lt;BR /&gt;</description>
      <pubDate>Fri, 27 Aug 2021 22:44:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764588#M37340</guid>
      <dc:creator>lbahanan</dc:creator>
      <dc:date>2021-08-27T22:44:53Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764614#M37343</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You ask:&lt;BR /&gt;&amp;gt;&amp;nbsp;&lt;SPAN&gt;Can I proceed with it and mention the substantial bias in the limitations? Or it is not acceptable?&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Is this for a research paper in a journal?&lt;BR /&gt;It is a severe limitation.&amp;nbsp;&lt;BR /&gt;I would still take into account the serious unbalanced-ness in your analysis one way or another.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Good luck,&lt;BR /&gt;Koen&lt;/P&gt;</description>
      <pubDate>Sat, 28 Aug 2021 11:46:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764614#M37343</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-08-28T11:46:20Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764615#M37344</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;On top of my previous reply (see above) ...&lt;BR /&gt;&lt;BR /&gt;I know that &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/27741"&gt;@gcjfernandez&lt;/a&gt;&amp;nbsp;is rather savvy about our survey procedures. Would be good to have his opinion.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Sat, 28 Aug 2021 11:53:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764615#M37344</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-08-28T11:53:26Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764624#M37348</link>
      <description>Yes, it is a research paper.&lt;BR /&gt;&lt;BR /&gt;Can you have a look at this paper?&lt;BR /&gt;&lt;A href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6475717/#!po=1.85185" target="_blank"&gt;https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6475717/#!po=1.85185&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;As I understand it, they only used PROC SURVEYLOGISTIC.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Sat, 28 Aug 2021 13:39:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764624#M37348</guid>
      <dc:creator>lbahanan</dc:creator>
      <dc:date>2021-08-28T13:39:27Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764659#M37354</link>
      <description>&lt;P&gt;Please provide the details of your data, design, and objectives.&lt;/P&gt;
&lt;P&gt;The paper you cited used NHANES data, and using SURVEYLOGISTIC is appropriate there.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Do your data also come from a national survey such as NHANES?&lt;/P&gt;
&lt;P&gt;If so, what is the reference population? Finite or infinite?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What is the probability design? Is there stratification or a multistage cluster design? Are there survey weights? Was missing-value imputation used?&lt;/P&gt;
&lt;P&gt;Was there a post-stratification adjustment?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please watch my past presentation on this topic.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please check this link: &lt;A href="https://communities.sas.com/t5/Ask-the-Expert/What-Are-Best-Practices-for-Using-SAS-Survey-Procedures-Q-amp-A/ta-p/706830" target="_blank"&gt;https://communities.sas.com/t5/Ask-the-Expert/What-Are-Best-Practices-for-Using-SAS-Survey-Procedures-Q-amp-A/ta-p/706830&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 29 Aug 2021 05:47:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764659#M37354</guid>
      <dc:creator>gcjfernandez</dc:creator>
      <dc:date>2021-08-29T05:47:36Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764662#M37355</link>
      <description>&lt;P&gt;Thank you for your helpful comment!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yes, I'm using NHANES. So, I don't have any problem, right?&lt;/P&gt;&lt;P&gt;If yes, can you please explain the reason?&lt;/P&gt;&lt;P&gt;I'm comparing 44 subjects with 10144 subjects.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you!!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 29 Aug 2021 08:46:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764662#M37355</guid>
      <dc:creator>lbahanan</dc:creator>
      <dc:date>2021-08-29T08:46:49Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764793#M37362</link>
      <description>&lt;P&gt;Glad to know your data are from NHANES; you should therefore have the right design weights (e.g., WTMEC2YR or WTMEC4YR) and design variables (strata, primary sampling unit) in your data. You will be choosing one of the variance estimation methods (Taylor series, which is the default; jackknife; or BRR) in your analysis. When you compare the 42 with the full sample in SURVEYLOGISTIC using the correct syntax, you are making inferences about the reference population. Therefore you can use LSMEANS, ESTIMATE, or LSMESTIMATE when making your comparison. If there are any issues with convergence or estimation, the SAS log will notify you.&lt;/P&gt;
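&lt;P&gt;A minimal sketch of how such a call might look, under stated assumptions: work.nhanes, outcome, group, age, and sex are hypothetical names; SDMVSTRA, SDMVPSU, and WTMEC2YR are the usual NHANES stratum, PSU, and exam-weight variables, but please check the documentation for the survey cycles you are actually using:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Sketch only: work.nhanes, outcome, group, age and sex are hypothetical names */
proc surveylogistic data=work.nhanes varmethod=taylor;
  strata  sdmvstra;   /* design strata */
  cluster sdmvpsu;    /* primary sampling units */
  weight  wtmec2yr;   /* exam weight; use the weight that matches your cycles */
  class   group sex / param=glm;   /* GLM parameterization is needed for LSMEANS */
  model   outcome(event='1') = group age sex;
  lsmeans group / diff oddsratio cl;   /* compare the groups on the odds-ratio scale */
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Taylor series linearization is spelled out here only for clarity; VARMETHOD=JACKKNIFE or VARMETHOD=BRR could be substituted if one of those methods is preferred.&lt;/P&gt;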
&lt;P&gt;Hope this helps.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 30 Aug 2021 07:04:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764793#M37362</guid>
      <dc:creator>gcjfernandez</dc:creator>
      <dc:date>2021-08-30T07:04:30Z</dc:date>
    </item>
    <item>
      <title>Re: sample size</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764794#M37363</link>
      <description>&lt;P&gt;It helps a lot!&lt;/P&gt;&lt;P&gt;Thank you so much!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 30 Aug 2021 07:20:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/sample-size/m-p/764794#M37363</guid>
      <dc:creator>lbahanan</dc:creator>
      <dc:date>2021-08-30T07:20:23Z</dc:date>
    </item>
  </channel>
</rss>

