<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: assessing changes over time in a variable with an unknown category in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/assessing-changes-over-time-in-a-variable-with-an-unknown/m-p/827741#M40988</link>
    <description>&lt;P&gt;with summaries like that I would not make any conclusion other than what you state.&lt;/P&gt;
&lt;P&gt;For one thing what if data from 2004 had 15 records in the sample and the 2010 had 25,000?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Pick any start/end pair of points. I can create data that matches the end points that would throw questions on any statement of conclusions .&lt;/P&gt;
&lt;P&gt;Lets look at your first one for example. 2004 you have 7% of responses male, 2010 you have 11% male. What conclusion would you draw if the data for 2005 through 2009 were 75% or greater males each year? That would make me think something funny went on in 2004 and 2010.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Would there be any natural reason because of your sample methodology that would bias the male/female ratio? If you pick on a particular industry/occupation/hobby activity or similar approach there might be a known bias. The question would be does your data match that bias.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What might be appropriate might be either weighting to a known population ratio or imputing gender for the "unknown".&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You want to carefully read some of your own statements (or provide actual data sets). 2004 to 2010 is 7 years. Just like 2004 to 2005 is 2, the end point matters. Minor point.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;"situation 2: % females &lt;FONT color="#008000"&gt;&lt;STRONG&gt;decreased from 59% to 65%&lt;/STRONG&gt;&lt;/FONT&gt;, % of males increased from 18% to 25%,"&lt;/P&gt;
&lt;P&gt;Looks like an increase.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;And a real underlying question. Why are they "Unknown"? No answer recorded? Some other answer like refused to answer?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 09 Aug 2022 03:55:10 GMT</pubDate>
    <dc:creator>ballardw</dc:creator>
    <dc:date>2022-08-09T03:55:10Z</dc:date>
    <item>
      <title>assessing changes over time in a variable with an unknown category</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/assessing-changes-over-time-in-a-variable-with-an-unknown/m-p/827736#M40987</link>
      <description>&lt;P&gt;Hello, I have data over a 6 year time period and am looking at changes in gender. my variable has 3 categories - female, male, unknown. Question: what can I say, if anything, about the growth in the % of male and % female over time given the presence of the unknown? Below I should some of the scenarios I am facing. Note: data is MADE UP for illustration.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;situation 1: from 2004-2010, the % male increased from 7% to 11%, % of females increased from 74% to 83%, and unknown decreased from 19% to 6%. &lt;STRONG&gt;conclusion=&amp;gt; can't make any conclusions.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;situation 2: % females decreased from 59% to 65%, % of males increased from 18% to 25%, and unknown decreased from 23% to 10%. &lt;STRONG&gt;conclusion: growth in male category&lt;/STRONG&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;situation 3: % of females decreased from 73% to 65%, % of males increased from 8% to 14%, and unknown increased from 19% to 21%. &lt;STRONG&gt;conclusion=&amp;gt; ? possibilities?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;situation 4: % of females stayed the same at 6%, males increased from 70% to 80%, and unknown decreased from 24% to 14%.&amp;nbsp; &lt;STRONG&gt;conclusion: ?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for any assistance.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Aug 2022 02:08:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/assessing-changes-over-time-in-a-variable-with-an-unknown/m-p/827736#M40987</guid>
      <dc:creator>louisar</dc:creator>
      <dc:date>2022-08-09T02:08:10Z</dc:date>
    </item>
    <item>
      <title>Re: assessing changes over time in a variable with an unknown category</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/assessing-changes-over-time-in-a-variable-with-an-unknown/m-p/827741#M40988</link>
      <description>&lt;P&gt;with summaries like that I would not make any conclusion other than what you state.&lt;/P&gt;
&lt;P&gt;For one thing what if data from 2004 had 15 records in the sample and the 2010 had 25,000?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Pick any start/end pair of points. I can create data that matches the end points that would throw questions on any statement of conclusions .&lt;/P&gt;
&lt;P&gt;Lets look at your first one for example. 2004 you have 7% of responses male, 2010 you have 11% male. What conclusion would you draw if the data for 2005 through 2009 were 75% or greater males each year? That would make me think something funny went on in 2004 and 2010.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Would there be any natural reason because of your sample methodology that would bias the male/female ratio? If you pick on a particular industry/occupation/hobby activity or similar approach there might be a known bias. The question would be does your data match that bias.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What might be appropriate might be either weighting to a known population ratio or imputing gender for the "unknown".&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You want to carefully read some of your own statements (or provide actual data sets). 2004 to 2010 is 7 years. Just like 2004 to 2005 is 2, the end point matters. Minor point.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;"situation 2: % females &lt;FONT color="#008000"&gt;&lt;STRONG&gt;decreased from 59% to 65%&lt;/STRONG&gt;&lt;/FONT&gt;, % of males increased from 18% to 25%,"&lt;/P&gt;
&lt;P&gt;Looks like an increase.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;And a real underlying question. Why are they "Unknown"? No answer recorded? Some other answer like refused to answer?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Aug 2022 03:55:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/assessing-changes-over-time-in-a-variable-with-an-unknown/m-p/827741#M40988</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2022-08-09T03:55:10Z</dc:date>
    </item>
    <item>
      <title>Re: assessing changes over time in a variable with an unknown category</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/assessing-changes-over-time-in-a-variable-with-an-unknown/m-p/828155#M41001</link>
      <description>&lt;P&gt;thank you for the excellent feedback. that's a good point that I have to look at the years in between as well -&amp;nbsp; typically there is somewhat of an upward or downward trend.&lt;/P&gt;&lt;P&gt;The "unknown" category is individuals who are explicitly SELECTING an option that they do not wish to provide that information. does that make a difference in your interpretation?&lt;/P&gt;&lt;P&gt;It is difficult to know of any direction of bias, as it could go either way.&amp;nbsp; Imputing is not an option in our case.&amp;nbsp; One possibility is removing the "unknown" and then calculating the % male and % female on those who are known, but I think that has problems too.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2022 19:45:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/assessing-changes-over-time-in-a-variable-with-an-unknown/m-p/828155#M41001</guid>
      <dc:creator>louisar</dc:creator>
      <dc:date>2022-08-10T19:45:06Z</dc:date>
    </item>
  </channel>
</rss>

