<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: categorical variable or a continuous variable? in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/856116#M10413</link>
    <description>&lt;P&gt;&lt;FONT face="courier new,courier"&gt;"Smoking"-var has the label "Weight" which is weird.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;I would expect "Smoking"-var to be a YES vs. NO variable (1 versus 0).&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;That might be the case, but the &amp;lt; Maximum &amp;gt; equals 60??&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;The &amp;lt; median &amp;gt; however equals 1.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;This means that :&lt;/FONT&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;FONT face="courier new,courier"&gt;at least 50% of the 5173 non-missing values (records) have 1 or higher as a value for "Smoking"-var and&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT face="courier new,courier"&gt;at least 50% of the 5173 non-missing values (records) have 1 or lower as a value for "Smoking"-var.&lt;/FONT&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;I think "Smoking"-var should be categorical (i.e. a CLASS effect if you build a model, or a &lt;EM&gt;binary&lt;/EM&gt; target if you want to explain / predict it).&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;Use PROC FREQ with the NLEVELS option to check the cardinality of "Smoking"-var.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;PROC FREQ data=have NLEVELS;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;tables Smoking / missing;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;run;&lt;/FONT&gt;&lt;/P&gt;</description>
    <pubDate>Sat, 28 Jan 2023 22:14:50 GMT</pubDate>
    <dc:creator>sbxkoenk</dc:creator>
    <dc:date>2023-01-28T22:14:50Z</dc:date>
    <item>
      <title>categorical variable or a continuous variable?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/855862#M10411</link>
      <description>&lt;P&gt;Hello all,&amp;nbsp;&lt;/P&gt;&lt;P&gt;Need help with this class assignment to better understand the two types of variables:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please help answer the question below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="Screen Shot 2023-01-26 at 2.01.36 PM.png" style="width: 808px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/79865i6502F89F573128E7/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screen Shot 2023-01-26 at 2.01.36 PM.png" alt="Screen Shot 2023-01-26 at 2.01.36 PM.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 26 Jan 2023 22:09:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/855862#M10411</guid>
      <dc:creator>simmaee</dc:creator>
      <dc:date>2023-01-26T22:09:23Z</dc:date>
    </item>
    <item>
      <title>Re: categorical variable or a continuous variable?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/855870#M10412</link>
      <description>&lt;P&gt;Which question? There are three.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In general, before deciding I want to know what the units of a measure are, if any, and how the data were collected.&lt;/P&gt;
&lt;P&gt;I ask this sort of thing because I am aware of many surveys that collect data in Yes / No / Refused / Don't know categories but code the values as, for example, 1, 2, 7 and 9. Summary statistics for such a variable could show a minimum of 1, a median of 2 and a maximum of 9.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If I were not allowed to have information about how the data were collected, I would use a tool such as PROC FREQ with the NLEVELS option to see just how many distinct values are involved. If the number of distinct values comes back as 4, as in my example above, that might push the decision towards categorical.&lt;/P&gt;
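&lt;P&gt;A minimal sketch of that check, assuming a made-up data set SURVEY with a numeric variable ANSWER that holds survey codes 1, 2, 7 and 9 (the data set name, the variable name and all values are hypothetical, not the data from the question):&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;/* hypothetical codes: 1=Yes 2=No 7=Refused 9=Do not know */&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;data survey;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;input answer @@;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;datalines;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;1 2 2 1 7 2 9 2 1 2&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;run;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;/* summary statistics treat the codes as if they were measurements */&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;proc means data=survey n min median max;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;var answer;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;run;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;/* NLEVELS adds a table with the number of distinct values */&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;proc freq data=survey nlevels;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;tables answer / missing;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;run;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;With these made-up values PROC MEANS reports a minimum of 1, a median of 2 and a maximum of 9, while NLEVELS reports 4 distinct levels, and it is the small level count that hints at a categorical variable.&lt;/P&gt;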
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Some variables could be treated as either depending on the specific analysis attempted or question asked.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 26 Jan 2023 22:54:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/855870#M10412</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2023-01-26T22:54:07Z</dc:date>
    </item>
    <item>
      <title>Re: categorical variable or a continuous variable?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/856116#M10413</link>
      <description>&lt;P&gt;&lt;FONT face="courier new,courier"&gt;"Smoking"-var has the label "Weight" which is weird.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;I would expect "Smoking"-var to be a YES vs. NO variable (1 versus 0).&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;That might be the case, but the &amp;lt; Maximum &amp;gt; equals 60??&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;The &amp;lt; median &amp;gt; however equals 1.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;This means that :&lt;/FONT&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;FONT face="courier new,courier"&gt;at least 50% of the 5173 non-missing values (records) have 1 or higher as a value for "Smoking"-var and&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT face="courier new,courier"&gt;at least 50% of the 5173 non-missing values (records) have 1 or lower as a value for "Smoking"-var.&lt;/FONT&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;I think "Smoking"-var should be categorical (i.e. a CLASS effect if you build a model, or a &lt;EM&gt;binary&lt;/EM&gt; target if you want to explain / predict it).&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;Use PROC FREQ with the NLEVELS option to check the cardinality of "Smoking"-var.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;PROC FREQ data=have NLEVELS;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;tables Smoking / missing;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;run;&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 28 Jan 2023 22:14:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/856116#M10413</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2023-01-28T22:14:50Z</dc:date>
    </item>
    <item>
      <title>Re: categorical variable or a continuous variable?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/856121#M10414</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/438874"&gt;@simmaee&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;Hello all,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Need help with this class assignment to better understand the two types of variables:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please help answer the question below:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="Screen Shot 2023-01-26 at 2.01.36 PM.png" style="width: 808px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/79865i6502F89F573128E7/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screen Shot 2023-01-26 at 2.01.36 PM.png" alt="Screen Shot 2023-01-26 at 2.01.36 PM.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;So what do you think about Smoking? You didn't tell us.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Perhaps you should go back to whoever provided this example output and ask why the label for SMOKING is WEIGHT. Or ask how SMOKING can have a mean of 9.366. I think that is the next step.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This isn't our data, and I doubt anyone here can explain what variable SMOKING represents or why the mean is 9.366. As it is, I think this is an extremely poor class example.&lt;/P&gt;</description>
      <pubDate>Sun, 29 Jan 2023 10:53:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/856121#M10414</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2023-01-29T10:53:34Z</dc:date>
    </item>
    <item>
      <title>Re: categorical variable or a continuous variable?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/856122#M10415</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/60547"&gt;@sbxkoenk&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;"Smoking"-var has the label "Weight" which is weird.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;That's just some poor rendition. I looked at that several times before realizing that the label that appears to be for Smoking is the last word of the label for the variable above Smoking, MRW, where the label Metropolitan Relative Weight makes some sense.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;The "Age at Death" label for Cholesterol shows the same careless alignment by whoever prepared that image.&lt;/FONT&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;</description>
      <pubDate>Sun, 29 Jan 2023 00:47:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/categorical-variable-or-a-continuous-variable/m-p/856122#M10415</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2023-01-29T00:47:58Z</dc:date>
    </item>
  </channel>
</rss>

