<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Proc Univariate trouble in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Univariate-trouble/m-p/793867#M38918</link>
    <description>&lt;P&gt;I was given a dataset called Cancer and was told to find the N, Mean, SD, Median, Min, and Max for the variables "Exposure" and "Mortality." I thought this was simple since there is a linear correlation between both variables so I used the code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;proc univariate data=cancer.cancer;&lt;BR /&gt;var exposure mortality;&lt;BR /&gt;run;&lt;/PRE&gt;&lt;P&gt;which was able to generate all the values I needed however, the only answer considered "right" was the N value I obtained, everything else was incorrect. Am I supposed to be using a different code? or Multiplying something? Will attach the dataset and problem for reference.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Synopsis:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN class=""&gt;Since World War II, plutonium for use in atomic weapons has been produced at an Atomic Energy&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;Commission facility in Hanford, WA. One of the major safety issues has been the storage of radioactive&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;wastes. Over the years, significant quantities of these substances, including Sr&lt;/SPAN&gt;&lt;SPAN class=""&gt;90&lt;/SPAN&gt;&lt;SPAN class=""&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;and Cs&lt;/SPAN&gt;&lt;SPAN class=""&gt;137&lt;/SPAN&gt;&lt;SPAN class=""&gt;, have leaked into&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;the nearby Columbia River, which flows along the Washington-Oregon border into the Pacific Ocean.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class=""&gt;To measure the health consequences of this contamination, an index of exposure was calculated for each of&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;the nine Oregon counties bordering either the Columbia River or the Pacific Ocean. This particular index was&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;based on several factors, including the county's stream distance from Hanford and the average distance of its&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;population from the water. As a covariate, the cancer mortality rate was determined for each county.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;The SAS data set,&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;cancer&lt;/SPAN&gt;&lt;SPAN class=""&gt;,&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;is located on SAS on Demand and in your&lt;/SPAN&gt;&lt;SPAN class=""&gt;&amp;nbsp;BIOS-517 library&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;The data set contains the following variables:&lt;/SPAN&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;County&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Name of county&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Exposure&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Index of exposure&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Mortality&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Cancer mortality per 100,000 person-yrs&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;SPAN class=""&gt;You may assume that the dataset is clean, but you should still do univariate analyses to familiarize yourself&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;with each variable.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 01 Feb 2022 19:08:35 GMT</pubDate>
    <dc:creator>saza</dc:creator>
    <dc:date>2022-02-01T19:08:35Z</dc:date>
    <item>
      <title>Proc Univariate trouble</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Univariate-trouble/m-p/793867#M38918</link>
      <description>&lt;P&gt;I was given a dataset called Cancer and was told to find the N, Mean, SD, Median, Min, and Max for the variables "Exposure" and "Mortality." I thought this was simple since there is a linear correlation between both variables so I used the code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;proc univariate data=cancer.cancer;&lt;BR /&gt;var exposure mortality;&lt;BR /&gt;run;&lt;/PRE&gt;&lt;P&gt;which was able to generate all the values I needed however, the only answer considered "right" was the N value I obtained, everything else was incorrect. Am I supposed to be using a different code? or Multiplying something? Will attach the dataset and problem for reference.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Synopsis:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN class=""&gt;Since World War II, plutonium for use in atomic weapons has been produced at an Atomic Energy&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;Commission facility in Hanford, WA. One of the major safety issues has been the storage of radioactive&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;wastes. Over the years, significant quantities of these substances, including Sr&lt;/SPAN&gt;&lt;SPAN class=""&gt;90&lt;/SPAN&gt;&lt;SPAN class=""&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;and Cs&lt;/SPAN&gt;&lt;SPAN class=""&gt;137&lt;/SPAN&gt;&lt;SPAN class=""&gt;, have leaked into&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;the nearby Columbia River, which flows along the Washington-Oregon border into the Pacific Ocean.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class=""&gt;To measure the health consequences of this contamination, an index of exposure was calculated for each of&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;the nine Oregon counties bordering either the Columbia River or the Pacific Ocean. This particular index was&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;based on several factors, including the county's stream distance from Hanford and the average distance of its&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;population from the water. As a covariate, the cancer mortality rate was determined for each county.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;The SAS data set,&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;cancer&lt;/SPAN&gt;&lt;SPAN class=""&gt;,&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;is located on SAS on Demand and in your&lt;/SPAN&gt;&lt;SPAN class=""&gt;&amp;nbsp;BIOS-517 library&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;The data set contains the following variables:&lt;/SPAN&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;County&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Name of county&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Exposure&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Index of exposure&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Mortality&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Cancer mortality per 100,000 person-yrs&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;SPAN class=""&gt;You may assume that the dataset is clean, but you should still do univariate analyses to familiarize yourself&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;with each variable.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 01 Feb 2022 19:08:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Univariate-trouble/m-p/793867#M38918</guid>
      <dc:creator>saza</dc:creator>
      <dc:date>2022-02-01T19:08:35Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Univariate trouble</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Univariate-trouble/m-p/793868#M38919</link>
      <description>&lt;P&gt;What values did you get and what was the "correct" answer?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Did you read in a CSV file or different file type to create the cancer file attached or was it provided as is?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/413742"&gt;@saza&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;I was given a dataset called Cancer and was told to find the N, Mean, SD, Median, Min, and Max for the variables "Exposure" and "Mortality." I thought this was simple since there is a linear correlation between both variables so I used the code:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;proc univariate data=cancer.cancer;&lt;BR /&gt;var exposure mortality;&lt;BR /&gt;run;&lt;/PRE&gt;
&lt;P&gt;which was able to generate all the values I needed however, the only answer considered "right" was the N value I obtained, everything else was incorrect. Am I supposed to be using a different code? or Multiplying something? Will attach the dataset and problem for reference.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Synopsis:&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class=""&gt;Since World War II, plutonium for use in atomic weapons has been produced at an Atomic Energy&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;Commission facility in Hanford, WA. One of the major safety issues has been the storage of radioactive&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;wastes. Over the years, significant quantities of these substances, including Sr&lt;/SPAN&gt;&lt;SPAN class=""&gt;90&lt;/SPAN&gt;&lt;SPAN class=""&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;and Cs&lt;/SPAN&gt;&lt;SPAN class=""&gt;137&lt;/SPAN&gt;&lt;SPAN class=""&gt;, have leaked into&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;the nearby Columbia River, which flows along the Washington-Oregon border into the Pacific Ocean.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class=""&gt;To measure the health consequences of this contamination, an index of exposure was calculated for each of&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;the nine Oregon counties bordering either the Columbia River or the Pacific Ocean. This particular index was&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;based on several factors, including the county's stream distance from Hanford and the average distance of its&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;population from the water. As a covariate, the cancer mortality rate was determined for each county.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;The SAS data set,&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;cancer&lt;/SPAN&gt;&lt;SPAN class=""&gt;,&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;is located on SAS on Demand and in your&lt;/SPAN&gt;&lt;SPAN class=""&gt;&amp;nbsp;BIOS-517 library&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN class=""&gt;The data set contains the following variables:&lt;/SPAN&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN class=""&gt;County&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Name of county&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN class=""&gt;Exposure&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Index of exposure&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN class=""&gt;Mortality&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Cancer mortality per 100,000 person-yrs&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN class=""&gt;You may assume that the dataset is clean, but you should still do univariate analyses to familiarize yourself&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;with each variable.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 01 Feb 2022 19:17:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Univariate-trouble/m-p/793868#M38919</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2022-02-01T19:17:43Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Univariate trouble</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Univariate-trouble/m-p/793875#M38920</link>
      <description>&lt;P&gt;I am just guessing, but were your answers submitted electronically and graded by computer?&amp;nbsp; If so, you might search the assignment or computer instructions to determine how many decimal digits you should submit for non-integer values. For example, PROC UNIVARIATE might tell you that the mean of the Exposure variable is&amp;nbsp;4.61777778, but the software might be expecting 4.6178. Check with your peers and your instructor to determine the format for the answers.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Still, I am surprised you didn't get correct answers for MIN and MAX, which only have one correct answer.&lt;/P&gt;</description>
      <pubDate>Tue, 01 Feb 2022 20:30:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Univariate-trouble/m-p/793875#M38920</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2022-02-01T20:30:11Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Univariate trouble</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Univariate-trouble/m-p/793876#M38922</link>
      <description>&lt;P&gt;You really need to describe what you consider "wrong". The example data set only has 9 records with one character variable for the county name and then measure variables so there really isn't much going on in that data set.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It may be that you are seeing more decimal places in the output then you expect. So if a mean, as reported by proc univariate is 4.61777778 and you were told to expect something like 4.6 or 4.62 then the difference is a rounding choice someone made. Or did you expect to see more digits?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A variable such as Exposure in this context could well have an instrumental limit of accuracy and decimals past a certain point imply more precision than the instruments could measure. Or with deaths per 100,000 people if you carry too many decimals you start talking about fractional person-deaths. Except in movies like "Princess Bride" people tend to be dead or alive, not "mostly dead". So the mortality would seldom be reported with more than 2 or 3 decimals.&lt;/P&gt;
&lt;DIV class="branch"&gt;
&lt;DIV&gt;
&lt;DIV align="left"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;</description>
      <pubDate>Tue, 01 Feb 2022 20:36:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Univariate-trouble/m-p/793876#M38922</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2022-02-01T20:36:37Z</dc:date>
    </item>
  </channel>
</rss>

