<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Help with statistics please in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86257#M256993</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks for help, but i still cant get the needed answer. Let me explain the problem more clearly. I have a data set which looks like that (-1, 0.8, -1, -1, 0.9,0, 1.1, 0, -1,...........,0.7, 0, 0.9), N=1000, mean ~0.04. I am using bootstrapping technique to find out the distribution and i get X~N(0.05, 0.02). Everything is clear for me here. If i put mean, std dev, and lower/upper bounds of mean to proc power to get N i get ~300. Thats look totally normal as i was guessing that 300 would be enough before trying to calculate this. But if i choose 300 random observations from data set containing 1000 observations and calculate mean, every time it is not even close to 0.05. Even running bootstrapping when generating 300 samples means where N =300 i get totally different results. So how do i get N with which mean would be similar in both cases with all data from set and with N=300 ? &lt;/P&gt;&lt;P&gt;Problem: for example i want to collect another data set with similar observations (it takes too long to get another 1000 observations) and i want to decide when number of observations is enough to conclude that mean is equal to some number and it will be the same in a long term so and i can invest real money into these observations.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 29 Jan 2013 07:46:17 GMT</pubDate>
    <dc:creator>EdvinasS</dc:creator>
    <dc:date>2013-01-29T07:46:17Z</dc:date>
    <item>
      <title>Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86252#M256988</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;i would really appreciate if anyone could help me to solve a problem:&lt;/P&gt;&lt;P&gt;i have dataset X containing 1000 observations that are normaly distirbuted. X~N(0.05,0.02)&lt;/P&gt;&lt;P&gt;What i need to do is to find out what minimum amount of such observations do i need to get confident mean and std.&lt;/P&gt;&lt;P&gt;Why i need it? this dataset was collected in two years and now i can tell that it is really enougth of observations to claim that mean=0.05. i can not wait another two years to collect data for another type of observations so i need to find out how many observations do i need to get until&amp;nbsp; i can claim that mean of data set is confident?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 25 Jan 2013 17:00:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86252#M256988</guid>
      <dc:creator>EdvinasS</dc:creator>
      <dc:date>2013-01-25T17:00:06Z</dc:date>
    </item>
    <item>
      <title>Re: Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86253#M256989</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Sounds like you need a sample size calculation. Look at proc power, or in this case a google search will bring up a lot of online calculators. &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 25 Jan 2013 17:15:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86253#M256989</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2013-01-25T17:15:01Z</dc:date>
    </item>
    <item>
      <title>Re: Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86254#M256990</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;thanks for help. i have problem with proc power..&lt;/P&gt;&lt;P&gt;proc power;&lt;/P&gt;&lt;P&gt;onesamplemeans&lt;/P&gt;&lt;P&gt;mean = 0.05&lt;/P&gt;&lt;P&gt;ntotal = 10&lt;/P&gt;&lt;P&gt;stddev = 0.02&lt;/P&gt;&lt;P&gt;power = .;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;it gives me power &amp;gt;.999 all the time, no matter what ntotal value i enter. how can it be...?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 25 Jan 2013 17:31:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86254#M256990</guid>
      <dc:creator>EdvinasS</dc:creator>
      <dc:date>2013-01-25T17:31:51Z</dc:date>
    </item>
    <item>
      <title>Re: Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86255#M256991</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Because SAS is saying you only need 4 so anything over 4 will give you a lot of power...I thought you were looking for n though, so you'd set your power to 0.8 say and see what you get for n instead.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc power;&lt;/P&gt;&lt;P&gt;onesamplemeans&lt;/P&gt;&lt;P&gt;mean = 5&lt;/P&gt;&lt;P&gt;ntotal = .&lt;/P&gt;&lt;P&gt;stddev = 2&lt;/P&gt;&lt;P&gt;power = .8;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 25 Jan 2013 17:41:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86255#M256991</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2013-01-25T17:41:17Z</dc:date>
    </item>
    <item>
      <title>Re: Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86256#M256992</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;To tell you the truth, I don't have a problem with getting a power of &amp;gt; 0.999 in this case.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I do have a problem with your statement "no matter what ntotal value i enter". When I enter an ntotal value of 5 or less, the power decreases.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The power of a test is: the probability that the test will reject the null hypothesis when the null hypothesis is false. So, with only 2 observation, the probability of rejecting H0: mean=0 when in fact H0 is false is 0.219. That seems like the right answer to me. With only 2 observations, and mean ne 0, you could easily wind up accepting the null hypothesis. When ntotal increases to 10, it seems very likely that you will reject the null hypothesis when it is false.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 25 Jan 2013 17:42:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86256#M256992</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2013-01-25T17:42:09Z</dc:date>
    </item>
    <item>
      <title>Re: Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86257#M256993</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks for help, but i still cant get the needed answer. Let me explain the problem more clearly. I have a data set which looks like that (-1, 0.8, -1, -1, 0.9,0, 1.1, 0, -1,...........,0.7, 0, 0.9), N=1000, mean ~0.04. I am using bootstrapping technique to find out the distribution and i get X~N(0.05, 0.02). Everything is clear for me here. If i put mean, std dev, and lower/upper bounds of mean to proc power to get N i get ~300. Thats look totally normal as i was guessing that 300 would be enough before trying to calculate this. But if i choose 300 random observations from data set containing 1000 observations and calculate mean, every time it is not even close to 0.05. Even running bootstrapping when generating 300 samples means where N =300 i get totally different results. So how do i get N with which mean would be similar in both cases with all data from set and with N=300 ? &lt;/P&gt;&lt;P&gt;Problem: for example i want to collect another data set with similar observations (it takes too long to get another 1000 observations) and i want to decide when number of observations is enough to conclude that mean is equal to some number and it will be the same in a long term so and i can invest real money into these observations.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 29 Jan 2013 07:46:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86257#M256993</guid>
      <dc:creator>EdvinasS</dc:creator>
      <dc:date>2013-01-29T07:46:17Z</dc:date>
    </item>
    <item>
      <title>Re: Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86258#M256994</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Something is wrong in your math somewhere. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If you're bootstrapping your results and generating 300 different samples that don't have a mean similar to your 'true' mean then I question the calculation of the original mean.&amp;nbsp; Unless you have a few extreme outliers, but then I'd expect the std to account for that.&amp;nbsp; &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 29 Jan 2013 15:53:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86258#M256994</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2013-01-29T15:53:17Z</dc:date>
    </item>
    <item>
      <title>Re: Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86259#M256995</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Its not like "not even close" but if i take mean of original data its ~0.04. if i use bootstraping for distribution analysis and generate 1000 datasets with 1000 randomly selected observations from original (which contains 1000 obs) i get mean 0.05 +-0.005, std=0.025. if i put mean, std and up/low limits of mean into proc power to get N for onesample mean with power 0.9 and alpha=0.05 i get N= ~300. if i repeat bootstraping and generate 300 datasets with 300 randomly selected obs from original dataset i get mean ~0.08 which is twice bigger than real mean of original dataset. if i simply take obs 1-300, 300-600, 600-900 from original data set i get means like ~0.08, ~0.05, ~ -0.02. im stuck...&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 30 Jan 2013 08:11:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86259#M256995</guid>
      <dc:creator>EdvinasS</dc:creator>
      <dc:date>2013-01-30T08:11:09Z</dc:date>
    </item>
    <item>
      <title>Re: Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86260#M256996</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;PRE __jive_macro_name="quote" class="jive_text_macro jive_macro_quote" modifiedtitle="true"&gt;
&lt;P&gt;EdvinasS wrote:&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Its not like "not even close" but if i take mean of original data its ~0.04. if i use bootstraping for distribution analysis and generate 1000 datasets with 1000 randomly selected observations from original (which contains 1000 obs) i get mean 0.05 +-0.005, std=0.025.&lt;/P&gt;
&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Usually you use less than the number of observations, but I still think somethings wrong, possibly in your code somewhere but I can't tell.&amp;nbsp; &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 30 Jan 2013 15:40:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86260#M256996</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2013-01-30T15:40:54Z</dc:date>
    </item>
    <item>
      <title>Re: Help with statistics please</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86261#M256997</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;This is very suspicious:&amp;nbsp; "if i simply take obs 1-300, 300-600, 600-900 from original data set i get means like ~0.08, ~0.05, ~ -0.02."&amp;nbsp; Are your observations in time order?&amp;nbsp; Maybe there is a process going on that means this is not really a random sample from a univariate distribution.&amp;nbsp; Try box plots by quarter to see if there is some sort of regression involve.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 30 Jan 2013 17:51:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-with-statistics-please/m-p/86261#M256997</guid>
      <dc:creator>Doc_Duke</dc:creator>
      <dc:date>2013-01-30T17:51:42Z</dc:date>
    </item>
  </channel>
</rss>

