<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: NPAR1WAY, SURVEYSELECT and Kolmogorov-Smirnov two-sample tests on weighted data in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/NPAR1WAY-SURVEYSELECT-and-Kolmogorov-Smirnov-two-sample-tests-on/m-p/173036#M8976</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Not sure a weighted K-S test exists. Is there a reference describing such a test? - PG&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Sun, 24 Aug 2014 18:00:52 GMT</pubDate>
    <dc:creator>PGStats</dc:creator>
    <dc:date>2014-08-24T18:00:52Z</dc:date>
    <item>
      <title>NPAR1WAY, SURVEYSELECT and Kolmogorov-Smirnov two-sample tests on weighted data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/NPAR1WAY-SURVEYSELECT-and-Kolmogorov-Smirnov-two-sample-tests-on/m-p/173035#M8975</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;In need to compare two distributions using NPAR1WAY and two-sample K-S tests,but one of them is weighted.&amp;nbsp; If I set FREQ to the weight I get the correct cumulative distribution for the weighted data, but NPAR1WAY calculates the p-value incorrectly.&amp;nbsp; It thinks the number of entries in the cumulative distribution is the sum of the weights, whereas it is much lower (thus the p-values are too low).&amp;nbsp; Given the D-statistic, which I think SAS calculates correctly from the two cumulative distributions, I believe I can recalculate the p-value from the correct numbers of entries in the two distributions.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is there a way to get NPAR1WAY to correctly calculate the p-value?&amp;nbsp; Problem is I have to do this for 400 different pairs of distributions!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can I somehow use SURVEYSELECT to resample the weighted distribution to get an unweighted distribution having the original number of observations?&amp;nbsp; E.g. if the unweighted data set has 10K entries, and the sum of the weights is 200M, can SURVEYSELECT produce a data set with 10K entries that reproduces the weighted sample cumulative distribution?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 23 Aug 2014 20:06:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/NPAR1WAY-SURVEYSELECT-and-Kolmogorov-Smirnov-two-sample-tests-on/m-p/173035#M8975</guid>
      <dc:creator>ewolin</dc:creator>
      <dc:date>2014-08-23T20:06:21Z</dc:date>
    </item>
    <item>
      <title>Re: NPAR1WAY, SURVEYSELECT and Kolmogorov-Smirnov two-sample tests on weighted data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/NPAR1WAY-SURVEYSELECT-and-Kolmogorov-Smirnov-two-sample-tests-on/m-p/173036#M8976</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Not sure a weighted K-S test exists. Is there a reference describing such a test? - PG&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 24 Aug 2014 18:00:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/NPAR1WAY-SURVEYSELECT-and-Kolmogorov-Smirnov-two-sample-tests-on/m-p/173036#M8976</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2014-08-24T18:00:52Z</dc:date>
    </item>
    <item>
      <title>Re: NPAR1WAY, SURVEYSELECT and Kolmogorov-Smirnov two-sample tests on weighted data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/NPAR1WAY-SURVEYSELECT-and-Kolmogorov-Smirnov-two-sample-tests-on/m-p/173037#M8977</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The weights are essentially predicted frequencies, and I use them as such.&amp;nbsp; They are based on a full survey design and using the weights/frequencies when plotting a variable should give a distribution that is close to what one would get if one sampled the entire US civilian population where every sample had weight equal to one.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The K-S test should work fine, I just want to find a way to get SAS to calculate the p-value correctly.&amp;nbsp; It gets the d-statistic correct, I believe.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 24 Aug 2014 18:07:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/NPAR1WAY-SURVEYSELECT-and-Kolmogorov-Smirnov-two-sample-tests-on/m-p/173037#M8977</guid>
      <dc:creator>ewolin</dc:creator>
      <dc:date>2014-08-24T18:07:05Z</dc:date>
    </item>
  </channel>
</rss>

