<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic proc surveyselect to split the data to two groups in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195689#M48946</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;We have physician data with taxID and need to split them equally to two groups. We want to keep the physicians together under same taxID.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;First, we use the below coding to assign 0 and 1 based on taxID.&lt;/P&gt;&lt;P&gt;proc surveyselect data=test samprate=0.50 seed=49201 out=Sample outall&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; method=srs noprint;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; run;&lt;/P&gt;&lt;P&gt;Then, they ask whether we could have a list which also consider the size of taxID(the physicians number), because each taxID have a vary number of physicians. For example, they probably have 1 physician or 20 physicians under each taxID.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So, the current question is how to split all of the taxID to 2 groups randomly, which also make sure they have similiar amount physicians under each group.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Kui&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 14 Apr 2015 16:28:47 GMT</pubDate>
    <dc:creator>zhongdianshi</dc:creator>
    <dc:date>2015-04-14T16:28:47Z</dc:date>
    <item>
      <title>proc surveyselect to split the data to two groups</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195689#M48946</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;We have physician data with taxID and need to split them equally to two groups. We want to keep the physicians together under same taxID.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;First, we use the below coding to assign 0 and 1 based on taxID.&lt;/P&gt;&lt;P&gt;proc surveyselect data=test samprate=0.50 seed=49201 out=Sample outall&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; method=srs noprint;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; run;&lt;/P&gt;&lt;P&gt;Then, they ask whether we could have a list which also consider the size of taxID(the physicians number), because each taxID have a vary number of physicians. For example, they probably have 1 physician or 20 physicians under each taxID.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So, the current question is how to split all of the taxID to 2 groups randomly, which also make sure they have similiar amount physicians under each group.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Kui&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Apr 2015 16:28:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195689#M48946</guid>
      <dc:creator>zhongdianshi</dc:creator>
      <dc:date>2015-04-14T16:28:47Z</dc:date>
    </item>
    <item>
      <title>Re: proc surveyselect to split the data to two groups</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195690#M48947</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Do you want a sample of physicians or of TaxIds?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Apr 2015 17:44:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195690#M48947</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2015-04-14T17:44:05Z</dc:date>
    </item>
    <item>
      <title>Re: proc surveyselect to split the data to two groups</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195691#M48948</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;sample of taxID. And also consider the number of physicians under the taxID&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Kui&lt;/P&gt;&lt;P&gt;&lt;BR /&gt; &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Apr 2015 17:48:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195691#M48948</guid>
      <dc:creator>zhongdianshi</dc:creator>
      <dc:date>2015-04-14T17:48:04Z</dc:date>
    </item>
    <item>
      <title>Re: proc surveyselect to split the data to two groups</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195692#M48949</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;If I sort taxID by number of physician, then randomly assign them to two groups, I guess, they might have similar physician number.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Apr 2015 20:25:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195692#M48949</guid>
      <dc:creator>zhongdianshi</dc:creator>
      <dc:date>2015-04-14T20:25:54Z</dc:date>
    </item>
    <item>
      <title>Re: proc surveyselect to split the data to two groups</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195693#M48950</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Technically putting another requirement on the resulting sample changes the "random selection" behavior you may have chosen. One approach would be to have a summary of the TaxId with the number of physicians at each. Select a sample of TaxIds in your groups/strata what ever. Examine the sum of physicians. If the number is too far apart, rerun the sample until you get something "close enough". But you are playing somewhat loose with the probability of selection and resulting weights.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Apr 2015 21:20:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195693#M48950</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2015-04-14T21:20:24Z</dc:date>
    </item>
    <item>
      <title>Re: proc surveyselect to split the data to two groups</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195694#M48951</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks for your time and suggestions.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Apr 2015 13:56:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-surveyselect-to-split-the-data-to-two-groups/m-p/195694#M48951</guid>
      <dc:creator>zhongdianshi</dc:creator>
      <dc:date>2015-04-15T13:56:52Z</dc:date>
    </item>
  </channel>
</rss>

