<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Comparing 2 datasets in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Comparing-2-datasets/m-p/257328#M268933</link>
    <description>&lt;P&gt;Ya you're so right. We're dealing with counts here. Thanks a lot &lt;A href="https://communities.sas.com/t5/user/viewprofilepage/user-id/32733" target="_self"&gt;&lt;SPAN class="login-bold"&gt;FreelanceReinhard&lt;/SPAN&gt;&lt;/A&gt;. I just have another question if you don't mind. As I said, I have to see if there is any difference in death rates in small areas due to difference in geocoding in two different data sets (period 1,years 2003-2007, and&amp;nbsp;period 2,&lt;SPAN&gt;years 2008-2012&lt;/SPAN&gt;). Do you have any thoughts about how can I do it&amp;nbsp;? So far I've calculated percentage of geocoded data in each county (not small areas), and ANOVA tests with "period" and "sarea" as independent variables and "cause of death" as dependent variable (couldn't do the interaction terms due to 0 degrees of freedom for errors).&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;BR /&gt;&lt;IMG src="https://communities.sas.com/t5/image/serverpage/image-id/12385iABAFD4682B8AD025/image-size/large?v=1.0&amp;amp;px=600" border="0" alt="geocode3.PNG" title="geocode3.PNG" /&gt;</description>
    <pubDate>Thu, 17 Mar 2016 15:25:30 GMT</pubDate>
    <dc:creator>mayasak</dc:creator>
    <dc:date>2016-03-17T15:25:30Z</dc:date>
    <item>
      <title>Comparing 2 datasets</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Comparing-2-datasets/m-p/257097#M268931</link>
      <description>&lt;DIV class="lp_chat_line lp_chat_visitor"&gt;&lt;DIV class="lp_chat_by"&gt;I have 2 sets of data that are&amp;nbsp;geocoded for the "small area", years&amp;nbsp;(2003-2007) and (2008 to 2012).&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV class="lp_chat_line lp_chat_visitor lp_chat_repeating_source"&gt;&lt;DIV class="lp_chat_by"&gt;The&amp;nbsp;two sets are&amp;nbsp;geocoded in a different way. I'm trying to concatenate both sets and see if there's a difference in death rates between the two periods (years).&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV class="lp_chat_line lp_chat_visitor lp_chat_repeating_source"&gt;&lt;DIV class="lp_chat_by"&gt;I have this code:&lt;/DIV&gt;&lt;DIV class="lp_chat_by"&gt;data mort03_07;&lt;BR /&gt;set data.mortsarea99_09_newrace;&lt;BR /&gt;if NMRes=1;&lt;BR /&gt;if 2003&amp;lt;=year&amp;lt;=2007;&lt;BR /&gt;sarea134=sarea133;&lt;BR /&gt;if sarea133=100 then sarea134=99;&lt;BR /&gt;&lt;BR /&gt;geo=2; *not geocoded;&lt;BR /&gt;if 1&amp;lt;=sarea134&amp;lt;=108 then geo=1; *geocoded;&lt;BR /&gt;&lt;BR /&gt;run;&lt;BR /&gt;data mort08_12;&lt;BR /&gt;set final.death99_13geo_14ungeo_ibis_std;&lt;BR /&gt;if NMRes=1;&lt;BR /&gt;if 2008&amp;lt;=year&amp;lt;=2012;&lt;BR /&gt;&lt;BR /&gt;geo=2; *not geocoded;&lt;BR /&gt;if 1&amp;lt;=sarea134&amp;lt;=108 then geo=1; *geocoded;&lt;BR /&gt;&lt;BR /&gt;run;&lt;BR /&gt;&lt;BR /&gt;/*&lt;BR /&gt;proc summary data=mort03_07;&lt;BR /&gt;var x geo;&lt;BR /&gt;class fipscode;&lt;BR /&gt;output out=numgeo1 sum(geo)=numgeo sum(x)=totnum;&lt;BR /&gt;run;&lt;BR /&gt;&lt;BR /&gt;proc summary data=mort08_12;&lt;BR /&gt;var x geo;&lt;BR /&gt;class fipscode;&lt;BR /&gt;output out=numgeo2 sum(geo)=numgeo sum(x)=totnum;&lt;BR /&gt;run;&lt;BR /&gt;&lt;BR /&gt;data numge01;&lt;BR /&gt;set numgeo1;&lt;BR /&gt;period=1;&lt;BR /&gt;run;&lt;BR /&gt;data numge02;&lt;BR /&gt;set numgeo2;&lt;BR /&gt;period=2;&lt;BR /&gt;run;&lt;BR /&gt;&lt;BR /&gt;data numgeo;&lt;BR /&gt;set numge01 numge02;&lt;BR /&gt;geopct=numgeo/totnum;&lt;BR /&gt;run;&lt;BR /&gt;&lt;BR /&gt;proc print data=numgeo1; title 'geocoded, period1';&lt;BR /&gt;proc print data=numgeo2; title 'geocoded, period2';&lt;BR /&gt;proc print data=numgeo; title 'geocoded, both periods';&lt;BR /&gt;run;&lt;/DIV&gt;&lt;DIV class="lp_chat_by"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="lp_chat_by"&gt;When I run the &lt;SPAN&gt;proc print data=numgeo; title 'geocoded, both periods';&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV class="lp_chat_by"&gt;&lt;SPAN&gt;I get the&amp;nbsp;geopct &amp;gt;1 &amp;nbsp;(because the numgeo &amp;gt; totnum)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV class="lp_chat_by"&gt;&lt;SPAN&gt;I'm not sure what am I doing wrong here.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV class="lp_chat_by"&gt;&lt;SPAN&gt;Thank you,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV class="lp_chat_by"&gt;&lt;SPAN&gt;Ruzeina&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 16 Mar 2016 18:39:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Comparing-2-datasets/m-p/257097#M268931</guid>
      <dc:creator>mayasak</dc:creator>
      <dc:date>2016-03-16T18:39:01Z</dc:date>
    </item>
    <item>
      <title>Re: Comparing 2 datasets</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Comparing-2-datasets/m-p/257118#M268932</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/47035"&gt;@mayasak﻿&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Without sample data it's a bit hard to say, but my first guess is that the coding of variable GEO might be not ideal: If NUMGEO is to be the &lt;EM&gt;number&lt;/EM&gt; of geocoded items, the code for "not geocoded" should be 0, not 2. Otherwise NUMGEO is likely to be too large, leading to incorrectly large values of GEOPCT, possibly GEOPCT&amp;gt;1, as you've observed.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Mar 2016 19:58:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Comparing-2-datasets/m-p/257118#M268932</guid>
      <dc:creator>FreelanceReinh</dc:creator>
      <dc:date>2016-03-16T19:58:12Z</dc:date>
    </item>
    <item>
      <title>Re: Comparing 2 datasets</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Comparing-2-datasets/m-p/257328#M268933</link>
      <description>&lt;P&gt;Ya you're so right. We're dealing with counts here. Thanks a lot &lt;A href="https://communities.sas.com/t5/user/viewprofilepage/user-id/32733" target="_self"&gt;&lt;SPAN class="login-bold"&gt;FreelanceReinhard&lt;/SPAN&gt;&lt;/A&gt;. I just have another question if you don't mind. As I said, I have to see if there is any difference in death rates in small areas due to difference in geocoding in two different data sets (period 1,years 2003-2007, and&amp;nbsp;period 2,&lt;SPAN&gt;years 2008-2012&lt;/SPAN&gt;). Do you have any thoughts about how can I do it&amp;nbsp;? So far I've calculated percentage of geocoded data in each county (not small areas), and ANOVA tests with "period" and "sarea" as independent variables and "cause of death" as dependent variable (couldn't do the interaction terms due to 0 degrees of freedom for errors).&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;BR /&gt;&lt;IMG src="https://communities.sas.com/t5/image/serverpage/image-id/12385iABAFD4682B8AD025/image-size/large?v=1.0&amp;amp;px=600" border="0" alt="geocode3.PNG" title="geocode3.PNG" /&gt;</description>
      <pubDate>Thu, 17 Mar 2016 15:25:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Comparing-2-datasets/m-p/257328#M268933</guid>
      <dc:creator>mayasak</dc:creator>
      <dc:date>2016-03-17T15:25:30Z</dc:date>
    </item>
    <item>
      <title>Re: Comparing 2 datasets</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Comparing-2-datasets/m-p/257349#M268934</link>
      <description>&lt;P&gt;As this is a completely different question, it will be better if you open a new&amp;nbsp;thread for it. To do this, you should select a different forum within the SAS Support Communities: Analytics --&amp;gt; SAS Statistical Procedures.&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;There you will attract a more targeted audience. Also, it will be helpful to describe your data a little more (types of variables and their meaning). I am not familiar with geocoding and its implications for epidemiological research questions.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Mar 2016 16:22:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Comparing-2-datasets/m-p/257349#M268934</guid>
      <dc:creator>FreelanceReinh</dc:creator>
      <dc:date>2016-03-17T16:22:27Z</dc:date>
    </item>
  </channel>
</rss>

