<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Address cleaning in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521212#M73405</link>
    <description>Are you using SAS Base or do you have access to SAS Data Management Studio/Dataflux?</description>
    <pubDate>Thu, 13 Dec 2018 17:30:14 GMT</pubDate>
    <dc:creator>Reeza</dc:creator>
    <dc:date>2018-12-13T17:30:14Z</dc:date>
    <item>
      <title>Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521206#M73404</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I have an address field that I am trying to clean. I would like to know how to remove 0s from the string.&lt;/P&gt;&lt;P&gt;For example, I have addresses like&lt;/P&gt;&lt;P&gt;00000 MARTIN LUTHER KING JR BLVD&lt;/P&gt;&lt;P&gt;00000 MEMPHIS AVE&lt;/P&gt;&lt;P&gt;and some are like&lt;/P&gt;&lt;P&gt;0MAIN STREET&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;How do I turn them into&lt;/P&gt;&lt;P&gt;MARTIN LUTHER KING JR BLVD&lt;/P&gt;&lt;P&gt;MEMPHIS AVE&lt;/P&gt;&lt;P&gt;MAIN STREET&lt;/P&gt;</description>
      <pubDate>Thu, 13 Dec 2018 17:20:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521206#M73404</guid>
      <dc:creator>ARTI1</dc:creator>
      <dc:date>2018-12-13T17:20:35Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521212#M73405</link>
      <description>Are you using SAS Base or do you have access to SAS Data Management Studio/Dataflux?</description>
      <pubDate>Thu, 13 Dec 2018 17:30:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521212#M73405</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2018-12-13T17:30:14Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521214#M73406</link>
      <description>&lt;P&gt;SAS BASE&lt;/P&gt;</description>
      <pubDate>Thu, 13 Dec 2018 17:32:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521214#M73406</guid>
      <dc:creator>ARTI1</dc:creator>
      <dc:date>2018-12-13T17:32:52Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521216#M73407</link>
      <description>&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data have;
input address:&amp;amp;$200.;
/* COMPRESS removes every 0 in the string, including 0s inside house numbers */
newaddress=strip(compress(address,'0'));
cards;
00000 MARTIN LUTHER KING JR BLVD
00000 MEMPHIS AVE
0MAIN STREET
;&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 13 Dec 2018 17:37:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521216#M73407</guid>
      <dc:creator>Jagadishkatam</dc:creator>
      <dc:date>2018-12-13T17:37:21Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521217#M73408</link>
      <description>&lt;P&gt;That won't work, because there will be addresses such as 21040 Martin Luther King Jr Blvd, and COMPRESS would erase the 0s in the house number as well. SAS Dataflux has an address cleaning component. &lt;BR /&gt;&lt;BR /&gt;There's a paper on lexjansen.com that parses and cleans addresses, for the US at least, which may work here. &lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://analytics.ncsu.edu/sesug/2008/CC-028.pdf" target="_blank"&gt;https://analytics.ncsu.edu/sesug/2008/CC-028.pdf&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
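&lt;P&gt;To make the concern concrete, here is a rough Base SAS sketch. It is not the program from the paper. It assumes the house number is the first blank-delimited token of the address, and it drops that token only when it consists entirely of 0s. The names want, first_token and cleaned are hypothetical, chosen just for this illustration.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data have;
  infile datalines truncover;
  input address $200.;
datalines;
00000 MARTIN LUTHER KING JR BLVD
21040 MARTIN LUTHER KING JR BLVD
0MAIN STREET
;

data want;
  length first_token cleaned $200;
  set have;
  /* Assumption: the first blank-delimited token is the house number */
  first_token = scan(address, 1, ' ');
  /* VERIFY returns 0 when first_token contains nothing but the character 0 */
  if verify(trim(first_token), '0') = 0 then
    cleaned = left(substr(address, length(first_token) + 1));
  else cleaned = address;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Under that assumption, 21040 MARTIN LUTHER KING JR BLVD keeps its house number, while the all-0 token in the first row is dropped. A value such as 0MAIN STREET is left unchanged by this check, because its first token is not made up only of 0s.&lt;/P&gt;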
&lt;P&gt;Edit: what I would do is run that program, which separates each address into its components, and then check the house number field to see whether it is all 0s instead.&lt;/P&gt;</description>
      <pubDate>Thu, 13 Dec 2018 17:41:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521217#M73408</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2018-12-13T17:41:12Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521218#M73409</link>
      <description>&lt;P&gt;Another way, which removes only the 0s at the beginning of the address, is to use a Perl regular expression:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data have;
input address:&amp;amp;$200.;
/* ^0+ matches only the run of 0s at the very start of the string */
newaddress=strip(prxchange('s/^0+//',-1,strip(address)));
cards;
00000 MARTIN LUTHER KING JR BLVD
00000 MEMPHIS AVE
0MAIN STREET
;&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 13 Dec 2018 17:40:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521218#M73409</guid>
      <dc:creator>Jagadishkatam</dc:creator>
      <dc:date>2018-12-13T17:40:16Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521219#M73410</link>
      <description>&lt;P&gt;Thank you, Jagdish, but I also have addresses like 1023 Main Street or 420 Main Street, and I do not want to compress those 0s. The 0s I want to remove are at the start of the string: in some cases there are multiple 0s, and in others a single 0 is followed directly by letters with no space.&lt;/P&gt;</description>
      <pubDate>Thu, 13 Dec 2018 17:40:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521219#M73410</guid>
      <dc:creator>ARTI1</dc:creator>
      <dc:date>2018-12-13T17:40:20Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521249#M73416</link>
      <description>&lt;P&gt;Or rather:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data have;
input address:&amp;amp;$200.;
/* ^0+ matches the run of leading 0s; \s* also removes any blanks that follow them */
newaddress=prxchange('s/^0+\s*//',1,left(address));
cards;
00000 MARTIN LUTHER KING JR BLVD
00000 MEMPHIS AVE
0MAIN STREET
20 MAIN STREET
;&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 13 Dec 2018 19:02:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521249#M73416</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2018-12-13T19:02:54Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521263#M73417</link>
      <description>&lt;P&gt;Cleaned a lot of addresses. Thank you.&lt;/P&gt;</description>
      <pubDate>Thu, 13 Dec 2018 19:29:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521263#M73417</guid>
      <dc:creator>ARTI1</dc:creator>
      <dc:date>2018-12-13T19:29:08Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521267#M73418</link>
      <description>&lt;P&gt;Thank you Reeza. Very informative paper.&lt;/P&gt;</description>
      <pubDate>Thu, 13 Dec 2018 19:30:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/521267#M73418</guid>
      <dc:creator>ARTI1</dc:creator>
      <dc:date>2018-12-13T19:30:12Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/727378#M80311</link>
      <description>&lt;P&gt;This does not quite answer my problem. I have addresses like these&lt;/P&gt;&lt;P&gt;3235 Cambridge Avenue, #6H, Bronx, New York 10463&lt;BR /&gt;2 Harrison Street, PH, Poughkeepsie, New York 12601&lt;BR /&gt;130 Schroeders Avenue, Apt. 12G, Brooklyn, New York 11239&lt;BR /&gt;220 Wadsworth Avenue, Apt. 307, New York, New York 10033&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I want to break them apart. For the first one, I want 3235 in one variable, Cambridge Avenue in the second variable, #6H in the third variable, Bronx in the fourth variable, New York in the fifth variable and 10463 in the sixth variable.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Mar 2021 12:27:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/727378#M80311</guid>
      <dc:creator>sekarpc</dc:creator>
      <dc:date>2021-03-18T12:27:55Z</dc:date>
    </item>
    <item>
      <title>Re: Address cleaning</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/727403#M80312</link>
      <description>&lt;P&gt;Please post your problem as a new topic.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Mar 2021 13:43:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Address-cleaning/m-p/727403#M80312</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2021-03-18T13:43:48Z</dc:date>
    </item>
  </channel>
</rss>

