huongc2
Calcite | Level 5

Hi everyone,

 

I am having trouble cleaning these addresses. At first it was easy with a few IF statements, but as my data grows, adding more IF statements is no longer efficient. I would love to hear your thoughts on how to clean this field.

 

I have:

data have;
  infile datalines truncover;  /* short lines must not flow into the next record */
  input address $200.;         /* formatted input reads the whole line, embedded blanks included */
  datalines;
100 ABC ST RM S02A
102 ABC STREET FLOOR 1
103 ABC ST APT 3 HOMELESS
1035 CD AVENUE FLOOR 2
108 SOMETHING ST # 2FL
115 VISA VISTA DR APT 212 APT 212
1155 LOOK AVENUE APT 205
12 BORED AVE APT 2
1214 TIRED STREET APT 428
127 HAPPY STREET FLOOR 2
1397 SOMEWHERE STREET FIRST FLOOR
142 SOMETHING ST APT 3
200 RAINBOW AVE UNIT 202
;

 

I don't want any unit, floor, or apartment number in the clean address, so I want the address field to look like this:

100 ABC ST

102 ABC STREET

103 ABC ST

1035 CD AVENUE

108 SOMETHING ST

115 VISA VISTA DR

1155 LOOK AVENUE

12 BORED AVE

1214 TIRED STREET

127 HAPPY STREET

1397 SOMEWHERE STREET

142 SOMETHING ST

200 RAINBOW AVE

 

Thank you so much!

2 REPLIES
SASKiwi
PROC Star

What is your actual use case? If your address cleaning needs to scale up to enterprise-wide customer volumes and techniques, then you would be better off using a tool built for this task, such as SAS Data Quality. On the other hand, if you are cleaning a few hundred addresses with a few transformation rules like the ones in your post, you are probably better off persevering with your current approach.
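
For the rule-based route, one way to keep the rules manageable is to fold them into a single regular expression instead of a chain of IF statements. The following is only a minimal sketch, not a general address parser. It assumes the unit part always begins with one of the keywords seen in the sample rows (APT, UNIT, RM, FLOOR, FIRST FLOOR, or #) and that everything after that keyword can be dropped. The output data set name want and the variable name clean_address are made up for illustration.

```sas
/* Sketch only: the keyword list is an assumption drawn from the sample
   rows and is not a complete list of unit designators. */
data want;
    set have;
    length clean_address $200;
    /* Remove everything from the first unit keyword (or '#') to the end
       of the string. \b stops a keyword from matching inside a longer
       word; \s+ requires the keyword to follow a blank. */
    clean_address = prxchange(
        's/\s+(#|(APT|UNIT|RM|FIRST\s+FLOOR|FLOOR)\b).*$//i',
        1, strip(address));
run;
```

A rule like this would also truncate a street whose name happens to be one of the keywords (for example, a hypothetical "12 FLOOR AVE"), so it is worth checking the rows where clean_address differs from address before trusting the result. New designators can be added to the alternation as they turn up in the data.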

huongc2
Calcite | Level 5

Ah, thank you. I thought SAS might have a function for this that I didn't know about 😄
