Parsing: Using SAS when the data is hiding in a nonstandard format

Started 06-24-2020
Modified 06-25-2020

What do you do when the data you need is buried in a nonstandard source, such as a text document? In this 22-minute video, independent consultant @KuligowskiAndre shares techniques for working with data that isn't cleanly formatted, walking through an example in which the data sits inside a free-form text file.

Video highlights

01:06 - What is parsing?

03:35 - How to find, identify, keep or reject data points

07:32 - Debugging

13:29 - How to account for periods in text

16:57 - The role of the substring function

20:25 - Why you should clean your data

Read the Paper

Related resources

About the PARSE function (SAS documentation)

SAS Text Miner: overview of text parsing node (SAS documentation) 

How to scrape data from a web page using SAS (blog post)

 

 

Version history
Last update: 06-25-2020 09:56 AM
