BookmarkSubscribeRSS Feed
dereck255
Fluorite | Level 6

At work, was asked a question about SAS and not sure of the answer. Basically, we want to SAS to search the internet (maybe just clinicaltrials.gov) or RSS feeds (RSS is probably easier) to see when companies post about their new studies using various search terms (like, hypertension). SAS can run this overnight and would email us the results every morning. I can do the email and overnight code, but I have no clue how to do this potentially text mining procedure with only using SAS 9. Sounds pretty easy, but not sure where to start. Any ideas?

2 REPLIES 2
ChrisNZ
Tourmaline | Level 20

You'll need to use the usual text manipulation functions: find(), scan(), spelling distances, and so on, to analyse the pages.

 

You'll also need to access and read the web sites. Look at : filename method URL & proc http.

AlanC
Barite | Level 11

Consider gathering the data outside of SAS and then use SAS for analysis. Languages such as C# and Python are much better at web technologies such as REST, RSS, etc. Dump to tab-delimited files or an RDBMS then have SAS chug away at it.

 

This is a good case, IMO, or combo technologies vs relying on one or another. There are also existing tools that can do the monitoring vs rolling your own. Command l;ine them then have SAS execute them for you.

 

https://github.com/savian-net

sas-innovate-2026-white.png



April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!

Register now

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 1242 views
  • 0 likes
  • 3 in conversation