Hi,
I am currently working on an automatic data refresh using Windows Task Scheduler. Every night at midnight web scraped data will be uploaded on my computer. I will then run the automated SAS code for data manipulation purposes. However I would only like to run the SAS code on the data that is new. My file size is very large, so I would like to reduce process time by only running the code through the new data rather than the entire data set. I was wondering if there is a code that would only run on new data.
Thank you!
Sure, just add a step to your process so that it extracts the new data only and then runs on that. Exactly how depends a bit more on your data structure. There needs to be a relatively clean way to know what’s new data.
@bklassen wrote:
Hi,
I am currently working on an automatic data refresh using Windows Task Scheduler. Every night at midnight web scraped data will be uploaded on my computer. I will then run the automated SAS code for data manipulation purposes. However I would only like to run the SAS code on the data that is new. My file size is very large, so I would like to reduce process time by only running the code through the new data rather than the entire data set. I was wondering if there is a code that would only run on new data.
Thank you!
is there date and timestamp on the records? you could setup a step or selection statement for processing records using those indicators.
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.