I have a code base that synthesizes a zip code to a claims dataset with a county code. The zip data is from one dataset and the claims dataset has the fips/county code. I can provide samples of both. We have been running this for several years now without major issues. However, the time to run this job is taking around 10-11 days whereas in the past is was closer to 6. I believe there to be changes to the environment, but I unfortunately cannot affect those changes. So I have to work around them. This is likely a big ask, but I was wondering if there was a possibility of optimizing the code for a potentially better production experience (shortened run time and QC). I am attaching a zip file of the code, data, and format in a zip file that I hope has all of the information. I realize this is a big ask of anyone, I'm just kind of stuck trying to figure out if there is a better way.
... View more