Hi all,
I've been given an algorithm to select patients who have a certain type of illness, based on a set of ICD-9 codes, inclusion procedure codes, and other criteria (age, region, etc.).
Generally, with claims data (this is Truven), should I clean the entire set first and then isolate my sample, or isolate my sample and then clean?
Thanks,
I agree in general with @Reeza, but experience has taught me that if age is involved, you should always at least check it early in any process where it is important.
Problems such as a date of birth that falls after the date a service was performed, or services that are inappropriate for the patient's age (not to mention gender), might be a concern.
You may also have to consider age at time of service vs. age at data extract, depending on your data systems. Many systems maintain demographics such as birth date separately from services and may calculate an age based on the date of the extract for each record, even though the services were performed on different dates.
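To make the age point concrete, here is a minimal sketch in Python (not SAS, and not anyone's actual cleaning code) of the two checks described: flagging a birth date that falls after the service date, and comparing age at service with age at extract. The function names, field names, and dates are hypothetical and chosen only for illustration.

```python
from datetime import date


def age_at(birth_date: date, on_date: date) -> int:
    """Age in completed years on on_date."""
    years = on_date.year - birth_date.year
    # Subtract one if the birthday has not yet occurred in on_date's year.
    if (on_date.month, on_date.day) < (birth_date.month, birth_date.day):
        years -= 1
    return years


def check_claim(birth_date: date, service_date: date, extract_date: date) -> dict:
    """Return both ages plus a flag for an impossible birth date."""
    return {
        "dob_after_service": birth_date > service_date,
        "age_at_service": age_at(birth_date, service_date),
        "age_at_extract": age_at(birth_date, extract_date),
    }


# Hypothetical record: born 1950-06-15, service on 2010-03-01,
# data extracted on 2015-12-31.
result = check_claim(date(1950, 6, 15), date(2010, 3, 1), date(2015, 12, 31))
print(result)
```

For this made-up record the two ages differ by six years, so an age cutoff applied to an extract-date age could include or exclude the patient incorrectly.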
It depends on your cleaning process. If the cleaning can affect selection, then it needs to go first.
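As a rough illustration of cleaning that affects selection, here is a small Python sketch (not SAS) where diagnosis codes arrive in inconsistent formats. The codes, the inclusion list, the `normalize_icd9` helper, and the claim records are all hypothetical, and the assumption that the inclusion list stores codes without decimal points is mine, not something stated in this thread.

```python
def normalize_icd9(code: str) -> str:
    """Strip whitespace and decimal points, and uppercase any letter prefix."""
    return code.strip().replace(".", "").upper()


# Hypothetical inclusion list, stored without decimal points.
inclusion_codes = {"25000", "V5861"}

# Hypothetical claims with inconsistently formatted diagnosis codes.
claims = [
    {"patient_id": 1, "dx": "25000"},
    {"patient_id": 2, "dx": "250.00"},
    {"patient_id": 3, "dx": " v5861"},
    {"patient_id": 4, "dx": "4019"},
]

# Selecting before cleaning: only exact string matches are found.
selected_raw = {c["patient_id"] for c in claims if c["dx"] in inclusion_codes}

# Selecting after cleaning: formatting variants are matched as well.
selected_clean = {
    c["patient_id"] for c in claims if normalize_icd9(c["dx"]) in inclusion_codes
}

print(sorted(selected_raw))
print(sorted(selected_clean))
```

Here selecting first would silently drop patients 2 and 3, so this kind of cleaning has to run before selection. Cleaning that only touches fields the algorithm never looks at could reasonably wait until after the sample is isolated.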