About lim_6

lim_6 · 05-03-2016

PGStats, do I need to sort each dataset by SSN and key/event date first and then use your syntax? Or would it not matter? For some reason I wasn't getting that same outcome. Thanks!

lim_6 · 05-02-2016

I am looking for the eventDate that is closest to and precedes keyDate:

ID  keyDate     eventDate
1   01/11/2005  12/15/2004
1   05/13/2005  04/18/2005
1   09/21/2006  05/13/2006
2   04/21/2008  03/17/2008
3   05/03/2007  03/21/2007
3   05/09/2007  03/21/2007

lim_6 · 05-02-2016

PGStats, the syntax you provided matches the key date to the maximum event date for each identifier, which is not what I'm looking for. I need each key date to be matched to an event date that occurs prior to or on the same day as that key date. For many individuals, some event dates fall well after a given key date, so pulling max(EventDate) per identifier doesn't find the event date closest to each key date. Not sure if that makes complete sense, so if you need me to elaborate, I'd be more than happy to do so. Thanks!

lim_6 · 05-02-2016

This question is very similar to the "Find the closest event date prior to another date; sets linked by same ID" post. I have a dataset with individual IDs and key dates. Each individual has at least one key date, but may have more. I have another dataset with the ID and multiple event dates (each is an observation) for each individual. I need to find the event date that is closest to (before) or the same as the key date for each individual. Individuals may have more than 20 event dates. I tried using the syntax in the post mentioned above, but it gets rid of the additional key dates and I need all key dates to match up with the closest event date.

ID & Key date:

ID  Keydate
1   01/11/2005
1   05/13/2005
1   09/21/2006
2   04/21/2008
3   05/03/2007
3   05/09/2007

ID & Event date:

ID  Eventdate
1   10/09/2004
1   12/15/2004
1   03/11/2005
1   04/18/2005
1   05/13/2006
1   08/05/2007
2   03/15/2008
2   03/17/2008
2   04/14/2008
2   04/27/2008
2   05/29/2009
3   02/10/2004
3   01/04/2007
3   03/21/2007
3   08/14/2007
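
One possible way to express the matching described above is a PROC SQL join that only pairs each key date with event dates on or before it, then keeps the latest such event date per ID and key date. This is only a sketch under assumptions: the dataset names keys and events and the variable names keyDate and eventDate are hypothetical stand-ins, and only a few of the rows listed above are loaded.

```sas
/* Hypothetical dataset: one row per ID and key date */
data keys;
    input ID keyDate :mmddyy10.;
    format keyDate mmddyy10.;
    datalines;
1 01/11/2005
1 05/13/2005
3 05/03/2007
3 05/09/2007
;
run;

/* Hypothetical dataset: one row per ID and event date */
data events;
    input ID eventDate :mmddyy10.;
    format eventDate mmddyy10.;
    datalines;
1 10/09/2004
1 12/15/2004
1 03/11/2005
1 04/18/2005
3 01/04/2007
3 03/21/2007
3 08/14/2007
;
run;

proc sql;
    create table closest as
    select k.ID,
           k.keyDate format=mmddyy10.,
           /* latest event among those on or before this key date */
           max(e.eventDate) as eventDate format=mmddyy10.
    from keys as k
         left join events as e
           on  k.ID = e.ID
           and e.eventDate <= k.keyDate
    group by k.ID, k.keyDate
    order by k.ID, k.keyDate;
quit;
```

Grouping by both ID and keyDate keeps every key date in the output, and the left join leaves eventDate missing for any key date that has no earlier event. No pre-sorting should be needed for this form, since PROC SQL does not rely on BY-group order.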

lim_6 · 04-05-2016

Hi, I've been working on case-control syntax and tried using the SAS paper found here: http://www2.sas.com/proceedings/sugi29/173-29.pdf

Unfortunately, my sample is vastly larger (~170,000 cases and more than 3 million controls), and this syntax creates a file much larger than necessary. I'm hoping to use a non-macro-based syntax since my SAS has trouble running macros. I'd like to use some sort of PROC SQL syntax, but I really don't know how to make it more efficient from the start and match controls to cases right away without creating a ton of duplicates for each case. Any suggestions or ideas? Anything would be immensely helpful at this point. Thank you!

Syntax:

    proc sql;
        create table controls_id as
        select one.ID       as study_id,
               two.ID       as control_id,
               one.age      as study_age,
               two.age      as control_age,
               one.race     as study_race,
               two.race     as control_race,
               one.rand_num as rand_num
        from study one, control two
        where (one.age=two.age and one.race=two.race);
    quit;

    * Remove duplicate control subjects;
    proc sort data=controls_id nodupkey;
        by control_id rand_num;
    run;

    * Exactly match on variables with fixed number of controls;
    proc sort data=controls_id;
        by study_id rand_num;
    run;

    data controls_id2 not_enough;
        set controls_id;
        by study_id;
        retain num;
        if first.study_id then num=1;
        if num le 2 then do;
            output controls_id2;
            num=num+1;
        end;
        if last.study_id then do;
            if num le 2 then output not_enough;
        end;
    run;

    proc print data=controls_id2 (obs=40);
        title2 'matched patients';
    run;

    * Use following syntax to remove cases that do not have two controls;
    data controls_id3;
        merge controls_id2 not_enough(in=b_);
        by study_id;
        if b_ then delete;
    run;

Date Last Visited: 05-03-2016 04:29 PM

Re: Find date closest to another date for multiple identifiers

Find date closest to another date for multiple identifiers

Matching cases and controls using PROC SQL
