About Ian_K

Ian_K · 07-04-2015

I think that PG Stats has provided a completely correct solution. I suspect the problem is on my end, and I will refer it to my IT group.

Thank you,
Ian

Ian_K · 07-02-2015

Hello Datasp,

The proc print output for the pharm input data is this:

Obs    DRUG_ID    symptom
  1       1022    BP
  2       1022    ARRYTHMIA
  3       1023    ANGINA
  4       1024    BP
  5       1024    ANGINA
  6       1024    PALP

And for the Patient input data it is this:

Obs    PATIENT_ID    AILMENT_WORDS
  1        245762    DIAHREA,NASEAU,ABDOMINAL CRAMPS
  2        238761    MIGRAINE,ANGINA
  3        239978    BP,HEADACHE,COUGH,THROAT PAIN

Ian_K · 07-02-2015

Hello datasp,

My comment #7 above, which shows my log, says it all. When I copy and paste the code as is and run it in SAS, I end up with zero observations in the SQL steps. I'm not sure why. The output you show is exactly what I need, but it just isn't happening for me.

Thank you for replying,
Ian

Ian_K · 07-02-2015

PG STATS had some good suggestions, but they didn't completely work out. Anybody else have any great ideas to solve this one? Thank you to all! Ian

Ian_K · 07-01-2015

Hello PG Stats! You are amazing!! Thank you once again. I entered your code from part 6 of this conversation exactly as is (I copied and pasted it into SAS) and I still get 0 obs. Sorry to bother you, but do you have any further ideas?

All the best to you,
Ian

Ian_K · 06-30-2015

Thank you PG Stats for your partial solution - much appreciated!!
Ian

Each step works fine until we get to the PROC SQLs. You can see below that at line 44 we end up with zero obs. Here is the log file from the SQL steps:

39   proc sql;
40   create table drugs as
41   select PATIENT_ID, DRUG_ID
42   from pat as a inner join
43   pharm as b on b.symptom = a.ailment
44   order by PATIENT_ID;
NOTE: Table WORK.DRUGS created, with 0 rows and 2 columns.

45   quit;
NOTE: PROCEDURE SQL used (Total process time):
      real time           0.31 seconds
      cpu time            0.06 seconds

46
47   data drugs10;
48   set drugs; by PATIENT_ID;
49   if first.PATIENT_ID then count = 0;
50   count + 1;
51   if count <= 10;
52   drop count;
53   run;

NOTE: There were 0 observations read from the data set WORK.DRUGS.
NOTE: The data set WORK.DRUGS10 has 0 observations and 2 variables.
NOTE: DATA statement used (Total process time):
      real time           0.03 seconds
      cpu time            0.01 seconds
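An inner join that returns 0 rows while both inputs contain rows means that no pair of key values compares equal. One thing that may be worth checking, offered only as a sketch: whether the values of symptom in pharm and ailment in pat differ in case, leading blanks, or non-printing characters. The table and column names below are taken from the log above. DRUGS_CHECK is a hypothetical table name, and nothing in the log confirms that a key mismatch of this kind is the actual cause.

```sas
/* List the distinct key values with their hex form and length.       */
/* $HEX32. shows the first 16 bytes, which exposes leading blanks (20) */
/* and non-printing characters such as tabs (09) or returns (0D).      */
proc sql;
    select distinct symptom,
           put(symptom, $hex32.) as symptom_hex,
           length(symptom)       as len
    from pharm;

    select distinct ailment,
           put(ailment, $hex32.) as ailment_hex,
           length(ailment)       as len
    from pat;
quit;

/* Repeat the join with both keys stripped and upper-cased.            */
/* DRUGS_CHECK is a hypothetical name used only for this comparison.   */
proc sql;
    create table drugs_check as
    select a.patient_id, b.drug_id
    from pat as a inner join
         pharm as b on upcase(strip(b.symptom)) = upcase(strip(a.ailment))
    order by a.patient_id;
quit;
```

If DRUGS_CHECK has rows where DRUGS has none, the keys differ only in case or surrounding blanks. If it is also empty, the hex listing should show whether the two columns hold comparable values at all.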

Ian_K · 06-29-2015

I'm looking for a list of a maximum of 10 drugs per patient. Accordingly some ailments will not be addressed, but all patients will be addressed.

Ian_K · 06-29-2015

Hi Ballardw,

I'm just looking for up to 10 matching drugs per patient, for any of that patient's ailments. It is therefore possible that some ailments will not be addressed for a given patient. The end result would be something along the lines of:

User1 drug1
User1 drug2
User1 drug3
...
User1 drug10

I know that sounds strange, but it is a theoretical rather than a practical exercise. Thank you for replying!

Ian_K · 06-29-2015

I would like to match a maximum of 10 pharmaceuticals to each patient, based on the symptoms addressed by the drugs and the symptoms recorded for each patient. However, there is no common variable to merge on, and merging all pharmaceuticals to all patients and then using the INDEX function to find suitable drugs is not practical: it would produce a dataset of 200 million drugs x 25 million patients, or 5,000 trillion records.

* THE DATASET PHARMACEUTICALS IN REALITY HAS 200 MILLION RECORDS;
DATA PHARMACEUTICALS;
INFILE CARDS TRUNCOVER;
INPUT DRUG_ID SYMPTOM_WORD1 : $20. SYMPTOM_WORD2 : $20. SYMPTOM_WORD3 : $20.;
CARDS;
1022 BP ARRYTHMIA
1023 ANGINA
1024 BP ANGINA PALP
;
RUN;

* THE DATASET PATIENTS IN REALITY HAS 25 MILLION RECORDS;
* SYMPTOM_WORDS ARE CONCATENATED TOGETHER AND SEPARATED BY COMMAS;
DATA PATIENTS;
INPUT PATIENT_ID AILMENT_WORDS & $100.;
CARDS;
245762 DIAHREA,NASEAU,ABDOMINAL CRAMPS
238761 MIGRAINE,ANGINA
239978 BP,HEADACHE,COUGH,THROAT PAIN
;
RUN;

Ideally, if there were a common variable, I would merge the two datasets, use the INDEX function to find drugs that match patient symptoms, and set a counter to limit these matches to 10 per patient, like this:

IF INDEX(AILMENT_WORDS, SYMPTOM_WORD1) GE 1 THEN EVENTMATCH+1;
IF INDEX(AILMENT_WORDS, SYMPTOM_WORD2) GE 1 THEN EVENTMATCH+1;
IF INDEX(AILMENT_WORDS, SYMPTOM_WORD3) GE 1 THEN EVENTMATCH+1;

I would set a loop counter using first.patient_ID to ensure no more than 10 drug matches per patient. However, I cannot perform the merge "BY PATIENT_ID" because PATIENT_ID is not common to both datasets, so I cannot use first.patient_ID.

Any ideas?
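One way to get a common variable, sketched here under assumptions rather than offered as a definitive answer, is to reshape both datasets so that each row holds a single symptom, and then join on that symptom. An equality join of this kind does not need the full drugs x patients cross product. The names PHARM_LONG, PAT_LONG, MATCHES and MATCHES10 are hypothetical, as is the $40 length chosen for the symptom text. How this performs at 200 million and 25 million records is not something the sketch addresses.

```sas
/* One row per drug and symptom (PHARM_LONG is a hypothetical name). */
data pharm_long;
    set pharmaceuticals;
    length symptom $40;
    array words {3} symptom_word1-symptom_word3;
    do i = 1 to dim(words);
        if not missing(words{i}) then do;
            symptom = upcase(strip(words{i}));
            output;
        end;
    end;
    keep drug_id symptom;
run;

/* One row per patient and ailment. Only the comma is a delimiter,   */
/* so a value such as ABDOMINAL CRAMPS stays in one piece.           */
data pat_long;
    set patients;
    length ailment $40;
    do i = 1 to countw(ailment_words, ',');
        ailment = upcase(strip(scan(ailment_words, i, ',')));
        output;
    end;
    keep patient_id ailment;
run;

/* Join on the shared symptom text. DISTINCT keeps each drug once    */
/* per patient even when it matches several ailments.                */
proc sql;
    create table matches as
    select distinct a.patient_id, b.drug_id
    from pat_long as a inner join
         pharm_long as b on a.ailment = b.symptom
    order by a.patient_id, b.drug_id;
quit;

/* Keep at most 10 drugs per patient using FIRST.PATIENT_ID.         */
data matches10;
    set matches;
    by patient_id;
    if first.patient_id then n = 0;
    n + 1;
    if n <= 10;
    drop n;
run;
```

Because this is an inner join, a patient with no matching symptom does not appear in MATCHES10 at all. The sketch keeps the first 10 drugs in DRUG_ID order, which is an arbitrary choice; any other ranking would need its own sort key.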

Online Status	Offline
Date Last Visited	09-01-2015 07:12 AM

Re: Conditionally match merging records from two datasets without a co...

Conditionally match merging records from two datasets without a common...
