Thank you both for your responses.
I am not fully sure of everything you said, as some of that is new to me. However, I am willing to learn!
Each of these data sets pre-exist and have a distinct value associated with them - the patient's account number. It is listed 1 time in each dataset as a identifier.
Examples of each dataset
[DemoGraphic Data]
PatientAccountNum | FirstName | LastName | Birthdate | County | Admission Date etc.
J110114542112 | John | Doe | 01/01/1960 | Duval | 03/20/2023
[Press-Ganey Data]
PatientAccountNum | FirstName | LastName | PTgoals | PTconcerns | OTgoals | OTconcerns
J110114542112 | John | Doe | 5 | 5 | 4 | 5
[RTA Data]
PatientAccountNum | FirstName | LastName | Date_of_event | symptoms | durationOfSymptoms
J110114542112 | John | Doe | 03/24/2023 | SOB,Tachycardia | <1 hour
[Falls Data]
PatientAccountNum | FirstName | LastName | DateofFall | LocationOfFall | TimeOfFall
J110114542112 | John | Doe | 03/22/2023 | Room | 1300
This is assuming that the patient with account number J110114542112 actually had all of these events occur (if it didn't then it would only contain the information that is actually listed)
EXPECTED OUTPUT:
After I remove some of the duplicate data:
PatientAccountNum | FirstName | LastName | Birthdate | County | Admission Date | PG_PTgoals | PG_PTconcerns | PG_OTgoals | PG_OTconcerns | RTA_Date_of_event | RTA_symptoms | RTA_durationOfSymptoms | Fall_DateofFall | Fall_LocationOfFall | Fall_TimeOfFall
J110114542112 | John | Doe | 01/01/1960 | Duval | 03/20/2023 | 5 | 5 | 4 | 5 | 03/24/2023 | SOB,Tachycardia | <1 hour | 03/22/2023 | Room | 1300
I am attempting to tie in patient demographic data that is gathered from our EMR that already pre-exists in a dataset, with these data I want to tie in link/merge the data found in 3 other datasets (all merged on the patient's account number - which exists uniquely to the patients' visit.) using the account number of the patient.
I am currently doing this now, however it is very time consuming and I was wondering if I could learn a faster way that would allow me to flag all of the new data that gets merged from the datasets with a prefix to make it easier to identify exactly where the information is coming from when I am building dashboards. If there is not easily solution I can also just individually pull and rename each of the elements (about 500) but I wanted to check here first.
Thank you!
... View more