Hello,

I have a list of pneumococcal vaccination administrations (n=2,400,000) and am trying to determine the best way to indicate whether each person has met the criteria to be considered complete. The dataset is vertical: I have a unique identifier for each person, and a person who has received more than one vaccination has multiple observations listed.

To be complete, a person needs to meet one of these criteria:

PCV15 + PPSV23
PCV20
PPSV23 + PCV15
PPSV23 + PCV20
PCV13 + PPSV23
PCV13 + PCV20

What would be the most efficient way to handle marking individuals as complete? Below is an example, using dummy data, of what I have vs. what I want.

HAVE:

ID  Vaccine
1   PCV15
1   PCV13
1   PPSV23
2   PCV13
2   PPSV23
3   PCV20
4   PPSV23
4   PCV15
5   PCV13

WANT (where 1 = complete and 0 = incomplete):

ID  Pneumo Complete
1   1
2   1
3   1
4   1
5   0

Thank you
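For illustration, here is a minimal sketch of the rule in Python. The post does not say which language or tool holds the data, so Python is an assumption, as are the names `COMPLETE_SETS` and `mark_complete`. The sketch also assumes that the order of administration does not matter, since the example data has no date column. It collects each person's vaccines into a set and checks whether any qualifying combination is contained in it.

```python
from collections import defaultdict

# Qualifying combinations as listed in the post. Order of administration is
# ignored (assumption: the example data has no date column), so
# "PCV15 + PPSV23" and "PPSV23 + PCV15" collapse into one set.
COMPLETE_SETS = [
    {"PCV15", "PPSV23"},
    {"PCV20"},
    {"PPSV23", "PCV20"},
    {"PCV13", "PPSV23"},
    {"PCV13", "PCV20"},
]


def mark_complete(records):
    """Return {id: 1 or 0} from an iterable of (id, vaccine) pairs."""
    vaccines_by_id = defaultdict(set)
    for person_id, vaccine in records:
        vaccines_by_id[person_id].add(vaccine)
    # A person is complete if any required set is a subset of what they received.
    return {
        person_id: int(any(required <= received for required in COMPLETE_SETS))
        for person_id, received in vaccines_by_id.items()
    }


# Dummy HAVE data from the post.
have = [
    (1, "PCV15"), (1, "PCV13"), (1, "PPSV23"),
    (2, "PCV13"), (2, "PPSV23"),
    (3, "PCV20"),
    (4, "PPSV23"), (4, "PCV15"),
    (5, "PCV13"),
]

want = mark_complete(have)
for person_id, flag in sorted(want.items()):
    print(person_id, flag)
```

On the dummy data this reproduces the WANT table (IDs 1 to 4 flagged 1, ID 5 flagged 0). Because PCV20 alone qualifies, the two combinations that include PCV20 are redundant under this reading; the rule reduces to "PCV20, or PPSV23 together with PCV13 or PCV15". If sequence or timing between doses matters, the check would need a date column, which the example does not include.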