I have 4 datasets: A, B, C and D. Each dataset has around 500 observations. The subject IDs need to be scrambled within the set of available subject IDs. For example, if the actual subjects are 101, 102, 103, 104, they could be scrambled as 103, 101, 104, 102. I have been instructed to merge the 4 datasets and apply the scrambling to the subjects that appear in dataset A. For example, if dataset A has 101 as a subject ID and any of the datasets B, C or D also has 101 as a subject ID, it should be scrambled to the same number everywhere. Unmatched subjects should remain as they are.
Eg 1. A-101 B-101 C-101 D-101 - Actual subjects
A-103 B-103 C-103 D-103 - Scrambled subjects; the scrambled value should be the same across the datasets for a particular subject.
Eg 2. A-104 B-105 C-109 D-104 - Actual subjects
A-107 B-108 C-106 D-107 - Scrambled subjects; note that datasets A and D, which share subject 104, get the same scrambled value.
Please help me with this requirement.
So you want consistent anonymization across a number of datasets?
I agree with @Reeza.
I think this is easy to do, but the question is not 100% clear to me either.
Thanks,
Koen
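If the requirement is indeed consistent scrambling across datasets, the logic can be sketched as follows. This is only an illustration, written in Python rather than SAS to keep it short, and it rests on some assumptions: the subject IDs are plain numbers, the lookup is built from the distinct IDs in dataset A only, and the names `build_scramble_map` and `apply_scramble`, the seed and the toy data are all hypothetical. In SAS the same idea would usually be a lookup table built from A and then joined onto each dataset.

```python
import random


def build_scramble_map(ids_in_a, seed=12345):
    """Pair each distinct subject ID in A with an ID drawn from the same set."""
    unique_ids = sorted(set(ids_in_a))
    shuffled = unique_ids[:]
    # A fixed seed (an arbitrary, assumed value) makes the mapping reproducible.
    random.Random(seed).shuffle(shuffled)
    return dict(zip(unique_ids, shuffled))


def apply_scramble(ids, mapping):
    """Replace IDs found in the mapping; leave unmatched IDs unchanged."""
    return [mapping.get(sid, sid) for sid in ids]


# Hypothetical toy data standing in for the subject columns of A, B, C and D.
a = [101, 102, 103, 104]
b = [101, 104, 999]  # 999 does not occur in A, so it stays as it is
c = [103, 103, 102]
d = [104, 101]

mapping = build_scramble_map(a)
datasets = {"A": a, "B": b, "C": c, "D": d}
scrambled = {name: apply_scramble(ids, mapping) for name, ids in datasets.items()}

for name, ids in scrambled.items():
    print(name, ids)
```

Because one mapping is built once and reused for every dataset, a given subject always receives the same scrambled ID, and IDs that are absent from A pass through untouched. Note that a plain shuffle can leave an ID mapped to itself; if every subject must change, the shuffle would need to be repeated or replaced by a rotation.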