I have a repeated measurement data of individuals. How to mask the unique identifier variable (before data sharing) while keeping nature of repeated records and other logics embedded? Uniq_id variable is very long in and length and numeric. Using SAS 9.4.
data temp;
input uniq_id;
datalines;
2007122345567889
2007122345567889
2007122345567889
2008235689875421
2008235689875421
2008235689875421
;
data temp; set temp;
format uniq_id 20.;
run;
1. Create a list of your ID's, only unique values
2. Create a list of random IDs in the data set from step1, keeping the seed value stored - you'll want to keep track of the seeds over time so I recommend keeping a master file of seeds.
3. Match ID to RandomID so that an ID for a person is constant throughout the data set but it doesn't have the any significance.
Fully worked example here:
https://gist.github.com/statgeek/fd94b0b6e78815430c1340e8c19f8644
1. Create a list of your ID's, only unique values
2. Create a list of random IDs in the data set from step1, keeping the seed value stored - you'll want to keep track of the seeds over time so I recommend keeping a master file of seeds.
3. Match ID to RandomID so that an ID for a person is constant throughout the data set but it doesn't have the any significance.
Fully worked example here:
https://gist.github.com/statgeek/fd94b0b6e78815430c1340e8c19f8644
Don't miss out on SAS Innovate - Register now for the FREE Livestream!
Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.