Two de-identification rules are currently being widely misinterpreted in the industry: Recode subject ID and Recode ID variables. Please provide macros that implement these two rules.
Explanation of the two rules and the requirements to follow:
Recode subject ID and Recode ID variable
Recode subject ID: Create a new random, unique subject ID that is not made up of any identifiable information. Site numbers must not be replicated in the recoded subject IDs. The list of original subject IDs and the list of recoded ones must not have any values in common. The same recoded subject ID must be used in extension study data. If the same subject is part of several studies within the same request, consider providing the same subject ID. If excluded patients (protocol inclusion/exclusion criteria) are deleted, this must be documented. A general recommendation is not to delete any patients (even screening failures) or their related data. This rule applies to both SUBJID and USUBJID, and the recoding of the two variables must be consistent between them and across datasets. It is also advised to sort de-identified datasets by the recoded USUBJID before release.
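The subject ID rule above can be sketched roughly as follows. This is written in Python rather than as a SAS macro, purely to illustrate the logic a macro would need to cover. The function names `build_recode_map` and `recode_rows`, their parameters, the alphanumeric ID format, and the choice to write the same recoded value into both SUBJID and USUBJID are all assumptions made for this sketch, not part of the rule text.

```python
import secrets
import string

# Assumption: recoded IDs are random upper-case alphanumeric strings.
ALPHABET = string.ascii_uppercase + string.digits


def build_recode_map(original_ids, forbidden=(), site_numbers=(),
                     existing_map=None, length=10):
    """Map each original subject ID to a random, unique recoded ID.

    forbidden    -- extra original values the new IDs must not equal
                    (for example the original SUBJID values)
    site_numbers -- site numbers that must not appear inside a new ID
    existing_map -- map from an earlier study; its entries are reused so
                    extension-study data gets the same recoded IDs
    """
    recode_map = dict(existing_map or {})
    blocked = set(original_ids) | set(forbidden) | set(recode_map)
    used = set(recode_map.values())
    for orig in sorted(set(original_ids)):
        if orig in recode_map:
            continue  # already recoded in an earlier study
        while True:
            candidate = "".join(secrets.choice(ALPHABET) for _ in range(length))
            if candidate in blocked or candidate in used:
                continue  # would overlap with an original or an existing recode
            if any(site and str(site) in candidate for site in site_numbers):
                continue  # would replicate a site number
            break
        recode_map[orig] = candidate
        used.add(candidate)
    return recode_map


def recode_rows(rows, recode_map, key="USUBJID", targets=("SUBJID", "USUBJID")):
    """Return copies of rows with the target columns recoded.

    Assumption: the original value in `key` identifies the subject, and every
    target column receives the same recoded value. With the default arguments
    the result is sorted by the recoded USUBJID.
    """
    out = []
    for row in rows:
        new_id = recode_map[row[key]]
        new_row = dict(row)
        for col in targets:
            if col in new_row:
                new_row[col] = new_id
        out.append(new_row)
    return sorted(out, key=lambda r: r[key])
```

To keep the recoding consistent across datasets and across an extension study, the same map would be applied to every dataset and passed back in as `existing_map` for later studies, which means it has to be retained and stored securely.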
Recode ID variables: Variables such as Reference ID or Sponsor ID are usually constructed using CRF page numbers or laboratory sample numbers, which are Direct Identifiers and require recoding. The list of original IDs and the list of recoded ones must not have any values in common. This also applies to Investigator ID and Site ID, among others, when applicable.
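For the other ID variables, one possible approach is to build a single lookup per variable across all datasets, so that the same original value always receives the same recoded value. The sketch below is again in Python rather than a SAS macro and only illustrates the logic. The function name `recode_id_columns`, the letters-only code format, and the column names in the usage example (`SITEID`, `INVID`, `LBREFID`) are assumptions chosen for illustration.

```python
import secrets
import string


def _random_code(length):
    # Assumption: letters-only codes, so they cannot resemble page or sample numbers.
    return "".join(secrets.choice(string.ascii_uppercase) for _ in range(length))


def recode_id_columns(datasets, columns, length=8):
    """Recode the given ID columns consistently across all datasets.

    datasets -- dict of dataset name -> list of row dicts
    columns  -- names of the ID columns to recode
    Returns (recoded_datasets, maps), where maps[column] holds the
    original -> recoded lookup for that column.
    """
    maps = {}
    for col in columns:
        # Collect every non-missing original value of this column.
        originals = {
            row[col]
            for rows in datasets.values()
            for row in rows
            if row.get(col) not in (None, "")
        }
        mapping, used = {}, set()
        for value in sorted(originals, key=str):
            code = _random_code(length)
            # Original and recoded lists must have no values in common.
            while code in originals or code in used:
                code = _random_code(length)
            mapping[value] = code
            used.add(code)
        maps[col] = mapping

    recoded = {
        name: [
            {k: (maps[k].get(v, v) if k in maps else v) for k, v in row.items()}
            for row in rows
        ]
        for name, rows in datasets.items()
    }
    return recoded, maps


# Usage example with made-up data.
example = {
    "DM": [{"SITEID": "101", "INVID": "INV7"}],
    "LB": [{"SITEID": "101", "LBREFID": "S-0001"},
           {"SITEID": "102", "LBREFID": "S-0002"}],
}
recoded_example, example_maps = recode_id_columns(
    example, ["SITEID", "INVID", "LBREFID"]
)
```

Missing values are left as they are, and the input datasets are not modified, so the original and recoded versions can be compared before release.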