mlensing
Obsidian | Level 7

Hi everyone,

 

I am working on a case-control study in which I need to create a survey ID variable for each record. For every case in dataset1, I randomly pull 3 controls from a source dataset using PROC SURVEYSELECT, which gives me a dataset of cases (dataset1) and a dataset of controls (dataset2). The survey ID naming scheme should be as follows:

 

Case1: 12345-1

Control1: 12345-1-1

Control2: 12345-1-2

Control3: 12345-1-3

 

Case2: 12345-2

Control1: 12345-2-1

Control2: 12345-2-2

Control3: 12345-2-3

 

This naming scheme can be applied to each dataset separately or can be applied to the combined dataset of all records. A key detail is that the controls are simply frequency matches and NOT paired matches. How would you all code this?

 

Thanks!

1 REPLY
PaigeMiller
Diamond | Level 26

Keep the data set long, not wide. You don't want separate variables here, you want separate records.


Beyond that, a more concrete explanation of the data, or showing us a portion of what the data set would look like, would be a great help. 

--
Paige Miller
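
For readers who want something concrete, here is a minimal sketch of one way the naming scheme in the question could be built while keeping the data long. It rests on several assumptions that are not in the original post: dataset1 has one row per case, dataset2 has exactly 3 rows per case, 12345 is a fixed study prefix, and neither dataset already has variables named survey_id, case_num, control_num, or role (all hypothetical names). Because the controls are frequency matches rather than paired matches, the sketch simply assigns controls to case numbers by row order.

```sas
/* Assumption: 12345 is a fixed study prefix shared by all records */
%let prefix = 12345;

/* Cases: number them 1, 2, 3, ... in row order */
data cases_id;
    set dataset1;
    length survey_id $20;
    case_num  = _n_;                           /* _N_ counts DATA step iterations */
    survey_id = catx('-', "&prefix", case_num);
run;

/* Controls: assumes exactly 3 rows per case, assigned by row order */
data controls_id;
    set dataset2;
    length survey_id $20;
    case_num    = ceil(_n_ / 3);               /* rows 1-3 -> 1, rows 4-6 -> 2, ... */
    control_num = mod(_n_ - 1, 3) + 1;         /* cycles 1, 2, 3 */
    survey_id   = catx('-', "&prefix", case_num, control_num);
run;

/* Stack cases and controls: one record per person (long, not wide) */
data survey_ids;
    set cases_id (in=in_case) controls_id;
    length role $7;
    role = ifc(in_case, 'case', 'control');
run;

/* Missing control_num sorts first, so each case precedes its controls */
proc sort data=survey_ids;
    by case_num control_num;
run;
```

CATX strips blanks and inserts the delimiter between non-missing arguments, so the numeric counters can be passed directly. If the number of controls per case ever differs from 3, the arithmetic in the second step would need to change.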
