Does SAS offer anything in the way of Statistical Disclosure Control?
There is no "PROC SDC" if that is what you are asking. There are multiple ways to address the issues of Statistical Dissclosure Analysis using SAS. You can use UNIVARIATE and FREQ to look at distributions and joint distributions to understand where you might need to trim data or combine categories. You can fuzz the data with a jittering macro. If you want to go further, you can create synthetic datasets using PROC MI with instructions for the recipient on how to use MIANALYZE to put the data back together again.
IMHO, the hard work in Statistical Disclosure Control is in the analyses in order to understand the controls that need to be put into place. Statistical Policy Working Paper 22 ( http://www.fcsm.gov/working-papers/spwp22.html ) is a good introduction on that.
Doc Muhlbaier
Duke
I realize this is an old thread, but for the benefit of those finding it on Google... There are obviously a lot of ways to accomplish statistical disclosure control (fuzzing, small cell suppression, etc.). All of these can be accomplished in SAS, even if they aren't implemented in actual SAS features.
If you're interested in small cell suppression specifically, I did a significant amount of work implementing various algorithms for small cell suppression in SAS while working as a government analyst. I've (rewritten and) published macro code for these algorithms for the benefit of others that might be interested in using them. There's two methods I've specifically implemented in macro code:
I hope somebody ends up finding this useful!
Don't miss out on SAS Innovate - Register now for the FREE Livestream!
Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.