Thanks Steve and no problem. I think I'm mostly just trying to wrap my head around the fundamentals here (I've exceeded my own knowledge with this question to be honest). I'm incorporating the weight, strata, and cluster variables in my proc surveyfreq's because the dataset instructions tell me to, but I'm not 100% sure what those variables are actually doing. I understand weighting, clustering, and stratification as concepts, but haven't really been able to wrap my head around what they do/how they function practically speaking (i.e., in SAS). Maybe I don't need to, but I do want to make sure I'm not overlooking something important. Your earlier point about subsampling throwing off the weighting was a great one and was the perfect example of something I'm worried about overlooking (that hadn't even occurred to me). At first, it made me pause and think I shouldn't be trying to force pairwise comparison - that I should just let it go and go the regression route - but then I realized I'm basically subsampling already by opting to run all of my analyses on the 18-65 year old segment of the survey respondents. Meaning that if I'm tracking your earlier point, the weighting is probably already thrown off to some degree (assuming the survey's weighting strategy was to correlate the whole sample with known population data). I suppose now all I'm really trying to do is understand how big of a deal that is/would be. If you have any thoughts on this front, I'd appreciate them. And if not, no worries - I appreciate all the help and clarity so far. It's been a real boon.
... View more