So, that part about knowing the subgroup sample sizes always confused me, which is why I didn't try the BY statement route in the first place. But I guess it's just referring to modeling where you run multiple random subsamples of the population. Since I have a static sample size, does the part "you can use a BY statement to obtain stratum level estimates when the stratum sample sizes are fixed" imply that this is a better method than the duplicate stratum variable? I liked the duplicate method because the confidence intervals were tighter, but I suppose that's the definition of temptation...