About tebteb

tebteb · ‎08-12-2015

Ok -- thought that was the solution but it is not what I want to do. I need an entirely new data set with just the aggregated variables. PROC MEANS, etc, does not get me this, unless I am missing something.

tebteb · ‎08-10-2015

Thanks! I have a cluster ID variable, just knew there was a way to do it more elegantly than what I was coming up with (which was long and messy).

tebteb · ‎08-10-2015

I have a large data set with about 10000 clusters (each with about 5-10 data points). There are about 30 variables in the dataset. I need to aggregate by cluster. Variables will aggregate differently (mostly count or mean). I do not want to retain any duplicate, non-aggregated data -- just one datapoint for each cluster. What is the simplest way to do this? I know I could do these with proc means and creating all new variables, but thought there might be a better way?

Online Status	Offline
Date Last Visited	‎09-01-2015 07:11 AM

Re: Best way to aggregate by cluster?

Re: Best way to aggregate by cluster?

Best way to aggregate by cluster?

Re: Best way to aggregate by cluster?

Re: Best way to aggregate by cluster?

Best way to aggregate by cluster?