jiaxinyang
Calcite | Level 5

Hi All,

My dataset has a year identifier -- fyear, a two-digit industry code -- sid2d, and the variable of interest -- Resid.

I want to drop every observation that belongs to a two-digit industry code (sid2d) with fewer than ten observations in a given year (fyear).

Can anyone tell me how to write the code for this? Thanks in advance.

Best regards,

Jiaxin

1 REPLY 1
PGStats
Opal | Level 21

Try something like this:

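proc sql;
/* Keep every row whose (sid2d, fyear) group has at least 10 observations. */
/* SAS remerges count(*) with the detail rows, so select * works with group by. */
create table want as
select *
from myDataset
group by sid2d, fyear
having count(*) >= 10;
quit;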
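
To check the result, a query along these lines should list the number of observations left in each industry-year group. This is only a sketch: it assumes the output table is named want as above, and n_obs is just an illustrative column name.

```sas
proc sql;
/* Count the remaining rows per (sid2d, fyear) group in the filtered table. */
/* n_obs is a hypothetical alias; the smallest groups are listed first. */
select sid2d, fyear, count(*) as n_obs
from want
group by sid2d, fyear
order by n_obs;
quit;
```

If the filter worked, the first row listed should already show n_obs of at least 10.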
PG