Thanks for the reply. i tried the following code to make sure this thing.
proc sort data=j out=c NODUPKEY; by gvkey year; run;
my first problem has been solved.
Regarding second thing there are about 5000 distinct GVKEY.
i am usning following codes;
proc sort data=j; by gvkey datadate; run;
proc panel data=j ;
id GVKEY DATADATE;
model invest = TQ cfcon /fixtwo pooled;
run;
now i am getting this issue;
ERROR: There is only one cross section or time series observation. Computations will be
terminated.
any way to solve this thing. your efforts are always appreciated.
ABdul