BookmarkSubscribeRSS Feed
margalits1
Obsidian | Level 7

Hello, I am running the following regression for a sample of 104,000 observations (some are omitted because of missing values).

I want to add clusters by firm (there are approx.1700 firms in the sample ). I already tried to increase the memsize manually to 6G, but with no luck.

I tried both locally and via WRDS servers.

This is the code:

 

%macro reg_s(var_name, data_name);
proc surveyreg data=&data_name;
cluster gvkey;
CLASS gvkey year_data;
MODEL &var_name=incu_post_ipo sign_inc_p sign_p sale_rank market_share ind_code_count gvkey year_data /SOLUTION;
ods output ParameterEstimates=&var_name._&data_name;
RUN; 
ods trace off;

%mend;


%reg_s(capx_1,ff);
run;

This is the error output:

NOTE: In data set FF, total 104383 observations read, 43366 observations with missing values are
      omitted.
ERROR: The SAS System stopped processing this step because of insufficient memory.
NOTE: PROCEDURE SURVEYREG used (Total process time):
      real time           1.06 seconds
      user cpu time       0.89 seconds
      system cpu time     0.10 seconds
      memory              51160.06k
      OS Memory           77100.00k
      Timestamp           01/02/2022 04:54:23 àçäØö
      Step Count                        20  Switch Count  0

2452  run;



P.S when I run the code without the clusters this is what I get:

NOTE: In data set FF, total 104383 observations read, 43366 observations with missing values are
      omitted.
NOTE: The data set WORK.CAPX_1_FF has 1127 observations and 6 variables.
NOTE: PROCEDURE SURVEYREG used (Total process time):
      real time           42.59 seconds
      user cpu time       39.06 seconds
      system cpu time     0.87 seconds
      memory              101172.81k
      OS Memory           126420.00k
      Timestamp           01/02/2022 05:06:44 àçäØö
      Step Count                        22  Switch Count  819


I already read some relevant Q&A but couldn't find an answer to my case. 

Thank you in advance.

4 REPLIES 4
Rick_SAS
SAS Super FREQ

I am not an expert in survey regression, but are you sure you want to include gvkey (the clustering variable) on the MODEL statement?  I noticed in the documentation examples, the variable on the CLUSTER statement is not included on the MODEL statement.

margalits1
Obsidian | Level 7
I want to have both fixed effects and clustering for gvkey (the firm).
Perhaps I should use a different model then?
andreas_lds
Jade | Level 19

Please post the result of

proc options group=memory;
run;

and some information about the hardware used: how much memory is installed, how much is free?

It seems strange that the first run terminates with ~50% memory used, compared with the second run.

margalits1
Obsidian | Level 7
Thanks for your reply,
I actually read a bit more and found that indeed surveyreg has memory issues with clusters, and now use GENMOD instead, which captures both f.e and two-way cluster, just as I wanted.
Thanks again

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 4 replies
  • 822 views
  • 2 likes
  • 3 in conversation