BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
jim_wong
Calcite | Level 5

Hi Everyone,

 

I am a new user, currently completing panel regression analysis for a university paper. The data I am using is a sample taken from the US Panel Study of Income Dynamics (PSID), with 23 variables and ~220k observations.

 

When I run a simple panel model, such as below, I do not get the observation plots, and the colored set of residual plots seem incomprehensible. Is it simply a case of so many observations making the plots over-complicated? Are these plots "normal" for proc panel, and if so how do I interpret them?

 

Thank you for any advice.

 

proc panel data=data plots=ALL;
id year id;
model ARSINH_wealth = pd age male white black hispanic otherrace education income employed divorce
marriage childbirth familydeath laidoff missedwork studentloan collegedegree ARSINH_socioeconomic / pooled;
title 'Model One - Pooled (OLS) Model (negative)';
run;

 

jim_wong_0-1652942236726.jpeg jim_wong_1-1652942247355.jpeg

 

This question was originally posted to 'New Users' but I was advised to post it here instead.

1 ACCEPTED SOLUTION

Accepted Solutions
acordes
Rhodochrosite | Level 12

Perhaps the visuals get more meaning if you work with a sample of the data. 

You could try subsetting your data. 

 

data want;
set data;
sample_filter=rand("integer", 1, 20);
run;

proc panel data=want plots=ALL;
where sample_filter = 1;   /* in this case you would work with a sample rate of 5% */ 
id year id;
model wealth = pd age male white black hispanic otherrace education income employed divorce
marriage childbirth familydeath laidoff missedwork studentloan collegedegree ARSINH_socioeconomic / pooled;
title 'Model One - Pooled (OLS) Model (negative)';
run;

 

View solution in original post

6 REPLIES 6
jim_wong
Calcite | Level 5

Hi Everyone,

 

I am a new user, currently completing panel regression analysis for a university paper. The data I am using is a sample taken from the US Panel Study of Income Dynamics (PSID), with 23 variables and ~220k observations.

 

When I run a simple panel model, such as below, I do not get the observation plots, and the colored set of residual plots seem incomprehensible. Is it simply a case of so many observations making the plots over-complicated? Are these plots "normal" for proc panel, and if so how do I interpret them?

 

Thank you for any advice.

 

proc panel data=data plots=ALL;
id year id;
model wealth = pd age male white black hispanic otherrace education income employed divorce
marriage childbirth familydeath laidoff missedwork studentloan collegedegree ARSINH_socioeconomic / pooled;
title 'Model One - Pooled (OLS) Model (negative)';
run;

 

jim_wong_1-1652906237704.jpeg jim_wong_0-1652906184544.jpeg

 

sbxkoenk
SAS Super FREQ

Hello @jim_wong ,

 

@SASCom1 can help you with this.

 

Thanks,

Koen

acordes
Rhodochrosite | Level 12

Perhaps the visuals get more meaning if you work with a sample of the data. 

You could try subsetting your data. 

 

data want;
set data;
sample_filter=rand("integer", 1, 20);
run;

proc panel data=want plots=ALL;
where sample_filter = 1;   /* in this case you would work with a sample rate of 5% */ 
id year id;
model wealth = pd age male white black hispanic otherrace education income employed divorce
marriage childbirth familydeath laidoff missedwork studentloan collegedegree ARSINH_socioeconomic / pooled;
title 'Model One - Pooled (OLS) Model (negative)';
run;

 

sas-innovate-2026-white.png



April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and save with the early bird rate—just $795!

Register now

Discussion stats
  • 6 replies
  • 8769 views
  • 1 like
  • 5 in conversation