BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
matt23
Quartz | Level 8

Hi.

 

I am a student and I'm working on regression models. I have 24 models for every hour with 7 variables. When I do the selection based on adjrsq AIC SBC and RMSE, I get couple models for each hour that are very close. Is there an easy way to check them or run them through the data to see how they would do ? And by easy, I mean something I would be able to do as I am only a student. Or should I just trust the selection options?

1 ACCEPTED SOLUTION

Accepted Solutions
PaigeMiller
Diamond | Level 26

That's the problem with fitting models. THere are many ways to measure how good the model fits, they don't always agree, and if you're fitting models once an hour in some automated fashion, you can't really spend the time trying to understand which model is really really really really the best. So, I think you need to pick a criteria for how good the model fits, and go with it. You might want to consult with statisticians at your company/university to see if they have a preference for certain model fit statistics in your specific application.

--
Paige Miller

View solution in original post

3 REPLIES 3
PaigeMiller
Diamond | Level 26

That's the problem with fitting models. THere are many ways to measure how good the model fits, they don't always agree, and if you're fitting models once an hour in some automated fashion, you can't really spend the time trying to understand which model is really really really really the best. So, I think you need to pick a criteria for how good the model fits, and go with it. You might want to consult with statisticians at your company/university to see if they have a preference for certain model fit statistics in your specific application.

--
Paige Miller
matt23
Quartz | Level 8
Thank you. Can I ask one more question?
Let's say my Y is larger on weekdays than it is on weekends. Can I assume that this is the reason why the slopes for my X variables are more sensitive during weekends ? For example, one of my X variable has slope -100 throughout the weekday, and ranges from -100 to -250 throughout the weekend. Can it be due to the the fact that the Y value is larger on the weekday therefore X variables have less impact?
PaigeMiller
Diamond | Level 26

That's one possible cause of the changing slopes, but if the Y values are just higher on the weekend, that could result in different intercepts rather than different slopes. It is not the only potential cause of higher slopes. The difference in slopes could be due to outliers on the weekends, or higher variability on the weekends, or (depending on what modeling you are doing) more collinearity on weekends, and probably a few other things that I can't think of right now.

 

Also, Y in the range -100 to -250 on weekends is lower then weekdays, not higher.

--
Paige Miller

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

SAS Enterprise Guide vs. SAS Studio

What’s the difference between SAS Enterprise Guide and SAS Studio? How are they similar? Just ask SAS’ Danny Modlin.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 3 replies
  • 780 views
  • 0 likes
  • 2 in conversation