BookmarkSubscribeRSS Feed
MZM
Calcite | Level 5 MZM
Calcite | Level 5

Hi Modellers,

 

I am trying to build a model on a dataset with 2 million observations and an ordinal response variable with 53 levels with a normal distribution ranging from -26 to +26.

Both SAS/Base 9.4 and Enterprise Miner 13.2 are available for use. 

 

I am looking for any suggestions on modeling techniques that could be used especially in EM. 

Was also wondering if there's a way to use cumulative logit link function in EM.

Does that make sense to consider the response variable interval and then to use GLM?

 

Thanks,

M.

2 REPLIES 2
PaigeMiller
Diamond | Level 26

Unless your 53 levels are highly non-linear, I would treat them as a continuous variable and perform partial least squares regression modelling (which in my opinion is probably the best way to model 900 independent variables) in PROC PLS. I do not know if this is available in Enterprise Miner as I don't use it. I would not use PROC GLM with 900 independent variables.

--
Paige Miller
WendyCzika
SAS Employee

When you use the Regression node in Enterprise Miner with an ordinal target, it does use the cumulative logit link function.

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1565 views
  • 2 likes
  • 3 in conversation