BookmarkSubscribeRSS Feed
MZM
Calcite | Level 5 MZM
Calcite | Level 5

Hi Modellers,

 

I am trying to build a model on a dataset with 2 million observations and an ordinal response variable with 53 levels with a normal distribution ranging from -26 to +26.

Both SAS/Base 9.4 and Enterprise Miner 13.2 are available for use. 

 

I am looking for any suggestions on modeling techniques that could be used especially in EM. 

Was also wondering if there's a way to use cumulative logit link function in EM.

Does that make sense to consider the response variable interval and then to use GLM?

 

Thanks,

M.

2 REPLIES 2
PaigeMiller
Diamond | Level 26

Unless your 53 levels are highly non-linear, I would treat them as a continuous variable and perform partial least squares regression modelling (which in my opinion is probably the best way to model 900 independent variables) in PROC PLS. I do not know if this is available in Enterprise Miner as I don't use it. I would not use PROC GLM with 900 independent variables.

--
Paige Miller
WendyCzika
SAS Employee

When you use the Regression node in Enterprise Miner with an ordinal target, it does use the cumulative logit link function.

sas-innovate-2026-white.png



April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!

Register now

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1621 views
  • 2 likes
  • 3 in conversation