BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
pathakvishal
Fluorite | Level 6

I am using PROc Reg (Selection = stepwise ) linear regression for banking data which has credit customer information for which we need to predict the factor affecting the usageof credit card. In output of PRoc Reg . My model is giving of the final factors(dummyvariable factors) as 

 

dummy_card1 meaning( amex)

dummy_card4     meaning(discover), (both are types of primary )  refer to screenshot for more clarity

these card are primary credit cards

 

My question is can we have a final model in which we can have 2 dummy variables from same categorical variables .

 

Also , can we reduce the number of variable even after the output of ( selection = stepwise ). ?

 

 

P.s : I am naive in the world of analytics ..Thanx in advance 

 

 


dummydummy.png
1 ACCEPTED SOLUTION

Accepted Solutions
PaigeMiller
Diamond | Level 26

@pathakvishal wrote:

I am using PROc Reg (Selection = stepwise ) linear regression for banking data which has credit customer information for which we need to predict the factor affecting the usageof credit card. In output of PRoc Reg . My model is giving of the final factors(dummyvariable factors) as 

 

dummy_card1 meaning( amex)

dummy_card4     meaning(discover), (both are types of primary )  refer to screenshot for more clarity

these card are primary credit cards

 

My question is can we have a final model in which we can have 2 dummy variables from same categorical variables .

 


Yes, of course you can.

 


Also , can we reduce the number of variable even after the output of ( selection = stepwise ). ?

 


Yes, you can do this too, but I wouldn't, because stepwise provides the result that if finds to be the best fit model. I would however recommend that stepwise isn't highly regarded by most statisticians these days, and perhaps something like PROC PLS might be a better alternative with multiple input variables that are correlated.

--
Paige Miller

View solution in original post

2 REPLIES 2
PaigeMiller
Diamond | Level 26

@pathakvishal wrote:

I am using PROc Reg (Selection = stepwise ) linear regression for banking data which has credit customer information for which we need to predict the factor affecting the usageof credit card. In output of PRoc Reg . My model is giving of the final factors(dummyvariable factors) as 

 

dummy_card1 meaning( amex)

dummy_card4     meaning(discover), (both are types of primary )  refer to screenshot for more clarity

these card are primary credit cards

 

My question is can we have a final model in which we can have 2 dummy variables from same categorical variables .

 


Yes, of course you can.

 


Also , can we reduce the number of variable even after the output of ( selection = stepwise ). ?

 


Yes, you can do this too, but I wouldn't, because stepwise provides the result that if finds to be the best fit model. I would however recommend that stepwise isn't highly regarded by most statisticians these days, and perhaps something like PROC PLS might be a better alternative with multiple input variables that are correlated.

--
Paige Miller
pathakvishal
Fluorite | Level 6

Thanx Mate !!


@PaigeMiller wrote:

@pathakvishal wrote:

I am using PROc Reg (Selection = stepwise ) linear regression for banking data which has credit customer information for which we need to predict the factor affecting the usageof credit card. In output of PRoc Reg . My model is giving of the final factors(dummyvariable factors) as 

 

dummy_card1 meaning( amex)

dummy_card4     meaning(discover), (both are types of primary )  refer to screenshot for more clarity

these card are primary credit cards

 

My question is can we have a final model in which we can have 2 dummy variables from same categorical variables .

 


Yes, of course you can.

 


Also , can we reduce the number of variable even after the output of ( selection = stepwise ). ?

 


Yes, you can do this too, but I wouldn't, because stepwise provides the result that if finds to be the best fit model. I would however recommend that stepwise isn't highly regarded by most statisticians these days, and perhaps something like PROC PLS might be a better alternative with multiple input variables that are correlated.


 

SAS Innovate 2025: Register Today!

 

Join us for SAS Innovate 2025, our biggest and most exciting global event of the year, in Orlando, FL, from May 6-9. Sign up by March 14 for just $795.


Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 1400 views
  • 0 likes
  • 2 in conversation