BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Diimoo
Fluorite | Level 6

Hi,

I have 15 questions and each question has 4 different statuses like 0 – ‘no action’ 1 – ‘started ‘2- ‘implemented’ and 3 – ‘completed’. I'd like a good way to measure the impact of each question on our in visitors (number of visitors), and if possible apply regression.

  1. It is possible to use dummy-encoding to the ordinal variable and use regression ( 3 INV variable – 1,2,3)
  2. It is possible to output each dummies Standardized Coefficients - Beta
  3. Also get from R-squared how different independent variables (dummies) are contributing to a dependent variable ( number visiting)

Please let me know if the above makes sense

Thanks in advance.

1 ACCEPTED SOLUTION

Accepted Solutions
8 REPLIES 8
ballardw
Super User

Procedure intending to use what?

 

Many of the modeling / regression procedures will allow specifying categorical variables as Class variables and handle creating/using "dummy variables" in the background.

 

Output options tend to depend on which specific procedure is used.

Diimoo
Fluorite | Level 6

Thanks for the reply.

The intention is - which status (driver) are contributing more on the outcome ( i.e Driver 4 - Completed).

 

In that case we were thinking to show each RSQR and Beta for each status.

 

Thanks again.

Diimoo
Fluorite | Level 6

Is it ok to model /regression

by question;

Model Visit = Status1 (0,1) Status2 Status3 / STB output each R..

 

In ordinal variable

 

Please let me know

 

ballardw
Super User

SAS has over 20 different regression procedures.

Choice relates to type of regression wanted, data types involved and possibly sample design (surveyreg and surveylogistic.

The model statements have different options depending on specific procedure chosen.

 

What do you mean by Status1 (0,1) in that model statement?

Diimoo
Fluorite | Level 6

The status are ordinal VR :

Status like 0 – ‘no action’ 1 – ‘started ‘2- ‘implemented’ and 3 – ‘completed’..

 

All the 16 groups (participant) -  They have to choose one of the scores for each question (total = 15 questions)

 

I was thinking to convert the ordinal data to dummy (0, 1) - But not sure.? I see before where categorical is coded 0 and 1.

 

Ideally would like to get an idea which group (group 1 to group 16) has impact on the continuous outcome (visit)

PaigeMiller
Diamond | Level 26

@Diimoo wrote:

The status are ordinal VR :

Status like 0 – ‘no action’ 1 – ‘started ‘2- ‘implemented’ and 3 – ‘completed’..

 

All the 16 groups (participant) -  They have to choose one of the scores for each question (total = 15 questions)

 

I was thinking to convert the ordinal data to dummy (0, 1) - But not sure.? I see before where categorical is coded 0 and 1.

 

Ideally would like to get an idea which group (group 1 to group 16) has impact on the continuous outcome (visit)


As I explained in my above post, there is NO need to create dummy variables here. SAS will do the work for you.

--
Paige Miller
PaigeMiller
Diamond | Level 26

@Diimoo wrote:

Hi,

I have 15 questions and each question has 4 different statuses like 0 – ‘no action’ 1 – ‘started ‘2- ‘implemented’ and 3 – ‘completed’. I'd like a good way to measure the impact of each question on our in visitors (number of visitors), and if possible apply regression.

  1. It is possible to use dummy-encoding to the ordinal variable and use regression ( 3 INV variable – 1,2,3)

If I am understanding properly, there's no need for YOU to create dummy variables here. In most modeling procedure in SAS, such as PROC GLM or PROC PLS, you can take the variable (let's call it by the name STATUS) with the 0,1,2,3 value and place it is a CLASS statement, and then SAS will create the dummies for you internally, so you don't have to.

 

In this case, your model becomes (because you have 15 different status variables)

 

CLASS STATUS1-STATUS15;

MODEL VISITS = STATUS1-STATUS15/options ... ;

 

It is possible to output each dummies Standardized Coefficients - Beta

 

Yes, you can get coefficients for each level of STATUS.

 

Also get from R-squared how different independent variables (dummies) are contributing to a dependent variable ( number visiting)

 

There is NO SUCH THING as an R-squared for individual terms in the model; the only R-squared number you can get is for the entire model fit.

--
Paige Miller
Diimoo
Fluorite | Level 6

Thank you both ..

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 8 replies
  • 1490 views
  • 0 likes
  • 3 in conversation