yanlu
Fluorite | Level 6

Hi,

 

I have a question about how to change the results computed in the Score node. Imagine I am a doctor and I want to identify the people who are sick. In the Score node, my target rate is 10%, but the predictions flag only 5% of people as sick. I would accept classifying more healthy people as sick in order to increase the percentage of sick people who are actually diagnosed. I don't know how to do this in SAS.

 

Thank you for considering my question.


10 REPLIES
Reeza
Super User

You change your scores by changing your models. Determine which model you're using and figure out how to tweak that. 

Once you determine your model, if you need further assistance in how to make changes please post back. 

yanlu
Fluorite | Level 6

First of all, thank you for your answer.

 

I am not really familiar with SAS, but I can do the basics.

The model is one of the options in SAS. Is it in the Model package?

 

Thank you for your help.

Reeza
Super User

But what kind of model? NN, Decision Tree, Linear Regression Model, Logistic regression model, some form of Ensemble Model?

 

Honestly, given what you're saying here, I think you should first walk through some SAS Enterprise Miner courses or books, because I don't sense a whole lot of understanding of data mining or statistical methodology. 

yanlu
Fluorite | Level 6

Well, I have checked all my models. I have tried decision tree, linear regression, forest, neural network, and maximum regression with the misclassification error minimization parameter.

I checked the misclassification error table, and I don't see any improvement across the different models :'(
One is slightly better in the classification table for my problem: an improvement of 10 clients.

I was just thinking there might be some other parameter or statistic which would allow me to somehow invert the analysis.

My problem is that I am trying to identify clients who buy during one period. I identify only 10% of the clients who buy something, so I have wrongly classified 90% of the clients who buy. These 90%, who actually buy, are predicted as non-buyers (false negatives).

The clients who buy number about 1,700. My whole sample is 15,000 clients, so about 13,300 clients did not buy anything in this period. But I would like to reverse this analysis somehow 😛

I think the problem may be a lack of linear correlation between my class inputs. Can I do anything to improve this?
Do you have another explanation for why this phenomenon appears?
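
One parameter of the kind asked about here is the classification cutoff: a case is usually predicted as a buyer only when its predicted probability is at least 0.5, and lowering that cutoff flags more cases at the cost of more false positives. Nobody in the thread proposes this, so treat the following as a minimal Python sketch of the idea rather than SAS code. The scores, the labels, and the `confusion` helper are all made up for illustration.

```python
# Hypothetical predicted probabilities of "buyer" and true labels (1 = buyer).
# These values are invented for illustration; they are not from the thread.
scores = [0.05, 0.10, 0.15, 0.22, 0.30, 0.35, 0.41, 0.48, 0.62, 0.80]
actual = [0, 0, 0, 1, 0, 1, 0, 1, 1, 1]


def confusion(scores, actual, cutoff):
    """Return (tp, fp, fn, tn) when score >= cutoff is predicted as buyer."""
    tp = fp = fn = tn = 0
    for score, label in zip(scores, actual):
        predicted = 1 if score >= cutoff else 0
        if predicted == 1 and label == 1:
            tp += 1
        elif predicted == 1 and label == 0:
            fp += 1
        elif predicted == 0 and label == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


# Compare the usual 0.5 cutoff with a lower one.
for cutoff in (0.5, 0.2):
    tp, fp, fn, tn = confusion(scores, actual, cutoff)
    recall = tp / (tp + fn)  # share of true buyers that were found
    print(f"cutoff={cutoff}: found {tp} buyers, {fp} false alarms, recall={recall:.2f}")
```

With these made-up numbers, the 0.5 cutoff finds 2 of the 5 buyers with no false alarms, while the 0.2 cutoff finds all 5 but also flags 2 non-buyers. Whether that trade is acceptable depends on the cost of each kind of error.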

 

thank you for your answer !!!

Reeza
Super User

I'm going to guess that you're defining something wrong. In a case where 90% are not buying, my guess may be that you're actually modeling the opposite event. Double check how your variables are defined. 

 

At this rate, you would get a 'better' model by simply guessing that no one bought, so with 15,000 records and good input variables I would expect a much better model. 
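
The arithmetic behind that remark can be sketched in a few lines of Python, using the approximate counts given earlier in the thread (15,000 clients, about 1,700 buyers, about 10% of buyers identified). The variable names are illustrative, and the counts are the poster's rough figures, not exact data.

```python
# Approximate figures quoted in the thread.
total_clients = 15_000
buyers = 1_700
non_buyers = total_clients - buyers

# "Model" that predicts non-buyer for everyone: every non-buyer is right,
# every buyer is wrong.
baseline_accuracy = non_buyers / total_clients
baseline_recall = 0 / buyers  # it finds none of the buyers

# The poster's models find roughly 10% of the buyers.
identified_buyers = round(0.10 * buyers)
model_recall = identified_buyers / buyers

print(f"non-buyers: {non_buyers}")
print(f"baseline accuracy: {baseline_accuracy:.3f}")
print(f"baseline recall: {baseline_recall:.3f}")
print(f"model recall: {model_recall:.3f}")
```

The point is that accuracy (or misclassification rate) alone says little with classes this unbalanced: predicting "no purchase" for everyone is already right for about 89% of clients, so the share of buyers actually found is the more informative number here.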

yanlu
Fluorite | Level 6

Hi,

 

I do identify 150 clients who buy.

I should have identified 1,500; that is actually the problem.

My target is set as binary. Maybe I could treat it as a nominal class, since it only contains 0 and 1.

I am going to check all the variables again.

 

Thanks for your answer!

 

 

 

yanlu
Fluorite | Level 6

Is there a way to understand the rapport of two columns? I have the sales of each month in my analysis.

The model considers the column for each month separately. Maybe if I could tell him this is one variable, my model would be better.

 

Thanks for any answer!

Reeza
Super User

@yanlu wrote:

Is there a way for sas to understand the rapport of two column


I don't know what that means. 

 

A time variable such as this should be in a long format, not wide format. Then make sure it's set as a time variable. 

 

Time Value

Jan2015 100

Jan2016 200

 

Or some other structure may work, it really depends on your problem. 

 

FYI - SAS isn't a him and neither are many of the contributors on here, including myself. If a pronoun isn't required I would suggest not using one. 

 

yanlu
Fluorite | Level 6

Sorry for this, I really apologize.

 

Concretely, in Excel I have my ID, then January, February, March, each in its own column, for example

                                                       50        , 100       , 80

 

How can I convert this to the long format?

                                                      January - February - March

                                                     January 50 - February 100 - March 80

 

I really appreciate the help you have provided.

Reeza
Super User

There should be a transpose task, maybe under Utilities?

If not, you can use a SAS Code node with PROC TRANSPOSE to flip it to a long dataset. 
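
To make the target shape concrete, here is a minimal Python sketch of the reshaping discussed in this thread, going from one column per month to one row per ID and month (the long format). It is only an illustration of the data layout, not SAS code and not a replacement for PROC TRANSPOSE. The `wide_to_long` helper, the column names, and the ID value 1 are hypothetical; the sales values 50, 100, and 80 are the ones from the post above.

```python
# Hypothetical wide table: one row per client ID, one column per month.
wide_rows = [
    {"id": 1, "january": 50, "february": 100, "march": 80},
]
month_columns = ["january", "february", "march"]


def wide_to_long(rows, id_column, value_columns):
    """Turn each (row, month column) pair into its own output row."""
    long_rows = []
    for row in rows:
        for column in value_columns:
            long_rows.append(
                {id_column: row[id_column], "month": column, "sales": row[column]}
            )
    return long_rows


long_rows = wide_to_long(wide_rows, "id", month_columns)
for row in long_rows:
    print(row["id"], row["month"], row["sales"])
```

Each input row with three month columns becomes three output rows, so the month is a single variable and the sales figure is a single variable.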
