BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
ken88
Calcite | Level 5

How come no Segment Profile after I set "Cluster Variable Role" = Input in Cluster node?

Thank You

1 ACCEPTED SOLUTION

Accepted Solutions
M_Maldonado
Barite | Level 11

Hey Ken,

Not sure exactly what you mean.

If your question is, why would a Segment Profile Node does not show plots after a Cluster node with Cluster Variable Role set to input?

Well, it is just because of that Smiley Happy. The Segment Profile Node needs a segment variable to analyze.

To do both things in one run I would leave the cluster variable role as segment on my flow, check it out with the segment profile node, and then use the metadata node to change the role from segment to input.

Your flow will look something like:    Data->Cluster->Segment Profile->Metadata(change role of _segment_ variable)->Regression (or anything you had planned for your modeling).

The other way around (using the metadata role of _segment_ to segment) also works.

Would that answer your question?

Thanks,

Miguel

View solution in original post

8 REPLIES 8
M_Maldonado
Barite | Level 11

Hey Ken,

Not sure exactly what you mean.

If your question is, why would a Segment Profile Node does not show plots after a Cluster node with Cluster Variable Role set to input?

Well, it is just because of that Smiley Happy. The Segment Profile Node needs a segment variable to analyze.

To do both things in one run I would leave the cluster variable role as segment on my flow, check it out with the segment profile node, and then use the metadata node to change the role from segment to input.

Your flow will look something like:    Data->Cluster->Segment Profile->Metadata(change role of _segment_ variable)->Regression (or anything you had planned for your modeling).

The other way around (using the metadata role of _segment_ to segment) also works.

Would that answer your question?

Thanks,

Miguel

ken88
Calcite | Level 5

what criteria should I use for selecting segment variable?

is segment variable similar to target variable??

Thank You

M_Maldonado
Barite | Level 11

Ken,

Like Gergely said, clustering is unsupervised because you don't need a target.

Cluster node creates a variable called _segment_ with a role of segment to flag each of your observations.

You use the property Cluster Variable Role to tell Enterprise Miner what to do with that variable. If you specify "segment", a node like Segment Profile node can use it to give you an anlysis based on segment. If you specify "input" you can use it as one more input variable for any of your models.

To make it more clear, look at the exported data set of the Cluster node. Notice that you have 3 new variables: _segment_, segment_label, distance (to the centroid of that cluster). Cluster variable role is the way you tell EM what to do with _segment_ variable. By default _segment_label and distance will have a role of rejected, _segment_ will have a segment role.

for ken2.png

I hope this helps,

Miguel

ken88
Calcite | Level 5

Is there a way to keep the variable in the cluster train data without using it.

I want to keep the id of the customer in the dataset without using it for data analysis

M_Maldonado
Barite | Level 11

All three variables that the cluster node creates for you are kept automatically.

The "id" of the cluster (_segment_) is kept, and only used for data analysis if you specify the Cluster Variable Role as input.

Take a look at the exported train data coming out of your cluster node. It will make things more clear.

Good luck!

RalphAbbey
SAS Employee

If you set the id of the customer as an ID Variable, then it should stay with the data set, but not be used by any of the nodes for data analysis.

ken88
Calcite | Level 5

Thanks Miguel,

what the effect on the resulting cluster for diff setting in Cluster Variable Role?

so set Cluster Variable Role= segment == supervised segmentation

and set Cluster Variable Role = input == unsupervised segmentation?

Thank You

gergely_batho
SAS Employee

No effect at all. I affects only the ROLE of the segment variable.

It is unsupervised.

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 8 replies
  • 7398 views
  • 4 likes
  • 4 in conversation