BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
PGStats
Opal | Level 21

Hello, I am looking for example code showing how to create a graphical representation of a decision tree produced with HPSPLIT. I've obtained a graph with proc tree where I put all information in the leaves but I would prefer the layout provided by proc netdraw or proc dtree.

PG
1 ACCEPTED SOLUTION

Accepted Solutions
FriedEgg
SAS Employee
proc hpsplit data=sashelp.cars;
target origin / level=nominal;
input msrp cylinders length wheelbase mpg_city mpg_highway invoice weight horsepower / level=interval;
input enginesize / level=ordinal;
input drivetrain type / level=nominal;
output nodestats=nstat;
run;

proc sql;
create view treedata as
select a.parent as activity, a.id as node, b.splitvar, b.predictedvalue
from nstat a, nstat b
where a.parent=b.id
union
select c.id as activity, . as node, c.splitvar, c.predictedvalue
from nstat c;
quit;

data treedata1;
set treedata;
select(predictedvalue);
when ('ASIA') _pattern=1;
when ('EUROPE') _pattern=2;
otherwise _pattern=3;
end;
run;

pattern1  c=green; pattern2 v=s c=red;  pattern3 v=s c=blue; 
footnote   c=green   'Asia  '  c=red     'Europe '    c=blue    'USA'; 
proc netdraw data=treedata1 graphics; 
     actnet /activity=activity successor=NODE  id=(splitvar) tree compress rotate rotatetext arrowhead=0 htext=5; 
run;
footnote ' ';

 

Starting in EM14.1, I beleive, PROC HPSPLIT will produce it's own plots

View solution in original post

2 REPLIES 2
FriedEgg
SAS Employee
proc hpsplit data=sashelp.cars;
target origin / level=nominal;
input msrp cylinders length wheelbase mpg_city mpg_highway invoice weight horsepower / level=interval;
input enginesize / level=ordinal;
input drivetrain type / level=nominal;
output nodestats=nstat;
run;

proc sql;
create view treedata as
select a.parent as activity, a.id as node, b.splitvar, b.predictedvalue
from nstat a, nstat b
where a.parent=b.id
union
select c.id as activity, . as node, c.splitvar, c.predictedvalue
from nstat c;
quit;

data treedata1;
set treedata;
select(predictedvalue);
when ('ASIA') _pattern=1;
when ('EUROPE') _pattern=2;
otherwise _pattern=3;
end;
run;

pattern1  c=green; pattern2 v=s c=red;  pattern3 v=s c=blue; 
footnote   c=green   'Asia  '  c=red     'Europe '    c=blue    'USA'; 
proc netdraw data=treedata1 graphics; 
     actnet /activity=activity successor=NODE  id=(splitvar) tree compress rotate rotatetext arrowhead=0 htext=5; 
run;
footnote ' ';

 

Starting in EM14.1, I beleive, PROC HPSPLIT will produce it's own plots

PGStats
Opal | Level 21

Thanks Matt! That gives me a very good start.

 

Note: I had to replace act1 by Activity in the last step.

PG

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

Multiple Linear Regression in SAS

Learn how to run multiple linear regression models with and without interactions, presented by SAS user Alex Chaplin.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 4375 views
  • 0 likes
  • 2 in conversation