Solved: Error: Number of columns in X does not match with the number of variables in the data set

dustychair · Posted 08-24-2018 03:07 AM

Hi,

I have been working on some code, and this community has helped me and taught me a lot. I have a question that may be very simple for you. In the code below, when I use the ROWNAME= option on the CREATE statement, I get this error: 'Number of columns in X does not match with the number of variables in the data set'. The rows and columns of the matrix are labeled with the same candidates, so I don't understand why I get this error.

Thanks

data tr;
infile 'C:\coll_gen\N100\GROUP10\COMP40\PHI25\gbt\RESULTS-SIG1.OUT';
input CAND_1$ CAND_2$ ID_1 ID_2 THETA_1 THETA_2 MATCHES RT_PROB FLAG ;
run;

proc iml;

use tr;
read all var _ALL_;
close tr;

Cand=unique(Cand_1//Cand_2);
call sort(Cand);

X=j(ncol(Cand), ncol(Cand), .);

do i=1 to nrow(RT_PROB);
X[loc(Cand=Cand_1[i]),loc(Cand=Cand_2[i])]=RT_PROB[i];
end;

print X[colName=((Cand)) rowName=((Cand))];
create matrix from X [colname=Cand rowname=Cand];
append from X;
quit;

Rick_SAS · Posted 08-24-2018 05:45 AM

Because you are writing an extra column (the ROWNAME= variable), you need to include that information on the APPEND statement:

create matrix from X [colname=Cand rowname=Cand];
append from X[rowname=Cand];
close;
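
To see why the column counts disagree, here is a minimal sketch using a made-up 3x3 matrix; the labels A, B, C and the values are assumptions for illustration, not data from the original post. With ROWNAME= on the CREATE statement, the data set gets one character variable for the row labels plus one numeric variable per column, so it has four variables while X has only three columns. Repeating ROWNAME= on the APPEND statement supplies the values for that extra variable.

```sas
proc iml;
/* Hypothetical labels and values, for illustration only */
Cand = {"A" "B" "C"};
X = {  . 0.2 0.3,
     0.2   . 0.5,
     0.3 0.5   .};

/* ROWNAME= adds a character variable (named Cand) to the data set,  */
/* so the data set has 4 variables: Cand, A, B, C                    */
create matrix from X[colname=Cand rowname=Cand];

/* Repeat ROWNAME= so that APPEND also writes the row labels;        */
/* a plain "append from X;" would offer only 3 columns for 4 variables */
append from X[rowname=Cand];
close matrix;
quit;

proc print data=matrix;
run;
```

If the row labels are not needed in the data set, dropping ROWNAME= from the CREATE statement and keeping the plain APPEND should also avoid the mismatch.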


PeterClemmensen · Posted 08-24-2018 03:21 AM

Hi @dustychair. Glad to hear that the community has taught you some 🙂

My guess is that you do not want to create another matrix from X, but rather a data set, correct? If so, change this

create matrix from X [colname=Cand rowname=Cand];
append from X;

to this

create MyData from X[colname=Cand];
append from X;
close MyData;


dustychair · Posted 08-24-2018 03:52 AM

Hi Draycut,

Actually, my main purpose is to use the X matrix (or should I say data set?) in a cluster analysis. In the cluster analysis I need to define variables. Candidates will be clustered based on RT_PROB, so I have to define both variables. Maybe there is an easier way to do that, but I tried PROC TRANSPOSE to get the cand variable in the rows. Here is my whole code for this purpose.

data tr;
infile 'C:\coll_gen\N100\GROUP10\COMP40\PHI25\gbt\RESULTS-SIG1.OUT';
input CAND_1$ CAND_2$ ID_1 ID_2 THETA_1 THETA_2 MATCHES RT_PROB FLAG ;
run;

proc iml;

use tr;
read all var _ALL_;
close tr;

Cand=unique(Cand_1//Cand_2);
call sort(Cand);

X=j(ncol(Cand), ncol(Cand), .);

do i=1 to nrow(RT_PROB);
X[loc(Cand=Cand_1[i]),loc(Cand=Cand_2[i])]=RT_PROB[i];
end;

print X[colName=((Cand)) rowName=((Cand))];
create matrix from X [colname=Cand rowname=Cand];
append from X;
quit;

data matrix;
set matrix;
array change _numeric_;
do over change;
if change=. then change=0;
end;
run ;

proc transpose data=matrix
out=matrix1;
run;

ods graphics on;

proc cluster data=matrix1 method=ward ccc pseudo out=tree;
var RT_PROB;
id cand;

run;

ods graphics off;

proc tree data=matrix1 out=New nclusters=5 noprint ;

run;

Kurt_Bremser · Posted 08-24-2018 03:27 AM

I moved this to the SAS/IML community.

