SAS Text Miner only sees Text variables and Target variables (variables with roles Text or Target). Target variables are only seen if they have a level of binary or nominal. If there are two or more Text variables in a SAS data set, the Text Parsing node selects exactly one of the Text variables for analysis and ignores all of the rest. It has no way of knowing how any of the Text variables were created, whether concatenated or filtered or anything else. If there are two or more Text variables, the Text Parsing node uses the following selection rules:
1. Pick the Text variable with the greatest length.
2. If two Text variables tie for having the greatest length, pick the one that comes first in sort order. (Example: variable Animals has length 272, and variable Vegetables has length 272, choose Animals because it appears first in sort order by name (A comes before V).
As a best practice, never let the Text Parsing node choose for you. Set the Use status of all Text variables to No except for the one that YOU choose to include in the analysis.
If you want to concatenate two or more Text variables, use a SAS Code node. Example code:
data &EM_EXPORT_TRAIN;
set &EM_IMPORT_DATA;
attrib NewText length=$242; /*Assume Text1-Text3 have length 80*/
NewText=catx(' ',Text1,Text2,Text3);
run;
The ATTRIB statement is necessary to prevent truncation of the resulting concatenation. Without the ATTRIB statement, NewText would be truncated to 200 characters.
You can attach a Text Parsing node to the SAS Code node and do the analysis using the concatenated variable.
I hope this helps.