Few additional points: 1. code 987 meaning both Arizona (State) and US (Country. Its a typo error. every code has unique description and category as you said. 2.First about values: You wrote: possible code (4 digit), but what is 4 digits in this case, it is a character variable with values like "0123", or is it a number with up to 4 digits? - and I guess that means that string "XYZ987" could also be 4 digits like "XYZ1139". a) Code with description are with 4 digits like 0123, 1100, 0200 etc; each falling under different category like Gender, Country , State. (I have data set with code, description and category) b) the variable in the data set looks like ABC0100;2745L2000;600AT0100; etc. (prefix need not be always numeric/ character) as you have observed, last four digit (or character) are always code for which i need to map description for the given category column; c)as i described in my previous post, category position are not always same in given string. 3) That leads to the last problem. If you want separate columns for Gender, State og Country in your output, you must have a table with the type of each code. You do not need it to associate code 987 with the text "M", but you need it to tell that "M" shold appeas in a column named "Gender". a) I have lookup dataset with three variables - Code, code description and Category. For every category, i need to have column generated in output data set with code description for each dataline. If code is not available in given string, need to identify it as missing.
... View more