BookmarkSubscribeRSS Feed
pdhokriya
Pyrite | Level 9
I do not want to additional semicolon.

This issue was, raw data has semicolon but in output did not have semicolon as pe raw data.

i.e. in col1 = data should be with semicolon as present in raw, but in final output did not present.

****************************************************************************************

data abc ;
old = "ASCORBIC ACID;CALCIUM CARBONATE;CALCIUM PHOSPHATE;CHROMIUM;CYANOCOBALAMIN;FOLIC ACID;MAGNESIUM CARBONATE;MAGNESIUM HYDROXIDE;MANGANESE;NICOTINIC ACID;PANTOTHENIC ACID;POTASSIUM BICARBONATE;POTASSIUM CARBONATE;POTASSIUM PHOSPHATE DIBASIC;PYRIDOXINE;QUERCETIN;RIBOFLAVIN;SODIUM BICARBONATE;SODIUM PHOSPHATE;THIAMINE;THIOCTIC ACID;ZINC ASCORBATE"; output;

old = "ASCORBIC ACID;BIOTIN;CALCIUM;CALCIUM PHOSPHATE;CHROMIUM;CUPRIC OXIDE;CYANOCOBALAMIN;FOLIC ACID;IODINE;IRON;MAGNESIUM OXIDE;MANGANESE;MOLYBDENUM;NICKEL;NICOTINAMIDE;PANTOTHENIC ACID;PHYTOMENADIONE;POTASSIUM;PYRIDOXINE;RETINOL;RIBOFLAVIN;SELENIUM;SODIUM METASILICATE;THIAMINE;TOCOPHEROL;VITAMIN D NOS;ZINC"; output;

old = "ab;countrynew;zcc;"; output;
old = "ab;charged;"; output;
old = "ab;cd;dr;"; output;
old = "ab"; output;
run;

data want ;
set abc ;
array col [10] $200 ;
target=1;
do source=1 to countw(old,';');
if col[target]=' ' then col[target]=scan(old,source,';');
else if 200 < length(catx(';',col[target],scan(old,source,';'))) then do;
target=target+1;
col[target]=scan(old,source,';');
end;
else col[target]=catx(';',col[target],scan(old,source,';'));
end;
run;

Current :ASCORBIC ACID;CALCIUM CARBONATE;CALCIUM PHOSPHATE;CHROMIUM;CYANOCOBALAMIN;FOLIC ACID;MAGNESIUM CARBONATE;MAGNESIUM HYDROXIDE;MANGANESE;NICOTINIC ACID;PANTOTHENIC ACID;POTASSIUM BICARBONATE

Required (as present in raw) : ASCORBIC ACID;CALCIUM CARBONATE;CALCIUM PHOSPHATE;CHROMIUM;CYANOCOBALAMIN;FOLIC ACID;MAGNESIUM CARBONATE;MAGNESIUM HYDROXIDE;MANGANESE;NICOTINIC ACID;PANTOTHENIC ACID;POTASSIUM BICARBONATE;


pdhokriya
Pyrite | Level 9
Yes, but if semicolon is at 201 position than not required . but if semicolon is placed below 200 (after splitting) than I will required in the output (as per raw).
Tom
Super User Tom
Super User

You really seem to be having trouble explaining what you want.  From the example data step you posted a PROC PRINT will produce this report.

Obs    old

 1     abcd;xyzxybav;charged;
 2     ab;countrynew;zcc;
 3     ab;charged;
 4     ab;cd;dr;
 5     ab

Now if you want to convert those strings into multiple values by splitting on the semi-colons then you could use something like this.  

data tall;
  row+1;
  set abc ;
  length new $20;
  do item=1 by 1 until(new=' ');
    new=scan(old,item,';');
    if item=1 or new ne ' ' then output;
  end;
run;

Which will produce data like this:

Obs    row    old                       new           item

  1     1     abcd;xyzxybav;charged;    abcd            1
  2     1     abcd;xyzxybav;charged;    xyzxybav        2
  3     1     abcd;xyzxybav;charged;    charged         3
  4     2     ab;countrynew;zcc;        ab              1
  5     2     ab;countrynew;zcc;        countrynew      2
  6     2     ab;countrynew;zcc;        zcc             3
  7     3     ab;charged;               ab              1
  8     3     ab;charged;               charged         2
  9     4     ab;cd;dr;                 ab              1
 10     4     ab;cd;dr;                 cd              2
 11     4     ab;cd;dr;                 dr              3
 12     5     ab                        ab              1

Which you could transform back into one observation per original ROW by using PROC TRANSPOSE.

proc transpose data=tall out=want(drop=_name_) prefix=col;
  by row ;
  id item;
  var new ;
run;

Result

Obs    row    col1    col2          col3

 1      1     abcd    xyzxybav      charged
 2      2     ab      countrynew    zcc
 3      3     ab      charged
 4      4     ab      cd            dr
 5      5     ab

If this is NOT what you want then please describe in detail how what you want is different. And what are the rules that explain how to get what you want.

 

 

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 18 replies
  • 2593 views
  • 1 like
  • 5 in conversation