DATA Step, Macro, Functions and more

Importing a text file with invalid characters

Reply
Contributor
Posts: 69

Importing a text file with invalid characters

While Importing TSV ( tab separated value) data as a SAS data set all the Danish letters are not appearing as the same.

I was using the following code:

proc import datafile='C:\Documents and Settings\Desktop\ExpMeet.tsv'
     out=base_data
     dbms=dlm
     replace;
     delimiter='09'x;
     datarow=2;
run;

Sample Data:

Siddhartha 

Helsinki 001/TelePresence || Lersø Parkalle 001/TelePresence

Bangalore 001/TelePresence ||Høje Taastrup 001/TelePresence

Anyone can help me in reading the Danish and swedish letters while importing .

Regards,

Trusted Advisor
Posts: 1,301

Re: Importing a text file with invalid characters

Posted in reply to Siddhartha

You need to change your encoding option on the import.  Since these are OS specific here is a link where you can find the one you need.

http://support.sas.com/documentation/cdl/en/nlsref/61893/HTML/default/viewer.htm#a002628552.htm

then do something like this:

filename in 'C:\Documents and Settings\Desktop\ExpMeet.tsv' encoding="uft-8";

proc import datafile=in
     out=base_data
     dbms=dlm
     replace;
     delimiter='09'x;
     datarow=2;
run;

Here is an additionaly resource which covers all of SAS's Language capabilities:

SAS(R) 9.2 National Language Support (NLS): Reference Guide

http://support.sas.com/documentation/cdl/en/nlsref/61893/HTML/default/viewer.htm#nlsrefwhatsnew902.h...

Super User
Posts: 10,023

Re: Importing a text file with invalid characters

Posted in reply to Siddhartha

From your sample data, it looks like the delimiter is | or /, rather than  tab .

And make sure your encoding of sas session has already contained Danish and swedish

Ksharp

Ask a Question
Discussion stats
  • 2 replies
  • 1369 views
  • 0 likes
  • 3 in conversation