06-20-2014 02:07 AM
Hi all. I am new to SAS and am currently struggling with importing data. I am trying to import a raw data file that is set out in column format (ie no delimiters). There are 49 variables and approximately 1 million observations.
Here is a copy of my code (with most variables not shown):
input episode 1-12
umrn $ 13-21
discharge_diag $ 252-501 (this is variable 29)
pro $ 502-521
sla06 $ 1186-1194; (variable 49)
The problem I am having starts at variable number 29 which is 250 characters long, and in the vast majority of cases, is empty. When I import, for some reason when it gets to variable 29 it drops to the next observation and every variable after that (30-49) also starts on a new observation. I tried 'missover' at the end of the infile line but this just caused variable 29 - 49 to be empty.
Any help would be greatly appreciated.
06-20-2014 03:36 AM
Use the "truncover" option when you have variable length records. This will cause the last variable(s) to be filled with what's available.
06-24-2014 01:38 AM
a) could you please post an example of the input data?
b) could it be that you have linefeeds or CR/LF in the text data (the $250 field)? That would cause a new (short) line, and since you read with position, every variable whose pointer lies beyond the EOL is set to missing