Hi Kurt, Just to thank you once again on your solution , really appreciate it. I managed to import the data into SAS My apologies for discarding/commenting the length statement , didn't understand the importance of it, I thought SAS was predisposed to import the value regardless of the length.... I get it now it sets it at 8 characters unless you assign the appropriate length yourself.... still on a learning curve. Thank you again....... This is the final code for the next person who will be in the same kind of situation. /* *
Creating the data set for the Predictive Analysis of Popularity of baby names in the USA
SAS Version 9.4
**/
title 'Baby Names in the USA since 1880'; /* Title of the Dataset */
filename pwd 'C:\Users\Roger\Desktop\BabyNames'; /* Directory where the data is stored*/
/* Create the Baby Names dataset */
data babyNames;
/* set length of other variables as needed, or they will default to 8 */
length fname $200;
length name $41 ;
length sex $1 ;
;
/* ... using input from all the text files which have a delimiter , to split the fields*/
infile pwd(*.txt) DLM=',' filename=fname;
/* ... creating the following input variables to store the data*/
input
name $ /* category, baby name */
sex $ /* category, gender of baby */
number /* number of times this name was given in that year*/
;
year = input(substr(scan(scan(fname,-1,'\'),1,'.'),4,4),4.); /* Input of year value into year attribute*/
label
year = 'Year of Birth'
name = 'Name'
sex = 'Gender'
number = 'Number of occurrences of the name'
;
run;
/* Print the first 1000 observations to see if the data loaded properly */
proc print
data=babyNames /* data set*/
(obs = 1000) /* Only Print the first 1000 obs*/
label; /* Print with given labels*/
run;
... View more