I have an extremely large csv dataset (in the millions) that is formatted in an odd way. There is only one column for variable values, and another column with a code for what each value is measuring. So, for example, if I had a small dataset with two variables A and B that was formatted the normal way: Year A B 1994 10 30 1994 20 40 In the scheme I was given, it would be formatted in this way: Year Value Code CodeDescription 1994 10 2844 "A" 1994 20 2844 "A" 1994 30 2855 "B" 1994 40 2855 "B" Using some simple "if" statements, I can read in the data in this way to separate it all out: if(Code = 2844) then A = Value if(Code = 2855) then B = Value When I read in the data and sort it by year, it will come out in this way: Year A B 1994 10 . 1994 20 . 1994 . 30 1994 . 40 Even if I sort it by date, it will come out this way. I would like it to be formatted as such: Year A B 1994 10 30 1994 20 40 I've tried a bunch of different options but cannot seem to get it to sort properly. Some of the variables have more measurements than others, and my goal is to have it aligned such that any missing variables will be at the end of the date sets, and the data can be aligned as best as it can be. I have found that, due to the way the data is originally formatted, SAS will treat each read as an individual observation, and set any other variables to missing. Does anyone know of a way I can do this? I have tried everything from exporting it and re-reading it in to transposing twice to see if that would make any sort of difference.
... View more