07-20-2014 10:49 AM
07-21-2014 12:02 PM
We either need a more complete data example or tell us how to get a 2007 value when there isn't one in the example data since the missing number of employees is for 2008.
And since there are multiple values of year there must be something else to align the "correct" number of imputed employees. For instance, for 2010 you have 24, 24, 31, 21 and 24 employees. If a 2011 value is missing and needs to be imputed which of the 2010 values should be used?
07-21-2014 12:35 PM
What you are asking about is called 'last observation carried forward' (LOCF). There is an entire literature about why this approach is not statistically sound for imputation (just Google it sometime), but it is still used a lot in the social sciences. I don't intend that as a criticism (sometimes we are 'told' what to do), but wanted to point out that LOCF is not without its detractors.
EGuide can't do LOCF with the point-and-click interface; you have to write a data step and use the LAG function (See
If the first item is missing, then you can sort the results in descending order and do it again.