
Complete missing data in big dataset

Learner
Posts: 1


Dear all,

 

I currently have a huge dataset which looks like this:

 

Value 1   Value 2   Value 3   Value 4   Value 5
OK        OK        NAME1     .         OK
OK        OK        NAME2     Value2    OK
OK        OK                  Value1    OK
OK        OK        NAME2     .         OK
OK        OK        NAME3     Value3    OK
OK        OK        NAME3     Value3    OK
OK        OK        NAME1     Value1    OK
OK        OK        NAME2     .         OK
...       ...       ...       ...       ...
OK        OK        NAME3     Value3    OK

 

In the end I would like to have something like this:

Value 1   Value 2   Value 3   Value 4   Value 5
OK        OK        NAME1     Value1    OK
OK        OK        NAME2     Value2    OK
OK        OK                  Value1    OK
OK        OK        NAME2     Value2    OK
OK        OK        NAME3     Value3    OK
OK        OK        NAME3     Value3    OK
OK        OK        NAME1     Value1    OK
OK        OK        NAME2     Value2    OK
...       ...       ...       ...       ...
OK        OK        NAME3     Value3    OK

 

I'm currently struggling with the code for this:

 

I already have the following data step:

data &outlib..&outdsn.;
  set &outlib..&outdsn;

  if missing(value_4) and not missing(value_3) then
    do i=1 %to &vars_N.;
      do until (not missing(value_4));

but I cannot get any further from there.

 

Can anyone help me out?

 

Thanks a lot in advance.

Frederik

Super User
Posts: 9,812

Re: Complete missing data in big dataset

Posted in reply to Freddy662

Post test data in the form of a datastep in a code window!!

 

As posted, I am not going to type that data in.  Why should the third row be used to populate the first one?  I can see no logical reason.  Post something which actually reflects the problem.  At a guess, you could first create a distinct list of Name + Value pairs that are not missing, then merge that back onto the data, e.g.:

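/* Distinct list of non-missing name/value pairs; value4 is renamed to rep */
proc sort data=have out=inter (rename=(value4=rep)) nodupkey;
  by value3 value4;
  where value3 ne "" and value4 ne "";
run;

/* MERGE with a BY statement needs both inputs sorted by value3 */
proc sort data=have out=have_sorted;
  by value3;
run;

data want;
  merge have_sorted inter;
  by value3;
  /* Fill a missing value4 from the lookup value for the same name */
  if value4="" then value4=rep;
  drop rep;
run;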
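
For anyone who wants to try the steps above, here is a minimal sketch of test data written as a data step, which is the form requested earlier in this reply. It only reproduces the rows that are visible in the question. The variable names value1 to value5 follow the naming used in the code above, and the character types and lengths are assumptions, since the original post does not state them.

```sas
/* Hypothetical test data: only the rows shown in the question.      */
/* All columns are assumed to be character; lengths are assumptions. */
data have;
  infile datalines dsd dlm=',' truncover;
  length value1 value2 $2 value3 $5 value4 $6 value5 $2;
  input value1 value2 value3 value4 value5;
datalines;
OK,OK,NAME1,,OK
OK,OK,NAME2,Value2,OK
OK,OK,,Value1,OK
OK,OK,NAME2,,OK
OK,OK,NAME3,Value3,OK
OK,OK,NAME3,Value3,OK
OK,OK,NAME1,Value1,OK
OK,OK,NAME2,,OK
;
run;
```

The DSD option makes two consecutive commas read as a missing value, so the gaps in value3 and value4 arrive as blanks rather than shifting the remaining columns.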