BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Feragon42
Fluorite | Level 6

Hello everybody,

Today I come with this problem that I hope you can help me.

Let's say that I have this dataset: 

 

 

data have;
input A $ B C D $;
datalines;
try 0 57 0
do . 42 something
with 442 . this
;

 

 

And I need to return this:

 

Captura.PNG

 

The new variable, N_MISS, count how many missing character I have in every observation. I know  can do this easily with the function CMISS(OF A-D), but the problem is that I need to consider the 0 as a missing value.

I create a format to identify with 0 and 1 when the value are missing or not, and then sum al the variables of the observation and have the amount of not missing values.

 

PROC FORMAT LIBRARY = FORMATS;
	INVALUE $CMISS_COUNT '0' = 0
	                     ' ' = 0
		             '.' = 0
			     OTHER = 1;
	INVALUE NMISS_COUNT 0 = 0
			    . = 0
			    OTHER = 1;
RUN;

But, when I apply a format, the value of the character still the same for the sum. I can use the fuction PUT(A, $CMISS_COUNT.), but in the real excercise in don't have only 4 variables but 250 ... aaaaand do that 250 times is... well.

I hope I can express myself, sorry for my newbie english.

Goodbye, and thanks for all.

 

1 ACCEPTED SOLUTION

Accepted Solutions
collinelliot
Barite | Level 11

Something like this?

 


data want;
    set have;
    array nvars{*} _numeric_;
    array cvars{*} _character_;
    do i = 1 to dim(nvars);
        n_miss = sum(n_miss, (nvars(i) in (., 0)));
    end;
    do j = 1 to dim(cvars);
        c_miss = sum(c_miss, (cvars(j) in ('.', '0')));
    end;
    all_miss = sum(n_miss, c_miss);
run;

View solution in original post

5 REPLIES 5
ballardw
Super User

You can use the translate function in cmiss but you won't be able to use the "of" notation. Or you create a separate array of temporary variables that you apply translate to.

 

data junk;
   x = '0';
   y = '123';
   z = ' ';
   nmiss = cmiss(translate(x,' ','0'),y,z);
run;

This approach does have a potential issue in that it will treat '0000' as missing as well. If that is not desired and the value appears in your data then you may be better off creating an array of temporary variables with some logic involving If/then for exactly '0'.

 

collinelliot
Barite | Level 11

Something like this?

 


data want;
    set have;
    array nvars{*} _numeric_;
    array cvars{*} _character_;
    do i = 1 to dim(nvars);
        n_miss = sum(n_miss, (nvars(i) in (., 0)));
    end;
    do j = 1 to dim(cvars);
        c_miss = sum(c_miss, (cvars(j) in ('.', '0')));
    end;
    all_miss = sum(n_miss, c_miss);
run;
RW9
Diamond | Level 26 RW9
Diamond | Level 26

I can't test this right at the moment, but what about something like:

data want;
  set have;
  total=countc(cats(of a--d),'.0');
run;
Feragon42
Fluorite | Level 6
Interesting method.... But, when the database present a value like '200', the function will detect 2 extra missings thanks to the 0.

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 5 replies
  • 1230 views
  • 3 likes
  • 4 in conversation