BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
LL5
Pyrite | Level 9 LL5
Pyrite | Level 9

I am trying to count the total number of missing and non missing values for each character variables.

below is the sample data.

 

data temp;

input (disp1-disp8) ($);

datalines;

 

1 2 . 4 5 6 7 8

. 2 3 4 5 . 7 8

1 2 3 4 5 6 7 8

1 2 3 . . 6 . 8

1 2 3 4 5 6 7 8

1 . 3 . 5 . 7 .

;

run;

My goal is to try to get the following result. Is there anyway to do it for character variable? Thanks. 

 MissingNon Missing
disp115
disp215
disp315
disp424
disp515
disp624
disp715
disp815

 

1 ACCEPTED SOLUTION

Accepted Solutions
Tom
Super User Tom
Super User

Do you want to count the number of non-missing values? Or the number of DISTINCT non-missing values?

The NLEVELS option on PROC FREQ will do the latter.

proc freq nlevels;
  tables disp1-disp8 / noprint;
run;

For the former you can also use PROC FREQ but first make a user defined format.

proc format;
value $cmiss ' '='Missing' other='Non-missing';
run;
proc freq ;
 tables disp1-disp8;
 format disp1-disp8 $cmiss.;
run;

View solution in original post

9 REPLIES 9
Tom
Super User Tom
Super User

Do you want to count the number of non-missing values? Or the number of DISTINCT non-missing values?

The NLEVELS option on PROC FREQ will do the latter.

proc freq nlevels;
  tables disp1-disp8 / noprint;
run;

For the former you can also use PROC FREQ but first make a user defined format.

proc format;
value $cmiss ' '='Missing' other='Non-missing';
run;
proc freq ;
 tables disp1-disp8;
 format disp1-disp8 $cmiss.;
run;
LL5
Pyrite | Level 9 LL5
Pyrite | Level 9

Thank Tom, this works like a charm. I need the former approach and noted the result shows the non missing count, but is there anyway to show the missing count as well?

Tom
Super User Tom
Super User

You probably need to add the missing option the TABLES statement.

tables disp1-disp8 / missing;
AhmedAl_Attar
Rhodochrosite | Level 12

If you want to get the exact output to your original want, try this 


data trans_v(keep=vname miss nmiss)/view=trans_v;
	length vname $5 nmiss miss 4;
	set temp;
	array disps {8} $ disp1-disp8;
	label nmiss='Non Missing' 
		  miss='Missing'
		  ;

	do i=1 to 8;
		call missing(miss,nmiss);
		if (missing(disps{i})) then miss=1;
		else nmiss=1;
		vname=vname(disps{i});
		output; 
	end;
run;
proc summary data=trans_v nway missing;
	class	vname;
	var		miss nmiss;
	output out=stats(drop=_:) sum=;
run;

Ahmed 

LL5
Pyrite | Level 9 LL5
Pyrite | Level 9
Thanks AhmedAI_Attar, this is an awesome approach.
LL5
Pyrite | Level 9 LL5
Pyrite | Level 9

I have one more question, why do we need call missing(miss,nmiss) here?

ballardw
Super User

@LL5 wrote:

I have one more question, why do we need call missing(miss,nmiss) here?


I would suggest running that data step without that statement and compare to the results you get with it.

 

It resets the values of those variable to missing for each step of the loop.

Otherwise you have values from the previous steps that had been set to 1 when i is > 1.

LL5
Pyrite | Level 9 LL5
Pyrite | Level 9

Thanks Ballardw. I ran the data step without the call missing step and noted the number for miss and nmiss variables are kind of accumulating.

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 9 replies
  • 1269 views
  • 3 likes
  • 4 in conversation