BookmarkSubscribeRSS Feed
Sathish_jammy
Lapis Lazuli | Level 10

 

/*in the list of Cards - first one is exact word, followup are have typo err.*/

data CATEGORY;
input MODE :$20;
cards;			
A.VELOCITY
A.VALOSITY
A.VELOCIY
B.HYDROCHLORIC 
B.HYDRRACLORIK
B.HYDROCLOKIK
C.GEOMETRY
C.GEMENTRY
run;
/* the above dataset is just for an example */

/* I have a table in SQL with same kind of problem, As now just keep on eye with cards */
proc sql; update CATEGORY set Column2 = 'physics' where MODE =* 'A.VELOCITY'; run; proc sql; update CATEGORY set Column2 = 'chemistry' where MODE =* 'B.HYDROCLORIC'; run; proc sql; update CATEGORY set Column2 = 'maths' where MODE =* 'C.GEOMENTRY'; run;

/*Is there any effective way to resolve this than some other methods in SAS */
/*I don't care about the initial (A. B. C.) but i need to focus on methods(velocity geometry hydrochloric)

 

MY MAJOR CONCERN ON CATEGORIZING "TYPO ERR"

1 REPLY 1
Patrick
Opal | Level 21

@Sathish_jammy

You need to somehow standardize your data using some mechanism to group/cluster values with similar strings.

 

There are quite a few discussions and solution approaches around such "typo" problems in this forum.

To start with use search terms like: SPEDIS, COMPGED, FUZZY  ...and I'm sure the posts you find with these terms will give you ideas for other search terms you could use as well.

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 1 reply
  • 603 views
  • 1 like
  • 2 in conversation