05-10-2012 10:21 AM
I have some two million records (email id's) in that some of wrong email addresses like ('@' missing, '.com' missing, '.net' missing,...) and each email address their own character length...so now my question is 1) How to identify the 'error' email id's ?
2) How to delete 'error' email id's ?
3) How to make a two different data sets for 'error ones' and 'non errors' ?
any one can please help the logic (code).
05-10-2012 11:31 AM
For finding valid adresses I think perl regular expressions functions would be a good way (prxparse, prxmatch).
You can output data to different datasets within the same data step.
data A B;
if condition then output A;
else output B;
05-10-2012 03:08 PM