for missing values in industry variable, I am looking to compare email_id which is partially anonymized (name is anonymized but industry remain same) with the list of distinct values of industry column to derive the industry name from email address and return the matching value for missing records. for example: in row 6, email id is GGYDNEE3@homeandgardenBAHG.co, i want to compare it with all distinct values of industry column (in this case matching value would be home and garden) and return that value for missing value in industry column, similarly row 15 should get 'nursery' and row 20 should get 'landscape designer' for missing values in industry column. what is the logic to use here ?
... View more