BookmarkSubscribeRSS Feed
vnreddy
Quartz | Level 8

Hi,

 

I have two datasets with same column names 

I need to validate the values based on primary_account_no which column values are matching and not-matching from Old_data to New_data

 

for e.g., below table has same values till A4_DOB in both the datasets. whereas, A1-A3 values are not matching in New_data with Old_data.

1. I need to show such difference in a new column with Old_data values. 

 

Primary_Account_No No_apps A1_DOB A2_DOB A3_DOB A4_DOB A1_Exist_customer A2_Exist_customer A3_Exist_customer
12345678 3 19350119 19650420 19600507 0 M M M
12345679 3 19440223 19710705 19700720 0 M M M

 

could someone help me with this problem. I have more than 150 columns to compare.

 

Thanks

Vennapusa

 

5 REPLIES 5
ballardw
Super User

Since you don't show anything as a New or Old data set or the desired result for the comparison it is a bit difficult to determine exactly what you want.

 

Proc Compare may do some of what you want. You would want to make sure the data sets are both sorted by the same variables, your Primary_account_no for example, then use a BY statement with the same variable.

vnreddy
Quartz | Level 8

Hi,

 

I am expecting results something like below:

Description Primary_Account_No No_apps A1_DOB A2_DOB A3_DOB A4_DOB A1_Exist_customer A2_Exist_customer A3_Exist_customer A4_Exist_customer
Actual_value 12345678 3 19350119 19650420 19600507 0        
Expected_value 12345678 3 19350119 19650420 19600507 0 M M M  
STATUS 12345678 PASSED PASSED PASSED PASSED PASSED FAILED FAILED FAILED PASSED
Actual_value 12345679 3 19440223 19710705 19700720 0        
Expected_value 12345679 3 19440223 19710705 19700720 0 M M M  
STATUS 12345679 PASSED PASSED PASSED PASSED PASSED FAILED FAILED FAILED PASSED
Actual_value 12345680 2 19670201 19701019 0 0        
Expected_value 12345680 2 19670201 19701019 0 0 A A    
STATUS 12345680 PASSED PASSED PASSED PASSED PASSED FAILED FAILED PASSED PASSED
JerryV
SAS Employee

How about the proc compare part of this?

 

data work.carsOrig work.carsNew; set sashelp.cars;
primary_account_no+1;
run;

data work.carsNew; set work.carsNew; if primary_account_no= 3 then make='xxxxxxx'; run;

 

proc compare base=work.carsorig
compare=work.carsnew
out=differences outnoequal outall
noprint ;
by primary_account_no;
run;

vnreddy
Quartz | Level 8

Hi,

Based on below table:

Description Primary_Account_No No_apps A1_DOB A2_DOB A3_DOB A4_DOB A1_Exist_customer A2_Exist_customer A3_Exist_customer
Actual_value 12345678 3 19350119 19650420 19600507 0      
Expected_value 12345678 3 19350119 19650420 19600507 0 M M M
STATUS 12345678 PASSED PASSED PASSED PASSED PASSED FAILED FAILED FAILED
Actual_value 12345679 3 19440223 19710705 19700720 0      
Expected_value 12345679 3 19440223 19710705 19700720 0 M M M
STATUS 12345679 PASSED PASSED PASSED PASSED PASSED FAILED FAILED FAILED
Actual_value 12345680 2 19670201 19701019 0 0      
Expected_value 12345680 2 19670201 19701019 0 0 A A  
STATUS 12345680 PASSED PASSED PASSED PASSED PASSED FAILED FAILED PASSED

 

can i output something like this:

Variable Passed Failed Total
Primary_Account_No 73 27 100
No_apps 27 73 100

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 5 replies
  • 1636 views
  • 0 likes
  • 4 in conversation