Hello everyone,
I have a scenario where I need to find whether the same "reward_program_id1" OR "reward_program_id2" is used in multiple product_id. In the below data product_id is unique however, reward_program_id1 & reward_program_id2 digits can be used interchangeable to purchase product.
For example, product_id 89067 is purchased using reward_program_id2 = 567694(row #1) and the same digits are used as reward_program_id1 to purchase product_id 90873 (row #4). Likewise, product_id 34340 is purchased using reward_program_id1 = 429008(row #3) and the same digits are used as reward_program_id2 to purchase product_id 10983(row #8).
In the output I am trying to add a column "Multi_program_id" Y/N if the same reward ID is used in more than one product. I hope this makes sense, any help is appreciated!
Thanks in advance.
Data have; input product_id product_delivered $ reward_program_id1 reward_program_id2 ; infile datalines delimiter=','; datalines; 89067,Y,223456,567694 58394,N,343412, 34340,Y,429008,412948 90873,N,567694,293832 91028,N,389292,902221 38034,N,459303,343033 76281,N,828292,282929 10983,Y,909033,429008 18738,N,929292, 17930,Y,343412,982932 29383,N,898903, 11283,Y,976409,829737 98203,N,839234,928302 12930,N,918833,303033 78023,Y,123233,412948 ; run;
Output:
Product_id
product_delivered
reward_program_id1
reward_program_id2
Multi_program_id
89067
Y
223456
567694
Y
58394
N
343412
.
Y
34340
Y
429008
412948
Y
90873
N
567694
293832
Y
91028
N
389292
902221
N
38034
N
459303
343033
N
76281
N
828292
282929
N
10983
Y
909033
429008
Y
18738
N
929292
.
N
17930
Y
343412
982932
Y
29383
N
898903
.
N
11283
Y
976409
829737
N
98203
N
839234
928302
N
12930
N
918833
303033
N
78023
Y
123233
412948
Y
... View more