Hi,

I am providing sample data below, and I need to check two things:

1) The PAN number must be exactly 10 characters long, and the first 5 characters must be alphabetic.
2) I need to identify the customers who hold more than 20% of the available corpus.

data have1;
    infile datalines truncover;
    /* declare lengths up front so names and PAN values are not truncated to 8 characters */
    length invname $20 trtype $10 PANNO $12;
    /* & lets invname contain single embedded blanks; fields are separated by two or more blanks */
    input trdate :ddmmyy10. invname $ & corpus :comma15. trtype $ PANNO $;
    format trdate date9. corpus comma15.2;
    datalines;
14/1/2013  FAISAL          20,000.00  SWITCH    AJYPA*****
14/1/2013  RAMESH       1,000,000.00  PURCHASE  ALOPP*****
11/1/2013  AMIN           250,000.00  PURCHASE  ***PA4084F
11/1/2013  A AMIN         200,000.00  PURCHASE  A*********
15/1/2013  MOHAN          200,000.00  PURCHASE  ACB*******
11/1/2013  RAHUL          100,000.00  PURCHASE  ***PD9407B
10/1/2013  BHAT            15,000.00  PURCHASE  AXQ*******
14/1/2013  SHIRISH      2,500,000.00  PURCHASE  AAGPR9****
14/1/2013  CHATURBHUJ     120,000.00  PURCHASE  ***PD0807F
11/1/2013  KUMAR           95,000.00  PURCHASE  ABTPG3****
11/1/2013  G. KUMAR       110,000.00  SWITCH    ADTPM5692E
14/1/2013  SHASHI         175,000.00  SWITCH    ADC********
;
run;

Please help.
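The two checks described above could be sketched roughly as follows. This is only a minimal sketch under stated assumptions, not a definitive answer. The dataset names `sample`, `pan_check` and `big_holders`, the `|` delimiter, and the six-row subset are hypothetical choices made for illustration. "More than 20% of available corpus" is read here as "the investor's summed corpus exceeds 20% of the total corpus across all rows", which may not be what is intended. Because the posted PAN values are masked with asterisks, any value whose first five positions are masked is flagged as invalid.

```sas
/* Hypothetical mini-sample taken from the rows above; '|' is an assumed delimiter */
data sample;
    infile datalines dlm='|' truncover;
    length invname $20 PANNO $12;
    input invname $ corpus :comma15. PANNO $;
    datalines;
FAISAL|20,000.00|AJYPA*****
RAMESH|1,000,000.00|ALOPP*****
AMIN|250,000.00|***PA4084F
SHIRISH|2,500,000.00|AAGPR9****
G. KUMAR|110,000.00|ADTPM5692E
SHASHI|175,000.00|ADC********
;
run;

/* Check 1: PAN is exactly 10 characters and its first 5 characters are letters */
data pan_check;
    set sample;
    pan      = strip(PANNO);
    len_ok   = (length(pan) = 10);
    /* NOTALPHA returns 0 when every character in the string is alphabetic */
    alpha_ok = (notalpha(substr(pan, 1, 5)) = 0);
    pan_ok   = (len_ok and alpha_ok);
run;

/* Check 2 (assumed meaning): investors whose summed corpus is more than 20% of the grand total */
proc sql;
    create table big_holders as
    select invname,
           sum(corpus) as inv_corpus format=comma18.2,
           sum(corpus) / (select sum(corpus) from sample) as share format=percent8.2
    from sample
    group by invname
    having sum(corpus) > 0.20 * (select sum(corpus) from sample);
quit;
```

In `pan_check`, rows with `pan_ok = 0` are the ones failing the PAN rule; `len_ok` and `alpha_ok` show which part of the rule failed. If the 20% threshold is meant per transaction rather than per investor, the `group by` would need to be dropped and the comparison applied to each row's `corpus` instead.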