Hello All, I am working on SAS version: 9.04.01M4P110916 which is hosted on RHEL 64bit. I want to optimize a Proc SQL query. The source data is in Hadoop and client has only SAS ACCESS FOR HADOOP connector. The records in source table are 101730000. Below is the SAS query: PROC SQL; RESET inobs=max outobs=max noflow nofeedback noprompt nonumber; CREATE TABLE work.Test1 AS select distinct From libname.table Where column8 in ( value1,value2,value3,value4,value5,value6,value7,value8,value9,value10,value11,value12,value13,value14,value15, value16,value17); QUIT; The current query execution time: 06mins Please suggest is there anyway i can improve the efficiency (execution time) of above step.
... View more