Hi All, I am using SAS EG 7.1 and writing some query such as parsethrough which use using to connect to the hive and accessing data from Hadoop but its taking to much time to execute simple queries and i need to improve the performance of query , so could you please help me to improve performance . %let selstr = int(test_acc/100) as pac, test_acc-100 * int(test_acc/100) as test_sub_acc, date-&dt_1. as comp_dt, lstcapdte-&dt_2. as lstcap_dt, test_int_rate/100 as int_rate, Test_cur_bal as Bal, edh_ingest_ts; proc sql; connect to hadoop ( server="XXXXXXXXXXXX" uri="jdbc:hive2://XXXXXX,XXXX,XXXX/; serviceDiscoveryMode=zooKeeper;zooKeeperNamespace=XXXserver2" HIVE_KERBEROS_PRINCIPAL="hive/XXXXXX" port=xxxxx schema=xxxxx read_method=JDBC properties="mapred.job.queue.name=XXXX_default;hive.fetch.task.conversion=minimal;hive.cbo.enable=true; hive.compute.query.using.stats=true;hive.stats.fetch.column.stats=true;hive.stats.fetch.partition.stats=true; hive.prewarm.enabled=true;" dbmax_text=255 ); create table work.test as select * from connection to hadoop ( select &selstr. from &table. where to_date(process_date_time) >=&test_date_1. and to_date(process_date_time) <=&test_date_2. ); disconnect from hadoop; quit; /*processing time as below*/ NOTE: PROCEDURE SQL used (Total process time): real time 35:37.20 2 The SAS System 09:28 Thursday, September 29, 2022 user cpu time 1:19.61 system cpu time 26.64 seconds memory 5633.62k OS Memory 43176.00k Timestamp 29/09/2022 03:46:38 PM Step Count 8 Switch Count 3 Page Faults 68 Page Reclaims 440 Page Swaps 0 Voluntary Context Switches 92058 Involuntary Context Switches 26290 Block Input Operations 0 Block Output Operations 0
... View more