Hi,
I am trying to improve query performance in Hive by computing column statistics from within SAS, but I am having trouble getting the syntax correct. I was reading this paper, http://support.sas.com/resources/papers/proceedings17/SAS0190-2017.pdf, and trying the method in tip 7, but I can't get the following to run: Analyze table xxxxxxxxxxx compute statistics;
Any advice?
Thanks,
Dan
The method you are referring to in the paper runs the query optimization statistics from within Hive, which queries Hive's native metastore. You may not be able to run the same command directly within SAS.
Try running the command from the Hive prompt or HUE (Cloudera).
But if for some reason you want to call this from SAS, what have you tried so far?
Not sure, but I guess that you need to put this inside an EXECUTE ( ) BY HADOOP; block.
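For illustration, an explicit pass-through step along those lines might look like the sketch below. It assumes SAS/ACCESS Interface to Hadoop is licensed and configured. The server, port, user, password, and schema values are placeholders rather than values from this thread, and the table name is left masked as in the original post. Note that column-level statistics in Hive use the FOR COLUMNS clause; the statement as posted only gathers table-level statistics.

```sas
/* Sketch only: all connection values below are hypothetical placeholders. */
proc sql;
   connect to hadoop (server="hive-host.example.com" port=10000
                      user="myuser" password="XXXXXXXX" schema=default);

   /* Table-level statistics: the statement from the original post. */
   /* The HiveQL inside the parentheses has no trailing semicolon.  */
   execute (analyze table xxxxxxxxxxx compute statistics) by hadoop;

   /* Column-level statistics use Hive's FOR COLUMNS clause. */
   execute (analyze table xxxxxxxxxxx compute statistics for columns) by hadoop;

   disconnect from hadoop;
quit;
```

If a statement fails, the SAS log should show the error returned by Hive, which helps tell a syntax problem apart from a connection or permission problem.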
I am checking whether this is also provided as part of the implicit pass-through functionality, but you can certainly do this with explicit pass-through by setting hive.stats.column.autogather = true.
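A rough sketch of that idea follows, assuming the Hive version in use supports the hive.stats.column.autogather property and that the SET statement is accepted through the pass-through connection. The connection values and the table names target_table and source_table are hypothetical, used only to show the shape of the step.

```sas
/* Sketch only: connection values and table names are hypothetical. */
proc sql;
   connect to hadoop (server="hive-host.example.com" port=10000
                      user="myuser" password="XXXXXXXX" schema=default);

   /* Ask Hive to gather column statistics automatically for this session. */
   execute (set hive.stats.column.autogather=true) by hadoop;

   /* The setting applies to later statements on the same connection, */
   /* for example a load into the table that will be queried.         */
   execute (insert overwrite table target_table
            select * from source_table) by hadoop;

   disconnect from hadoop;
quit;
```

Because the property is set for the session, it needs to be issued in the same CONNECT block as the statements it is meant to affect.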