dkassis
Calcite | Level 5

Hi,

 

I am trying to improve query performance in Hive by computing column statistics from within SAS, but I am having trouble getting the syntax right. I was reading this paper (http://support.sas.com/resources/papers/proceedings17/SAS0190-2017.pdf) and trying the method in tip 7, but I can't get the following to run: Analyze table xxxxxxxxxxx compute statistics;

 

Any advice?

 

Thanks,

Dan

3 REPLIES
r_behata
Barite | Level 11

The method you are referring to in the paper runs the query optimization statistics from within Hive, which queries Hive's native metastore. You may not be able to run the same command within SAS.

 

Try running the command from the Hive prompt or from HUE (Cloudera).

LinusH
Tourmaline | Level 20

But if for some reason you want to call this from SAS, what have you tried so far?

Not sure, but I guess that you need to put this inside an EXECUTE ( ) BY HADOOP; block.
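As a rough sketch of that idea, the ANALYZE statement could be wrapped in explicit pass-through along the lines below. This assumes SAS/ACCESS Interface to Hadoop is available; the server, port, schema, user, password, and table name (hive-host, 10000, default, myuser, mypass, my_table) are placeholders and not values from this thread. Note that the Hive statement inside EXECUTE ( ) carries no trailing semicolon of its own.

```sas
proc sql;
   /* Placeholder connection values: replace with your own HiveServer2 details */
   connect to hadoop (server="hive-host" port=10000 schema=default
                      user="myuser" password="mypass");

   /* Table-level statistics, passed to Hive unchanged */
   execute (analyze table my_table compute statistics) by hadoop;

   /* Column-level statistics, assuming your Hive version accepts
      FOR COLUMNS without an explicit column list */
   execute (analyze table my_table compute statistics for columns) by hadoop;

   disconnect from hadoop;
quit;
```

If a statement fails, the SAS log should show the error returned by Hive, which helps tell a SAS syntax problem apart from a Hive one.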

Data never sleeps
yevgeniyelbert
Fluorite | Level 6

I am looking into whether this is also provided as part of the implicit functionality, but you can certainly do this with explicit pass-through by adding hive.stats.column.autogather = true.
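A minimal sketch of what that might look like, assuming the SET statement is passed through on the same connection as the statement that loads the table. The connection values and the table names my_table and my_staging_table are hypothetical. Whether hive.stats.column.autogather is available, and what its default is, depends on your Hive version.

```sas
proc sql;
   /* Placeholder connection values: replace with your own */
   connect to hadoop (server="hive-host" port=10000 schema=default
                      user="myuser" password="mypass");

   /* Ask Hive to gather column statistics automatically for this session */
   execute (set hive.stats.column.autogather=true) by hadoop;

   /* Hypothetical load step: statistics would be collected as the data is written */
   execute (insert overwrite table my_table
            select * from my_staging_table) by hadoop;

   disconnect from hadoop;
quit;
```

The setting only applies to the session opened by this CONNECT statement, so the load has to run before DISCONNECT.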
