dkassis
Calcite | Level 5

Hi,

 

I am trying to improve query performance in Hive by computing column statistics from within SAS, but I am having trouble getting the syntax right. I was reading this paper: http://support.sas.com/resources/papers/proceedings17/SAS0190-2017.pdf and trying the method in tip 7, but I can't get the following to run: Analyze table xxxxxxxxxxx compute statistics;

 

Any advice?

 

Thanks,

Dan

3 REPLIES
r_behata
Barite | Level 11

The method you are referring to in the paper computes the query-optimization statistics from within Hive, working against Hive's native metastore. You may not be able to run the same command as-is from within SAS.

 

Try running the command from the Hive prompt or from HUE (Cloudera).

LinusH
Tourmaline | Level 20

But if for some reason you want to call this from SAS, what have you tried so far?

Not sure, but I guess that you need to put this inside an EXECUTE ( ) BY HADOOP; block.
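
A minimal sketch of what that could look like, assuming SAS/ACCESS Interface to Hadoop is licensed and configured. The server, port, credentials, schema, and the table name my_table are placeholders, not values from this thread:

```sas
proc sql;
   /* Connection values below are hypothetical; use your own site's settings */
   connect to hadoop (server="hive.example.com" port=10000
                      user=myuser password="XXXXXXXX" schema=default);

   /* Table-level statistics; the HiveQL goes inside the parentheses */
   /* with no trailing semicolon of its own                          */
   execute (analyze table my_table compute statistics) by hadoop;

   /* Column-level statistics; older Hive releases may require an */
   /* explicit column list after FOR COLUMNS                      */
   execute (analyze table my_table compute statistics for columns) by hadoop;

   /* Write the pass-through return code and message to the log */
   %put &=sqlxrc &=sqlxmsg;

   disconnect from hadoop;
quit;
```

If an EXECUTE fails, SQLXRC and SQLXMSG should hold the return code and message that came back from Hive, which is usually the quickest way to see what it objected to.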

Data never sleeps
yevgeniyelbert
Fluorite | Level 6

I am still looking into whether this is also available through implicit pass-through, but you can certainly do it with explicit pass-through by adding hive.stats.column.autogather = true
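
As a rough sketch, assuming the property is set for the session through an EXECUTE statement and that your Hive version supports it. The connection values and the table names source_table and target_table are hypothetical:

```sas
proc sql;
   /* Connection values below are hypothetical */
   connect to hadoop (server="hive.example.com" port=10000
                      user=myuser password="XXXXXXXX" schema=default);

   /* Ask Hive to gather column statistics automatically for */
   /* statements that write data later in this session       */
   execute (set hive.stats.column.autogather=true) by hadoop;

   /* Hypothetical load step; statistics would be collected as part of it */
   execute (insert overwrite table target_table
            select * from source_table) by hadoop;

   disconnect from hadoop;
quit;
```

The setting only lasts for the connection it was issued on, so the SET and the statement that writes the data need to sit between the same CONNECT and DISCONNECT.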
