<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Configure Parquet as default Hive file format when creating tables via SAS Enterprise Guide in Administration and Deployment</title>
    <link>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653534#M19089</link>
    <description>&lt;P&gt;Thanks a lot Patrick. It worked!&lt;/P&gt;</description>
    <pubDate>Fri, 05 Jun 2020 08:36:04 GMT</pubDate>
    <dc:creator>OGA</dc:creator>
    <dc:date>2020-06-05T08:36:04Z</dc:date>
    <item>
      <title>Configure Parquet as default Hive file format when creating tables via SAS Enterprise Guide</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/652888#M19064</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;Users are using SAS Access to Hadoop from SAS Enterprise Guide.&lt;/P&gt;&lt;P&gt;I would like for all users to create tables with Parquet format as default format.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In order to force this, I have added the following option to the LIBNAME definition in the SAS Management Console under LIBNAME properties &amp;gt; Options &amp;gt; Advance Options &amp;gt; Other Properties&lt;/P&gt;&lt;P&gt;DBCREATE_TABLE_OPTS="stored as Parquet"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This works OK with Implicit SQL, but when using Explicit SQL Pass-through, this option is ignored and the table is created as Text File which is the default format for Hive. The way users connect to Hadoop with Explicit pass-through is by using a preassigned libname called HVINFLAB:&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier" size="2"&gt;proc sql;&lt;/FONT&gt;&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;&lt;FONT face="courier new,courier" size="2"&gt;connect using HVINFLAB as HADOOPL;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;execute by HADOOPL(&lt;/FONT&gt;&lt;/P&gt;&lt;P class="lia-indent-padding-left-60px"&gt;&lt;FONT face="courier new,courier" size="2"&gt;create table ci0101_tablaneglab_sas5 as&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;select ci0101_numpersona &lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;from 04_negocio.ci0101&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;where ci0101_nroperiod = 2003&lt;/FONT&gt;&lt;/P&gt;&lt;P class="lia-indent-padding-left-60px"&gt;&lt;FONT face="courier new,courier" size="2"&gt;and meta_codienti = '2095');&lt;/FONT&gt;&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;&lt;FONT face="courier new,courier" size="2"&gt;disconnect from HADOOPL;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier" size="2"&gt;quit;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This is the LIBNAME I get from SAS Management Console&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;FONT size="2"&gt;LIBNAME hvinflab HADOOP URI="jdbc:hive2://kbzae01pro-acc.bbk.es:10002/04_infra_lab;principal=hive/kbzae01pro.bbk.es@BBK.ES;ssl=true"&lt;/FONT&gt; &lt;STRONG&gt;DBCREATE_TABLE_OPTS="stored as Parquet"&lt;/STRONG&gt; PORT=10002 SERVER="kbzae01pro-acc.bbk.es" SCHEMA="04_infra_lab" ;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have also tried to use the field "Hive Properties" under Libname properties &amp;gt; Advance Options &amp;gt; Connection. adding the following:&lt;/P&gt;&lt;P&gt;hive.default.fileformat=Parquet. This is the libname statement&amp;nbsp; I get:&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier" size="2"&gt;LIBNAME hvinflab HADOOP&amp;nbsp;&lt;STRONG&gt;URI="jdbc:hive2://kbzae01pro-acc.bbk.es:10002/04_infra_lab;principal=hive/kbzae01pro.bbk.es@BBK.ES;ssl=true"&amp;nbsp;&amp;nbsp;&lt;/STRONG&gt;&lt;STRONG&gt;PROPERTIES="hive.default.fileformat=Parquet" &lt;/STRONG&gt;&amp;nbsp;PORT=10002 SERVER="kbzae01pro-acc.bbk.es"&amp;nbsp;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, SAS is also ignoring this field. It seems that if the field "Hive JDBC URL" is used (which translates to "URI=" in the libname statement), the field "Hive Properties" is ignored.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;¿Is there a way of setting Parquet as default hadoop file format so that Explicit SQL Pass through does not ignore this configuration? I know that it can be specified on the CREATE TABLE statement but I am looking for another option so that users do not have to specify it everytime they want to create a table with Explicit SQL Pass-Through.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
      <pubDate>Wed, 03 Jun 2020 14:37:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/652888#M19064</guid>
      <dc:creator>OGA</dc:creator>
      <dc:date>2020-06-03T14:37:30Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Parquet as default Hive file format when creating tables via SAS Enterprise Guide</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653082#M19076</link>
      <description>&lt;P&gt;In order to make a connection to Hadoop from SAS, we copy the JAR's and XML's of the hadoop cluster to SAS compute server. In the XML files there is a hive-site.xml file from which the properties for the SAS Session are read. I think you can change the default file format property "&lt;SPAN&gt;hive.default.fileformat" in this file and see if it takes effect. &lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;I haven't tried this myself so let me know if it works for you.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Source:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Property configuration Manual:&amp;nbsp;&lt;A href="https://cwiki.apache.org/confluence/display/Hive/AdminManual+Configuration#AdminManualConfiguration-HiveConfigurationVariables" target="_blank"&gt;https://cwiki.apache.org/confluence/display/Hive/AdminManual+Configuration#AdminManualConfiguration-HiveConfigurationVariables&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Supported file format types:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://cwiki.apache.org/confluence/display/Hive/FileFormats#:~:text=The%20hive.,is%20the%20parameter's%20default%20value." target="_blank"&gt;https://cwiki.apache.org/confluence/display/Hive/FileFormats#:~:text=The%20hive.,is%20the%20parameter's%20default%20value.&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Jun 2020 06:29:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653082#M19076</guid>
      <dc:creator>AnandVyas</dc:creator>
      <dc:date>2020-06-04T06:29:11Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Parquet as default Hive file format when creating tables via SAS Enterprise Guide</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653085#M19077</link>
      <description>&lt;P&gt;It makes sense to me that SAS doesn't add the &lt;EM&gt;store as parquet&lt;/EM&gt; bit to explicit pass-through SQL. SAS is not supposed to touch explicit SQL at all.&lt;/P&gt;
&lt;P&gt;I would expect that for implicit SQL &lt;EM&gt;store as parquet&lt;/EM&gt; gets added to the SQL sent to Hadoop. Besides of training your users to also add the &lt;EM&gt;store as parquet&lt;/EM&gt; bit to the explicit SQL they write, you could try if you can change the default via the &lt;A href="https://go.documentation.sas.com/?docsetId=acreldb&amp;amp;docsetTarget=n1h398otek0j00n1itib5k7ch738.htm&amp;amp;docsetVersion=9.4&amp;amp;locale=en" target="_self"&gt;DBCONINIT&lt;/A&gt; Libname option - something like &lt;A href="https://stackoverflow.com/questions/44038151/hive-how-to-set-parquet-orc-as-default-output-format" target="_self"&gt;dbconinit='set hive.default.fileformat=Parquet;'&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Jun 2020 06:50:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653085#M19077</guid>
      <dc:creator>Patrick</dc:creator>
      <dc:date>2020-06-04T06:50:48Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Parquet as default Hive file format when creating tables via SAS Enterprise Guide</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653533#M19088</link>
      <description>&lt;P&gt;Hi Anand&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I don´t have write access to those configuration files so I opted for the other suggestion becuase it seemed easier to implement.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Nevertheless, thanks a lot for your help&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Jun 2020 08:35:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653533#M19088</guid>
      <dc:creator>OGA</dc:creator>
      <dc:date>2020-06-05T08:35:17Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Parquet as default Hive file format when creating tables via SAS Enterprise Guide</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653534#M19089</link>
      <description>&lt;P&gt;Thanks a lot Patrick. It worked!&lt;/P&gt;</description>
      <pubDate>Fri, 05 Jun 2020 08:36:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653534#M19089</guid>
      <dc:creator>OGA</dc:creator>
      <dc:date>2020-06-05T08:36:04Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Parquet as default Hive file format when creating tables via SAS Enterprise Guide</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653551#M19090</link>
      <description>&lt;P&gt;Thanks for the feedback. That's good to know!&lt;/P&gt;</description>
      <pubDate>Fri, 05 Jun 2020 10:15:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Configure-Parquet-as-default-Hive-file-format-when-creating/m-p/653551#M19090</guid>
      <dc:creator>Patrick</dc:creator>
      <dc:date>2020-06-05T10:15:37Z</dc:date>
    </item>
  </channel>
</rss>

