<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Size of the zipped data set is different from its size before it was zipped in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/Size-of-the-zipped-data-set-is-different-from-its-size-before-it/m-p/472888#M14832</link>
    <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/132557"&gt;@iSAS&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;&lt;FONT face="Courier New" color="#000080" size="3"&gt;&lt;STRONG&gt;We zipped a dataset using:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" color="#0000ff" size="3"&gt;- call&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; system(&lt;/FONT&gt;&lt;FONT face="Courier New" color="#800080" size="3"&gt;"gzip path/data.sas7bdat"&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;)&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;However, when we unzipped it using:&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" color="#0000ff" size="3"&gt;- call&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; system(&lt;/FONT&gt;&lt;FONT face="Courier New" color="#800080" size="3"&gt;"gzip -d path/data.sas7bdat"&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;)&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;We noticed that its size&amp;nbsp;was different&amp;nbsp;from before it was zipped. Does this mean that something changed in the data set? We compared it with its backup data set using PROC COMPARE and found no discrepancy. However,&amp;nbsp;the file sizes are different. May we know the reason behind this?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;That wouldn't work. gzip creates&lt;/P&gt;
&lt;PRE&gt;path/data.sas7bdat.gz&lt;/PRE&gt;
&lt;P&gt;while&lt;/P&gt;
&lt;PRE&gt;gzip -d path/data.sas7bdat&lt;/PRE&gt;
&lt;P&gt;will find that the sas7bdat is not a gzipped file and fail. Since gzip removes the source file, some other process must have recreated the sas7bdat. So I suggest you do a&lt;/P&gt;
&lt;PRE&gt;mv path/data.sas7bdat path/data_new.sas7bdat
gzip -d path/data.sas7bdat.gz&lt;/PRE&gt;
&lt;P&gt;and then do a visual compare of the size and the timestamps of data_new.sas7bdat and data.sas7bdat, as gzip preserves those during the whole process.&lt;/P&gt;
&lt;P&gt;I've never had a change of physical filesize from gzip / gzip -d.&lt;/P&gt;</description>
    <pubDate>Mon, 25 Jun 2018 07:48:16 GMT</pubDate>
    <dc:creator>Kurt_Bremser</dc:creator>
    <dc:date>2018-06-25T07:48:16Z</dc:date>
    <item>
      <title>Size of the zipped data set is different from its size before it was zipped</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Size-of-the-zipped-data-set-is-different-from-its-size-before-it/m-p/472868#M14831</link>
      <description>&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;We zipped a dataset using:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;- call&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; system(&lt;/FONT&gt;&lt;FONT color="#800080" face="Courier New" size="3"&gt;"gzip path/data.sas7bdat"&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;However, when we unzipped it using:&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;- call&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; system(&lt;/FONT&gt;&lt;FONT color="#800080" face="Courier New" size="3"&gt;"gzip -d path/data.sas7bdat"&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;We noticed that its size&amp;nbsp;was different&amp;nbsp;from before it was zipped. Does this mean that something changed in the data set? We compared it with its backup data set using PROC COMPARE and found no discrepancy. However,&amp;nbsp;the file sizes are different. May we know the reason behind this?&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 25 Jun 2018 04:48:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Size-of-the-zipped-data-set-is-different-from-its-size-before-it/m-p/472868#M14831</guid>
      <dc:creator>iSAS</dc:creator>
      <dc:date>2018-06-25T04:48:40Z</dc:date>
    </item>
    <item>
      <title>Re: Size of the zipped data set is different from its size before it was zipped</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Size-of-the-zipped-data-set-is-different-from-its-size-before-it/m-p/472888#M14832</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/132557"&gt;@iSAS&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;&lt;FONT face="Courier New" color="#000080" size="3"&gt;&lt;STRONG&gt;We zipped a dataset using:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" color="#0000ff" size="3"&gt;- call&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; system(&lt;/FONT&gt;&lt;FONT face="Courier New" color="#800080" size="3"&gt;"gzip path/data.sas7bdat"&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;)&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;However, when we unzipped it using:&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" color="#0000ff" size="3"&gt;- call&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; system(&lt;/FONT&gt;&lt;FONT face="Courier New" color="#800080" size="3"&gt;"gzip -d path/data.sas7bdat"&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;)&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;We noticed that its size&amp;nbsp;was different&amp;nbsp;from before it was zipped. Does this mean that something changed in the data set? We compared it with its backup data set using PROC COMPARE and found no discrepancy. However,&amp;nbsp;the file sizes are different. May we know the reason behind this?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;That wouldn't work. gzip creates&lt;/P&gt;
&lt;PRE&gt;path/data.sas7bdat.gz&lt;/PRE&gt;
&lt;P&gt;while&lt;/P&gt;
&lt;PRE&gt;gzip -d path/data.sas7bdat&lt;/PRE&gt;
&lt;P&gt;will find that the sas7bdat is not a gzipped file and fail. Since gzip removes the source file, some other process must have recreated the sas7bdat. So I suggest you do a&lt;/P&gt;
&lt;PRE&gt;mv path/data.sas7bdat path/data_new.sas7bdat
gzip -d path/data.sas7bdat.gz&lt;/PRE&gt;
&lt;P&gt;and then do a visual compare of the size and the timestamps of data_new.sas7bdat and data.sas7bdat, as gzip preserves those during the whole process.&lt;/P&gt;
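&lt;P&gt;For illustration, here is a minimal sketch of a gzip round trip on a scratch file rather than the real data set. The temporary directory and the random file contents are assumptions made for the example; only gzip, gzip -d and the data.sas7bdat name come from this thread.&lt;/P&gt;

```shell
# Scratch file standing in for path/data.sas7bdat (hypothetical test data)
tmpdir=$(mktemp -d)
f="$tmpdir/data.sas7bdat"
head -c 65536 /dev/urandom > "$f"

# Record byte size and checksum before compressing
size_before=$(wc -c "$f" | awk '{print $1}')
sum_before=$(cksum "$f" | awk '{print $1}')

gzip "$f"          # writes data.sas7bdat.gz and removes data.sas7bdat
gzip -d "$f.gz"    # restores data.sas7bdat and removes the .gz

# Record the same values after the round trip
size_after=$(wc -c "$f" | awk '{print $1}')
sum_after=$(cksum "$f" | awk '{print $1}')

echo "size: $size_before -> $size_after"
echo "cksum: $sum_before -> $sum_after"
```

&lt;P&gt;If both pairs of values match here but the sizes of your real files differ, the difference most likely comes from something other than gzip.&lt;/P&gt;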
&lt;P&gt;I've never had a change of physical filesize from gzip / gzip -d.&lt;/P&gt;</description>
      <pubDate>Mon, 25 Jun 2018 07:48:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Size-of-the-zipped-data-set-is-different-from-its-size-before-it/m-p/472888#M14832</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2018-06-25T07:48:16Z</dc:date>
    </item>
    <item>
      <title>Re: Size of the zipped data set is different from its size before it was zipped</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Size-of-the-zipped-data-set-is-different-from-its-size-before-it/m-p/474356#M14882</link>
      <description>My mistake, I mixed it up. We encountered this issue with cport/cimport, not with gzip.</description>
      <pubDate>Fri, 29 Jun 2018 07:01:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Size-of-the-zipped-data-set-is-different-from-its-size-before-it/m-p/474356#M14882</guid>
      <dc:creator>iSAS</dc:creator>
      <dc:date>2018-06-29T07:01:34Z</dc:date>
    </item>
  </channel>
</rss>

