<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Validating dataset in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523431#M142197</link>
    <description>&lt;P&gt;Dear SAS experts,&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;I want to have some suggestions to validate a dataset which has 100 variables and over 100000 records. I am importing data in string form (even for the numeric data). I observed some variables getting truncated in the end. How do I validate each variable to make sure that the data populated totally (not truncated)?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; How do we validate the dataset?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 24 Dec 2018 20:45:57 GMT</pubDate>
    <dc:creator>buddha_d</dc:creator>
    <dc:date>2018-12-24T20:45:57Z</dc:date>
    <item>
      <title>Validating dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523431#M142197</link>
      <description>&lt;P&gt;Dear SAS experts,&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;I want to have some suggestions to validate a dataset which has 100 variables and over 100000 records. I am importing data in string form (even for the numeric data). I observed some variables getting truncated in the end. How do I validate each variable to make sure that the data populated totally (not truncated)?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; How do we validate the dataset?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 24 Dec 2018 20:45:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523431#M142197</guid>
      <dc:creator>buddha_d</dc:creator>
      <dc:date>2018-12-24T20:45:57Z</dc:date>
    </item>
    <item>
      <title>Re: Validating dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523437#M142198</link>
      <description>&lt;P&gt;Just as an example :&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;data one;&lt;BR /&gt;infile "xyz.txt" RECFM=V LRECL=2000 PAD MISSOVER;&lt;BR /&gt;length&lt;BR /&gt;a1 $20.&lt;BR /&gt;a2 $100.&lt;BR /&gt;a3 $50.&lt;BR /&gt;;&lt;BR /&gt;input a1 $&lt;BR /&gt;a2 $&lt;BR /&gt;a3 $&lt;BR /&gt;;&lt;BR /&gt;run;&lt;/P&gt;</description>
      <pubDate>Mon, 24 Dec 2018 21:11:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523437#M142198</guid>
      <dc:creator>buddha_d</dc:creator>
      <dc:date>2018-12-24T21:11:33Z</dc:date>
    </item>
    <item>
      <title>Re: Validating dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523439#M142199</link>
      <description>&lt;P&gt;Unless you have a completely correct version of your data to compare to you are not going to be able to validate your data fully.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What is it about your input file that means you are unable to read it correctly in the first place?&lt;/P&gt;</description>
      <pubDate>Mon, 24 Dec 2018 21:49:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523439#M142199</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2018-12-24T21:49:44Z</dc:date>
    </item>
    <item>
      <title>Re: Validating dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523440#M142200</link>
      <description>&lt;P&gt;SAS Kiwi,&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;For example the string value is 1532564.7564 and I am getting&amp;nbsp;&lt;SPAN&gt;1532564.756 after my import. So, My question is this is the value that got is getting truncated. But, when I change it to numberical data then I would get the full value. Like wise, Licnum is character data (eg:12xd456) and this is getting truncated in the last digit ( shows up as12xd456).&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;there are about 100 columns with 100,000 records, how do I validate each column that there is no truncation. I have specification sheet, but this is not importing each column perfectly. In order to get the data with out truncation, how do I code each column to check and see that data imported with out truncation. When it is a huge dataset, it is hard to check each line, so I am thinking of having some kind of macro that checks the maximum length to begin with. Based on that I could check the specifications and confirm if I need to increase the length of&amp;nbsp; a variable.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 24 Dec 2018 22:20:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523440#M142200</guid>
      <dc:creator>buddha_d</dc:creator>
      <dc:date>2018-12-24T22:20:59Z</dc:date>
    </item>
    <item>
      <title>Re: Validating dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523442#M142201</link>
      <description>&lt;P&gt;sorry for the typo LicNum showing up as&amp;nbsp;&amp;nbsp;&lt;SPAN&gt;12xd4 instead of&amp;nbsp;12xd456 (actutal)&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 24 Dec 2018 22:25:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523442#M142201</guid>
      <dc:creator>buddha_d</dc:creator>
      <dc:date>2018-12-24T22:25:14Z</dc:date>
    </item>
    <item>
      <title>Re: Validating dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523456#M142208</link>
      <description>&lt;P&gt;Why are you reading in numbers as strings? If you read them in as numbers to begin with you wouldn't get truncation.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For example 1532564.7564 can be read using a numeric INFORMAT like so: &lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;input @10 MyNum 12.;&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Tue, 25 Dec 2018 07:23:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523456#M142208</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2018-12-25T07:23:42Z</dc:date>
    </item>
    <item>
      <title>Re: Validating dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523497#M142224</link>
      <description>Do you have a specification document? SAS cannot know what the values should be, it does a best guess. If it doesn't work, then you need to tell SAS what it should be or how to read it, which needs to come from somewhere. So for the 100 variables, how do you know, besides inspecting every record, what the type, format, and length it should be.</description>
      <pubDate>Tue, 25 Dec 2018 19:40:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validating-dataset/m-p/523497#M142224</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2018-12-25T19:40:19Z</dc:date>
    </item>
  </channel>
</rss>

