<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Data quality and management help required? in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Data-quality-and-management-help-required/m-p/131888#M26855</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Data-warehouse contains small to very large datasets with over 225 different variables in each data sets. Business requirement says there is only one type of error is expected and that is there would be a missing or blank variable value of any variable in a row, or in many rows, and if found the entire row/rows must be deleted. The catch is the blank could occur in any of the 225 variables and in any row, &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;is there a function or logic in datstep to do it or what do you guys think about Proc Datasets?&lt;/P&gt;&lt;P&gt;It's an &lt;SPAN style="text-decoration: underline;"&gt;&lt;STRONG&gt;appending table&lt;/STRONG&gt;&lt;/SPAN&gt; monthly, quarterly and yearly, therefore the preference is to use automated simple solutions.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 07 May 2013 18:33:31 GMT</pubDate>
    <dc:creator>MarkWik</dc:creator>
    <dc:date>2013-05-07T18:33:31Z</dc:date>
    <item>
      <title>Data quality and management help required?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Data-quality-and-management-help-required/m-p/131888#M26855</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Data-warehouse contains small to very large datasets with over 225 different variables in each data sets. Business requirement says there is only one type of error is expected and that is there would be a missing or blank variable value of any variable in a row, or in many rows, and if found the entire row/rows must be deleted. The catch is the blank could occur in any of the 225 variables and in any row, &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;is there a function or logic in datstep to do it or what do you guys think about Proc Datasets?&lt;/P&gt;&lt;P&gt;It's an &lt;SPAN style="text-decoration: underline;"&gt;&lt;STRONG&gt;appending table&lt;/STRONG&gt;&lt;/SPAN&gt; monthly, quarterly and yearly, therefore the preference is to use automated simple solutions.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 07 May 2013 18:33:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Data-quality-and-management-help-required/m-p/131888#M26855</guid>
      <dc:creator>MarkWik</dc:creator>
      <dc:date>2013-05-07T18:33:31Z</dc:date>
    </item>
    <item>
      <title>Re: Data quality and management help required?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Data-quality-and-management-help-required/m-p/131889#M26856</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;In a data step, you could use the CMISS function, to find a row with any missing values.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Then, delete the row and write the table back to the data warehouse&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 07 May 2013 19:24:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Data-quality-and-management-help-required/m-p/131889#M26856</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2013-05-07T19:24:46Z</dc:date>
    </item>
    <item>
      <title>Re: Data quality and management help required?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Data-quality-and-management-help-required/m-p/131890#M26857</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;So the way I read this was that if ANY of the rows in your dataset have 1 blank value at all, then drop the entire row. Is that correct, or did you mean the entire row has to be blank in order for it to be dropped?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I guess I wil just give you examples of both ways to be safe... &lt;BR /&gt;First: If any columns have a blank variable, then drop that entire row.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data new;&lt;/P&gt;&lt;P&gt;set old;&lt;/P&gt;&lt;P&gt;array varsyay(*) firstcolumn--lastcolumn;&lt;/P&gt;&lt;P&gt;if cmiss(of cvars(*))=0;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Now drop the row ONLY if the entire row is blank.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data new;&lt;/P&gt;&lt;P&gt;set old;&lt;/P&gt;&lt;P&gt;array varsyay(*) firstcolumn--lastcolumn;&lt;/P&gt;&lt;P&gt;if not(cmiss(of varsyay(*)) eq dim(varsyay(*));&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 07 May 2013 19:37:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Data-quality-and-management-help-required/m-p/131890#M26857</guid>
      <dc:creator>Anotherdream</dc:creator>
      <dc:date>2013-05-07T19:37:29Z</dc:date>
    </item>
    <item>
      <title>Re: Data quality and management help required?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Data-quality-and-management-help-required/m-p/131891#M26858</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;And if you have a mixture of numeric and string values combine tge cmiss with Nmiss something like&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Append base= collection data= new_rows(&lt;/P&gt;&lt;P&gt;Where=( NOT sum( 0,&amp;nbsp; cmiss( of _character_),&amp;nbsp; nmiss( of _numeric_) ) ) ) ;&lt;/P&gt;&lt;P&gt;Run ;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 07 May 2013 19:39:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Data-quality-and-management-help-required/m-p/131891#M26858</guid>
      <dc:creator>Peter_C</dc:creator>
      <dc:date>2013-05-07T19:39:57Z</dc:date>
    </item>
  </channel>
</rss>

