<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: SAS and Data Quality Dimensions in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/601100#M173826</link>
    <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;: I think, you mean this link&amp;nbsp;&lt;A href="https://blogs.sas.com/content/sascom/tag/bcbs-239/" target="_blank" rel="noopener"&gt;https://blogs.sas.com/content/sascom/tag/bcbs-239/&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;I have read these 14 principles, I am interested by their&amp;nbsp; implementation in SAS.&lt;/P&gt;
&lt;P&gt;Thereby, could&amp;nbsp;the scope of the data quality for the Banking credit risk data, defined by the banking supervisory,&amp;nbsp; "became"&amp;nbsp; the same as “CDEs=Critical Data elements” ? in order to limit the scope of the data quality.&lt;/P&gt;</description>
    <pubDate>Fri, 01 Nov 2019 21:25:27 GMT</pubDate>
    <dc:creator>LineMoon</dc:creator>
    <dc:date>2019-11-01T21:25:27Z</dc:date>
    <item>
      <title>SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599528#M173070</link>
      <description>&lt;P&gt;Hello Experts,&lt;/P&gt;
&lt;P&gt;Please, I would like to know, if you have created some sas programs and templates for the data quality Dimensions :&amp;nbsp;COMPLETENESS,UNIQUENESS,TIMELINESS,VALIDITY,ACCURACY,CONSISTENCY&lt;/P&gt;
&lt;P&gt;In order to enhance the data quality reports ?&lt;/P&gt;
&lt;P&gt;Are there any SAS implementions for the data quality dimensions as specified in the standard :&amp;nbsp; BCBS 239 ?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 26 Oct 2019 13:22:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599528#M173070</guid>
      <dc:creator>LineMoon</dc:creator>
      <dc:date>2019-10-26T13:22:06Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599529#M173071</link>
      <description>&lt;P&gt;Internet searches for&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;SAS BCBS 239&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;finds lots of hits&lt;/P&gt;</description>
      <pubDate>Sat, 26 Oct 2019 13:28:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599529#M173071</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2019-10-26T13:28:41Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599558#M173081</link>
      <description>&lt;P&gt;Thank very much for your answer.&lt;/P&gt;
&lt;P&gt;As a part of the creation of the six data quality Dimensions (COMPLETENESS, UNIQUENESS, TIMELINESS, VALIDITY, ACCURACY, CONSISTENCY) via SAS,&amp;nbsp;&amp;nbsp;there are three components that will be involved in the process of the generation of the Data Quality Report:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Input Design&lt;/LI&gt;
&lt;LI&gt;SAS Program&lt;/LI&gt;
&lt;LI&gt;Output (Data quality Report)&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It seems clear that, there is a strong dependency between the &lt;STRONG&gt;Input Design &lt;/STRONG&gt;and &lt;STRONG&gt;SAS program &lt;/STRONG&gt;(to run with less coding and keeping more transparency). In order to create by SAS an automatic Data Quality report.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;Consequently, are there any general solution that have been proposed for both the &lt;STRONG&gt;Input Design &lt;/STRONG&gt;and &lt;STRONG&gt;SAS program?&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 26 Oct 2019 18:53:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599558#M173081</guid>
      <dc:creator>LineMoon</dc:creator>
      <dc:date>2019-10-26T18:53:10Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599603#M173106</link>
      <description>For starters are you referring to a specific product, such as SAS Data Quality Studio or Data Integration Studio? Or are you referring to using Base SAS to do data quality management? Given your last question, I'm assuming the latter and my further comments are based on this assumption. &lt;BR /&gt;&lt;BR /&gt;There is a dedicated SAS product designed to characterize/profile data quality measures. In general though, there needs to be some subject matter expertise applied, because you need to specify both the data and the rules and you need to develop a system to do so. I've had to this before by the way, building a data quality report and system to transfer data to different government entities to meet their rules. Without Data Integration studio sadly...didn't know enough at the time to have used that instead anyways!</description>
      <pubDate>Sun, 27 Oct 2019 03:22:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599603#M173106</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-10-27T03:22:55Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599663#M173138</link>
      <description>&lt;P&gt;Thank you very much.&lt;/P&gt;
&lt;P&gt;I am&amp;nbsp; referring to the use of the&amp;nbsp; Base SAS to do data quality report with the six data quality Dimensions: COMPLETENESS, UNIQUENESS, TIMELINESS, VALIDITY, ACCURACY, CONSISTENCY&lt;/P&gt;</description>
      <pubDate>Sun, 27 Oct 2019 17:47:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599663#M173138</guid>
      <dc:creator>LineMoon</dc:creator>
      <dc:date>2019-10-27T17:47:25Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599686#M173160</link>
      <description>Ok, so yeah, you need to define your rules and write your programs. They will vary based on input. For example, for some of our data the timeliness is 24 hours and for others its 30 days. You can only know that by knowing the business rules. &lt;BR /&gt;&lt;BR /&gt;So pick a data system and start there by first establishing the business rules around how data are collected. You then translate that to the dimensions above, ie completeness for 'name' type fields would be mean that First Name and LastName are always filled out but MiddleName can be blank. &lt;BR /&gt;Uniqueness means that each transaction ID should be unique for example - this may not be true in all systems, especially if you have a transaction system that allows for returns. In that case, the TransactionID and Type (purchase or return) form the unique key to be checked. &lt;BR /&gt;&lt;BR /&gt;I don't recall the definitions of the other terms at the top of my head but hopefully this helps you get started. &lt;BR /&gt;&lt;BR /&gt;This paper walks through some of it and you can also search on lexjansen.com for more examples:&lt;BR /&gt;&lt;A href="https://support.sas.com/resources/papers/proceedings/proceedings/sugi29/093-29.pdf" target="_blank"&gt;https://support.sas.com/resources/papers/proceedings/proceedings/sugi29/093-29.pdf&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Note that is not a trivial task, but can be done, it just takes some time. &lt;BR /&gt;&lt;BR /&gt;There's also a SAS code book that can help you get there faster, the macro is in the library. &lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/Library-Datasets-Summary-Macro-DATA-SPECS/ta-p/544757" target="_blank"&gt;https://communities.sas.com/t5/SAS-Communities-Library/Library-Datasets-Summary-Macro-DATA-SPECS/ta-p/544757&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;You can probably use that to help build your rules.</description>
      <pubDate>Sun, 27 Oct 2019 21:24:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599686#M173160</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-10-27T21:24:22Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599944#M173288</link>
      <description>&lt;P&gt;Thank you very much for these useful and details information.&lt;/P&gt;
&lt;P&gt;That’s very kind from you.&lt;/P&gt;
&lt;P&gt;AS you said ““that is not a trivial task, but can be done, it just takes some time”&lt;/P&gt;
&lt;P&gt;&amp;nbsp;That’s right; and, in order to find a general solution, the key could be in the “input Design” that will be running in SAS Program.&lt;/P&gt;
&lt;P&gt;As you know, the input design implemented the business rules and easily&amp;nbsp; understood by the functional and Technical; and loaded with less complexity in SAS.&lt;/P&gt;
&lt;P&gt;Therefore, in many cases, the inputs design, it is combination of three components:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Set of fields to apply the data quality&lt;/LI&gt;
&lt;LI&gt;Data Quality Dimensions to apply: COMPLETENESS, UNIQUENESS, TIMELINESS, VALIDITY, ACCURACY, CONSISTENCY&lt;/LI&gt;
&lt;LI&gt;Set of reference variables used in the compare.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;The appearance of the three components in the “input design” in a smart way could solve the issue &amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Oct 2019 21:55:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/599944#M173288</guid>
      <dc:creator>LineMoon</dc:creator>
      <dc:date>2019-10-28T21:55:21Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/600095#M173371</link>
      <description>Yes, but that also takes time and you need to know the rules that could even come into play. My DQ programs for clinical trials and health care were different from the data quality rules when I was n agriculture. But similar idea. &lt;BR /&gt;&lt;BR /&gt;When I built similar systems I put the rules/logic in an Excel file and had the business drive the rules and I just implemented them. &lt;BR /&gt;&lt;BR /&gt;Thinking about all of the different possible cases isn't trivial, which is why there's a dedicated SAS product that does this, SAS Data Quality Studio. &lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 29 Oct 2019 15:25:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/600095#M173371</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-10-29T15:25:45Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/600214#M173428</link>
      <description>&lt;P&gt;As for input , That's right . Instead, the Output design could have nearly the same format ?&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2019 23:43:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/600214#M173428</guid>
      <dc:creator>LineMoon</dc:creator>
      <dc:date>2019-10-29T23:43:03Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/600227#M173435</link>
      <description>Sure it could.</description>
      <pubDate>Wed, 30 Oct 2019 01:19:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/600227#M173435</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-10-30T01:19:35Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/600899#M173768</link>
      <description>&lt;P&gt;In order to apply the 6 Data quality dimensions on N variables(for exemple : 1000 variables),&lt;/P&gt;
&lt;P&gt;How much time do you estimate to do this job?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Is there any method for estimating the time necessary to do this job ?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 01 Nov 2019 08:42:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/600899#M173768</guid>
      <dc:creator>LineMoon</dc:creator>
      <dc:date>2019-11-01T08:42:23Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/601100#M173826</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;: I think, you mean this link&amp;nbsp;&lt;A href="https://blogs.sas.com/content/sascom/tag/bcbs-239/" target="_blank" rel="noopener"&gt;https://blogs.sas.com/content/sascom/tag/bcbs-239/&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;I have read these 14 principles, I am interested by their&amp;nbsp; implementation in SAS.&lt;/P&gt;
&lt;P&gt;Thereby, could&amp;nbsp;the scope of the data quality for the Banking credit risk data, defined by the banking supervisory,&amp;nbsp; "became"&amp;nbsp; the same as “CDEs=Critical Data elements” ? in order to limit the scope of the data quality.&lt;/P&gt;</description>
      <pubDate>Fri, 01 Nov 2019 21:25:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/601100#M173826</guid>
      <dc:creator>LineMoon</dc:creator>
      <dc:date>2019-11-01T21:25:27Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/601156#M173837</link>
      <description>Depends on the variables. If 25 are diagnosis codes for example, you can look at them all together, and there's probably similar functionality in banking data. For example, all amount fields can be looked at together. &lt;BR /&gt;&lt;BR /&gt;I would say you would first go through and group variables together and identify which need to checked and which ones don't and which go together. Then you have the subset you need to check. I'd guess you're looking at 20/30 minutes per variable here to make decisions and ensure things are correct and not accounting for extra follow up time. Some variables may take a minute to check and verify and some, like an account type that's changed over time and has old and new types may take a few hours so 20/30 minutes on average. Usually you assume 5 productive hours a day. You can do the math from there.</description>
      <pubDate>Sat, 02 Nov 2019 17:41:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/601156#M173837</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-11-02T17:41:38Z</dc:date>
    </item>
    <item>
      <title>Re: SAS and Data Quality Dimensions</title>
      <link>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/601185#M173849</link>
      <description>&lt;P&gt;Thank you very much.&lt;/P&gt;
&lt;P&gt;It's a good idea&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 02 Nov 2019 23:13:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/SAS-and-Data-Quality-Dimensions/m-p/601185#M173849</guid>
      <dc:creator>LineMoon</dc:creator>
      <dc:date>2019-11-02T23:13:05Z</dc:date>
    </item>
  </channel>
</rss>

