<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Optimal number of tracked Columns in SCDs in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/Optimal-number-of-tracked-Columns-in-SCDs/m-p/228546#M5569</link>
    <description>&lt;P&gt;It's only 60T rows so I wouldn't be too worried about performance for creating a digest value for thirty&amp;nbsp;type 2 columns.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Where I have some concerns: If you're using the OOTB SCD Type 2 loader then the other 59970 columns are type 1 and the code generated will also create a digest value for all these columns. I'm not sure if the SCD Type 2 loader has been implemented to deal nicely with such a vast number of columns. Make sure that you check the log for truncation warnings and the like. Eventually consider to implement this as a custom transformation with your own code.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13674"&gt;@LinusH﻿&lt;/a&gt;&amp;nbsp;wrote: May be some re-modelling is required.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Are your business users aware of the fact that they will have to change all their queries adding a selection on "change_current_ind='Y'" or something similar?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Do you know what your business users want this history records for? May-be maintaining a separate history table or a change event table would serve the purpose as well.&lt;/P&gt;</description>
    <pubDate>Mon, 05 Oct 2015 23:11:24 GMT</pubDate>
    <dc:creator>Patrick</dc:creator>
    <dc:date>2015-10-05T23:11:24Z</dc:date>
    <item>
      <title>Optimal number of tracked Columns in SCDs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Optimal-number-of-tracked-Columns-in-SCDs/m-p/228407#M5552</link>
      <description>&lt;P&gt;Hi - I currently have a dimension which has ~60,000 rows and 350 variables. The business want to start tracking history and to turn this dimension into a SCD - so I asked them which variables they want to track and they came back with a list of approx 30 variables - my feeling is that this is too many and will have a significant affect on performance but I have nothing to back that up.&amp;nbsp; Has anyone done any analysis or know if SAS recommend&amp;nbsp;the maximum amount of tracked columns will work efficiently.&lt;/P&gt;&lt;P&gt;I tried to do some searching but have not been able to find anything apart from "The number and length of these columns affects the run-time performance of the job" - so I know it will affect performance but the business want to know what the maximum is so they can work on reducing&amp;nbsp;their list.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any advice would be appreciated.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Rachel&lt;/P&gt;</description>
      <pubDate>Mon, 05 Oct 2015 12:49:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Optimal-number-of-tracked-Columns-in-SCDs/m-p/228407#M5552</guid>
      <dc:creator>rcw68</dc:creator>
      <dc:date>2015-10-05T12:49:47Z</dc:date>
    </item>
    <item>
      <title>Re: Optimal number of tracked Columns in SCDs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Optimal-number-of-tracked-Columns-in-SCDs/m-p/228451#M5557</link>
      <description>Since the dimension has quite few rows, I don't think that 30 columns is too much, at least from a performance view.&lt;BR /&gt;But one can ask why do you have a dimension with 350 cols? What user can grasp this amount of information? For me, it sounds like you have some data modelling tasks comming up.</description>
      <pubDate>Mon, 05 Oct 2015 15:28:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Optimal-number-of-tracked-Columns-in-SCDs/m-p/228451#M5557</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2015-10-05T15:28:49Z</dc:date>
    </item>
    <item>
      <title>Re: Optimal number of tracked Columns in SCDs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Optimal-number-of-tracked-Columns-in-SCDs/m-p/228546#M5569</link>
      <description>&lt;P&gt;It's only 60T rows so I wouldn't be too worried about performance for creating a digest value for thirty&amp;nbsp;type 2 columns.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Where I have some concerns: If you're using the OOTB SCD Type 2 loader then the other 59970 columns are type 1 and the code generated will also create a digest value for all these columns. I'm not sure if the SCD Type 2 loader has been implemented to deal nicely with such a vast number of columns. Make sure that you check the log for truncation warnings and the like. Eventually consider to implement this as a custom transformation with your own code.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13674"&gt;@LinusH﻿&lt;/a&gt;&amp;nbsp;wrote: May be some re-modelling is required.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Are your business users aware of the fact that they will have to change all their queries adding a selection on "change_current_ind='Y'" or something similar?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Do you know what your business users want this history records for? May-be maintaining a separate history table or a change event table would serve the purpose as well.&lt;/P&gt;</description>
      <pubDate>Mon, 05 Oct 2015 23:11:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Optimal-number-of-tracked-Columns-in-SCDs/m-p/228546#M5569</guid>
      <dc:creator>Patrick</dc:creator>
      <dc:date>2015-10-05T23:11:24Z</dc:date>
    </item>
    <item>
      <title>Re: Optimal number of tracked Columns in SCDs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Optimal-number-of-tracked-Columns-in-SCDs/m-p/228556#M5572</link>
      <description>&lt;P&gt;Elaborating a bit more.&lt;/P&gt;&lt;P&gt;Business&amp;nbsp;stakeholders are seldom interested in history records,&amp;nbsp;especially when it comes to labels/names.&lt;/P&gt;&lt;P&gt;Sounds like you don't have a detail data store - which is the&amp;nbsp;natural place to keep a complete&amp;nbsp;history record.&lt;/P&gt;&lt;P&gt;Then you can more easily design and create&amp;nbsp;data&amp;nbsp;marts (dimensions and facts) that business requires at the moment. This will hopefully lead to&amp;nbsp;smaller and more easy to use data&amp;nbsp;patterns (and not&amp;nbsp;so much historization, and fewer columns).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Btw, by not track changes as Type 2, it does not necessary mean classifying them as type 1, it just means that&amp;nbsp;they are only&amp;nbsp;updated when any type 2 columns for the row has new value.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Oct 2015 00:18:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Optimal-number-of-tracked-Columns-in-SCDs/m-p/228556#M5572</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2015-10-06T00:18:01Z</dc:date>
    </item>
  </channel>
</rss>

