<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic create fake/test data with certain attributes of original data in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/create-fake-test-data-with-certain-attributes-of-original-data/m-p/151355#M29879</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi everybody.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a quick question on how to efficiently create a test data set.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I am working remotely with confidential data and I would like to create a fake data set with the same data structure so that I can do some programming on my local computer. As many tables are potentially affected, I would like to automate this procedure as far as possible. Basically, I would like to do the following:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;1. generate some code that creates a data set with the same variables/data types (a simple proc sql describe table may do?)&lt;/P&gt;&lt;P&gt;2. save the mean, std, and correlations of the numeric variables, and detect which of the other variables are string types.&lt;/P&gt;&lt;P&gt;3. create random data with the same statistical attributes as the original data.&lt;/P&gt;&lt;P&gt;4. the code should be flexible so that I can adjust it easily for different tables.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any help is highly appreciated!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Daniel&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Sun, 22 Jun 2014 19:52:08 GMT</pubDate>
    <dc:creator>danmet</dc:creator>
    <dc:date>2014-06-22T19:52:08Z</dc:date>
    <item>
      <title>create fake/test data with certain attributes of original data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/create-fake-test-data-with-certain-attributes-of-original-data/m-p/151355#M29879</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi everybody.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a quick question on how to efficiently create a test data set.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I am working remotely with confidential data and I would like to create a fake data set with the same data structure so that I can do some programming on my local computer. As many tables are potentially affected, I would like to automate this procedure as far as possible. Basically, I would like to do the following:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;1. generate some code that creates a data set with the same variables/data types (a simple proc sql describe table may do?)&lt;/P&gt;&lt;P&gt;2. save the mean, std, and correlations of the numeric variables, and detect which of the other variables are string types.&lt;/P&gt;&lt;P&gt;3. create random data with the same statistical attributes as the original data.&lt;/P&gt;&lt;P&gt;4. the code should be flexible so that I can adjust it easily for different tables.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any help is highly appreciated!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Daniel&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 22 Jun 2014 19:52:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/create-fake-test-data-with-certain-attributes-of-original-data/m-p/151355#M29879</guid>
      <dc:creator>danmet</dc:creator>
      <dc:date>2014-06-22T19:52:08Z</dc:date>
    </item>
    <item>
      <title>Re: create fake/test data with certain attributes of original data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/create-fake-test-data-with-certain-attributes-of-original-data/m-p/151356#M29880</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The first part is easy. Reading zero observations copies the variable definitions without any rows:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data want;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; set have (obs=0);&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Creating fake data with a matching mean and std is quite possible. Correlation I'm not so sure of, especially if you want to maintain the correlation from variable x to y1, y2, y3, etc. Once you create the dummy X, you have a lot of constraints on the other variables, which will be an interesting challenge.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 23 Jun 2014 14:54:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/create-fake-test-data-with-certain-attributes-of-original-data/m-p/151356#M29880</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2014-06-23T14:54:01Z</dc:date>
    </item>
    <item>
      <title>Re: create fake/test data with certain attributes of original data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/create-fake-test-data-with-certain-attributes-of-original-data/m-p/151357#M29881</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Could you not just anonymize the data you have, so that anything identifiable is replaced with some code + random number?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 23 Jun 2014 15:00:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/create-fake-test-data-with-certain-attributes-of-original-data/m-p/151357#M29881</guid>
      <dc:creator>RW9</dc:creator>
      <dc:date>2014-06-23T15:00:18Z</dc:date>
    </item>
  </channel>
</rss>

