<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Proc migrate improvement in Administration and Deployment</title>
    <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846100#M25566</link>
    <description>&lt;P&gt;I would set up a method to copy (sftp) several files in parallel, and then (also parallel) run data steps to do the conversion.&lt;/P&gt;
&lt;P&gt;Before that, run tests to see if CEDA (using the files as is) is feasible in production use.&lt;/P&gt;</description>
    <pubDate>Thu, 24 Nov 2022 09:53:43 GMT</pubDate>
    <dc:creator>Kurt_Bremser</dc:creator>
    <dc:date>2022-11-24T09:53:43Z</dc:date>
    <item>
      <title>Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845892#M25555</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;i'm currently in a migration moving from AIX to UNIX (RHEL) and we need to migrate our sas dataset files from source to new server, since there is an incopatibility between filesystems, and we're using PROC MIGRATE mapping source libraries on new server RHEL, PROC MIGRATE runs about an average of 150GB per PROC running ( i've seen i can run more than 5 PROCs at the same time without problem) and this tooks arround 10-20% of the cpu consumption, my question, is there any posibility to increase the ratio or the resurces the PROC takes to increase the performance and get better migration rates? my goal would be 300GB/h in order to have all the data migrated in 1 day.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If any one has encounter this situation during a migration may can help!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Wed, 23 Nov 2022 11:36:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845892#M25555</guid>
      <dc:creator>W1ndwaker</dc:creator>
      <dc:date>2022-11-23T11:36:58Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845916#M25557</link>
      <description>&lt;P&gt;I take it that you have to move your data over the network. 300 GB/h translates roughly to 85 MB/s, which means you need a 1 Gbit network bandwidth throughout the whole distance, including all switches/firewalls.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 23 Nov 2022 12:50:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845916#M25557</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2022-11-23T12:50:39Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845946#M25558</link>
      <description>I supose bandwith is not a problem, since i got 6 simultaneous proces running giving a maximum 720GB/h at peak performance, the problem comes when a single directory (or library as you wanna call it) has a big ammount of data like 2TB of data, then I can only run 1 proc migrate on that folder and this is where the bottleneck is for me, if i can get a single proc migrate to a 300GB would be great for folders about 2.5TB&lt;BR /&gt;&lt;BR /&gt;Thanks for the reply btw!!</description>
      <pubDate>Wed, 23 Nov 2022 14:37:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845946#M25558</guid>
      <dc:creator>W1ndwaker</dc:creator>
      <dc:date>2022-11-23T14:37:19Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845955#M25559</link>
      <description>&lt;P&gt;I suspect that the MIGRATE procedure is single-threaded, which would explain the limitation.&lt;/P&gt;
&lt;P&gt;Have you tried this:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;copy the .sas7bdat file from one server to the other&lt;/LI&gt;
&lt;LI&gt;run a simple data step on the target server to rewrite the dataset, making one-time use of CEDA&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Wed, 23 Nov 2022 14:49:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845955#M25559</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2022-11-23T14:49:41Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845958#M25560</link>
      <description>Yep i did, we ponder between the methods to migrate the whole data and it works probably faster, there ara 2 more point, first, you need to move the whole data from A to B and this will take a lot of time also, and then proceed with a DATA step on every single dataset, considering we can have thousands.&lt;BR /&gt;&lt;BR /&gt;Resuming the best option seeing all the posibilities it's to migrate from the source directory mapped on the new server but will be great to have a multithread proces (that i think it uses while creating the index of the migrated table).&lt;BR /&gt;&lt;BR /&gt;Reading the doc didn't saw any option rather than buffersize and this doesn't seem have any effect on the process</description>
      <pubDate>Wed, 23 Nov 2022 14:59:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/845958#M25560</guid>
      <dc:creator>W1ndwaker</dc:creator>
      <dc:date>2022-11-23T14:59:23Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846010#M25561</link>
      <description>&lt;P&gt;SAS/CONNECT and PROC UPLOAD is another way of migrating between SAS installations. You do need the product installed and licensed on both SAS installations though. Also there are the Base SAS CPORT / CIMPORT procedures which can do whole SAS libraries in one process.&lt;/P&gt;</description>
      <pubDate>Wed, 23 Nov 2022 20:21:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846010#M25561</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2022-11-23T20:21:37Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846089#M25564</link>
      <description>Actually I've already try all this methods, getting to the point that migrate is the best way, cport/cimport is more trikier since you have to do 2 operation for the same result as a migrate, the thing is the stopper on proc migrate to work at a capped speed, I also tryed to modify sasv9.cfg performance options to give more resources for the session in ordre to get more GB/h during the proc migrate, but not working out.</description>
      <pubDate>Thu, 24 Nov 2022 07:51:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846089#M25564</guid>
      <dc:creator>W1ndwaker</dc:creator>
      <dc:date>2022-11-24T07:51:33Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846100#M25566</link>
      <description>&lt;P&gt;I would set up a method to copy (sftp) several files in parallel, and then (also parallel) run data steps to do the conversion.&lt;/P&gt;
&lt;P&gt;Before that, run tests to see if CEDA (using the files as is) is feasible in production use.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2022 09:53:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846100#M25566</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2022-11-24T09:53:43Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846203#M25569</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/423342"&gt;@W1ndwaker&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;I supose bandwith is not a problem, since i got 6 simultaneous proces running giving a maximum 720GB/h at peak performance, the problem comes when a single directory (or library as you wanna call it) has a big ammount of data like 2TB of data, then I can only run 1 proc migrate on that folder and this is where the bottleneck is for me, if i can get a single proc migrate to a 300GB would be great for folders about 2.5TB&lt;BR /&gt;&lt;BR /&gt;Thanks for the reply btw!!&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Sound like the bottle neck is the speed of reading from that location. Or perhaps more likely the problem is you are writing them all to same target location, as writing takes more time that reading since you get no boost from cacheing.&amp;nbsp; &amp;nbsp;Can you split up the files into batches and write to different physical output disks?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2022 18:13:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846203#M25569</guid>
      <dc:creator>Tom</dc:creator>
      <dc:date>2022-11-24T18:13:24Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846210#M25570</link>
      <description>&lt;P&gt;Definitely worth exploring parallel processing in that case.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2022 19:04:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846210#M25570</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2022-11-24T19:04:26Z</dc:date>
    </item>
    <item>
      <title>Re: Proc migrate improvement</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846254#M25574</link>
      <description>After some more test i get to the point that the proc migrate cannot go faster than a certain speed per thread, i found a param, cpucount to force the utilization of all available cpus in the systems, this combined with a moment of no work on the source disk gave me a 220GB/h, quite good, but in the end the target server was running about 20%of cpu usage, if i put 2 proc migrates i get the double rate, 430GB/h and 40% cpu usage.&lt;BR /&gt;&lt;BR /&gt;In the end i supose the best way is spliting the big directories, i have few of them with more than 300 files and 2-3 TB each, if 1 proc takes arround 10-12 hours to finish if i split the folder on 3 subfolders i get this to 3-4 hours and the system runs correctly without errors, this is where we will go for the migration day.</description>
      <pubDate>Fri, 25 Nov 2022 08:00:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Proc-migrate-improvement/m-p/846254#M25574</guid>
      <dc:creator>W1ndwaker</dc:creator>
      <dc:date>2022-11-25T08:00:29Z</dc:date>
    </item>
  </channel>
</rss>

