<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Cannot push job to specific host in grid environment in Administration and Deployment</title>
    <link>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/450312#M12779</link>
    <description>&lt;P&gt;Notwithstanding the fact that you already have a solution I'd like to share my 2 cents and explain how we go about similar tasks.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We have a job flow that needs inordinate amounts of SASWORK. We have multiple grid compute nodes. Most have lots of SASWORK with insane speed but one has twice that at ludicrous speed. We direct the jobs to that server by defining a resource called LargeWork in LSF and configuring that host as providing that resource. Then, when scheduling a job, you can add the required resource LargeWork to the schedule definition and Process Manager will always direct that job to that specific host.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Resources are defined in the file &lt;SPAN class="ph filepath"&gt;lsf.shared. In lsf.cluster.&lt;VAR class="keyword varname ibm-item-note-alternate"&gt;cluster_name&lt;/VAR&gt; you add the resource name to the desired host(s). Do remember to apply the changes with the lsadmin reconfig and badmin reconfig commands.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="ph filepath"&gt;And to second Juan's observation: do youreally&amp;nbsp; have a metadata server that doubles as a grid compute node? &lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="ph filepath"&gt;And as far as your metadata server being a single point of failure: have a look at clustering it over multiple hosts. It was a life saver for us many times.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="ph filepath"&gt;Regards,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="ph filepath"&gt;- Jan.&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 02 Apr 2018 09:23:10 GMT</pubDate>
    <dc:creator>jklaverstijn</dc:creator>
    <dc:date>2018-04-02T09:23:10Z</dc:date>
    <item>
      <title>Cannot push job to specific host in grid environment</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/448322#M12673</link>
      <description>&lt;P&gt;We have SAS 9.4 in a grid environment (Linux).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When I run a job in batch mode (&lt;STRONG&gt;sas test.sas&lt;/STRONG&gt;) and try to dispatch it to a specific host (&lt;STRONG&gt;amdusa.company.com&lt;/STRONG&gt;) using the statement below, it does not run on &lt;STRONG&gt;amdusa.company.com&lt;/STRONG&gt; but on a different grid node. No other users' jobs are being dispatched to this host either.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;options metaserver=amdusa.company.com metaport=12345 metauser=userid metapass=xxx;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The &lt;STRONG&gt;amdusa.company.com&lt;/STRONG&gt; host is properly defined in &lt;STRONG&gt;lsb.hosts&lt;/STRONG&gt; and is listed in the LSF_MASTER_LIST and LSF_SERVER_HOSTS parameters in &lt;STRONG&gt;lsf.conf&lt;/STRONG&gt;. The "bhosts" command also shows an "ok" status for host "&lt;STRONG&gt;amdusa.company.com&lt;/STRONG&gt;".&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The job runs fine locally with "./sas -nodms" on &lt;STRONG&gt;amdusa.company.com&lt;/STRONG&gt;.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have checked the configuration but nothing appears to be missing.&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 20:42:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/448322#M12673</guid>
      <dc:creator>woo</dc:creator>
      <dc:date>2018-03-23T20:42:40Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot push job to specific host in grid environment</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/448496#M12685</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/26689"&gt;@woo&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is a great question, and quite an interesting one. Do you have a High-Availability (HA) setup in your grid environment?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I can well imagine that your load balancer (physical or EGO) believes this host is down and is redirecting its load to another node in the grid. You could check this in EGO, in RTM, or on your physical load balancer (with your IT team).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Another option is to check the resource configuration for this host: queue configuration, queue length, jobs that can run, queue status (maybe full), etc.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In any case, your host does not seem to have a problem per se, since you can run SAS code on it locally.&lt;/P&gt;
&lt;P&gt;However, when you submit the code as a grid job, the job is directed to another host. That is why I would start from the assumption that the problem lies either in your HA configuration (EGO, Load Balancer, RTM) or in how the node is registered in the grid (bhosts, lshosts, RTM).&lt;/P&gt;</description>
      <pubDate>Sun, 25 Mar 2018 14:31:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/448496#M12685</guid>
      <dc:creator>JuanS_OCS</dc:creator>
      <dc:date>2018-03-25T14:31:37Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot push job to specific host in grid environment</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/449080#M12697</link>
      <description>&lt;P&gt;Thanks a lot Juan, we are looking into it. I am not sure what you mean by HA in the environment, but we have a grid environment with 18 to 20 servers and one metadata server. We do not have automatic failover or a standby server if the metadata server fails; we troubleshoot manually and bring it back up. Thanks.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Mar 2018 17:56:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/449080#M12697</guid>
      <dc:creator>woo</dc:creator>
      <dc:date>2018-03-27T17:56:37Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot push job to specific host in grid environment</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/449235#M12718</link>
      <description>&lt;P&gt;Hi again&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/26689"&gt;@woo&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hmm, is amdusa.company.com a metadata server and also a grid slave node or master node?&lt;/P&gt;</description>
      <pubDate>Wed, 28 Mar 2018 07:22:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/449235#M12718</guid>
      <dc:creator>JuanS_OCS</dc:creator>
      <dc:date>2018-03-28T07:22:22Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot push job to specific host in grid environment</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/450289#M12778</link>
      <description>&lt;P&gt;We came across a script in our LSF directory structure that defines an environment variable with a couple of different values (queue names). It contains an "if" statement that decides where a job should go based on the user's bash profile. We created a new queue for just that specific host, and a test job ran on that host as expected. We then put the original script back in place, added the host back to the master server list, and it started working fine.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So something may have gone wrong when we took that host out of the master server list for maintenance and then put it back. Everything seems normal now. Thanks for your help, I appreciate your time.&lt;/P&gt;</description>
      <pubDate>Mon, 02 Apr 2018 02:57:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/450289#M12778</guid>
      <dc:creator>woo</dc:creator>
      <dc:date>2018-04-02T02:57:17Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot push job to specific host in grid environment</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/450312#M12779</link>
      <description>&lt;P&gt;Notwithstanding the fact that you already have a solution I'd like to share my 2 cents and explain how we go about similar tasks.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We have a job flow that needs inordinate amounts of SASWORK. We have multiple grid compute nodes. Most have lots of SASWORK with insane speed but one has twice that at ludicrous speed. We direct the jobs to that server by defining a resource called LargeWork in LSF and configuring that host as providing that resource. Then, when scheduling a job, you can add the required resource LargeWork to the schedule definition and Process Manager will always direct that job to that specific host.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Resources are defined in the file &lt;SPAN class="ph filepath"&gt;lsf.shared. In lsf.cluster.&lt;VAR class="keyword varname ibm-item-note-alternate"&gt;cluster_name&lt;/VAR&gt; you add the resource name to the desired host(s). Do remember to apply the changes with the lsadmin reconfig and badmin reconfig commands.&lt;/SPAN&gt;&lt;/P&gt;
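&lt;P&gt;As a rough sketch of the two entries described above (not our actual configuration): the host name bigwork01 and the description text are made up for illustration, and the column layout follows the usual LSF templates, so check it against the files in your own cluster.&lt;/P&gt;
&lt;PRE&gt;# Resource definition file: declare the Boolean resource (description text is hypothetical)
Begin Resource
RESOURCENAME  TYPE     INTERVAL  INCREASING  DESCRIPTION
LargeWork     Boolean  ()        ()          (Host with extra SASWORK)
End Resource

# lsf.cluster.cluster_name: attach the resource to the host (bigwork01 is hypothetical)
Begin Host
HOSTNAME   model  type  server  RESOURCES
bigwork01  !      !     1       (LargeWork)
End Host&lt;/PRE&gt;
&lt;P&gt;After reconfiguring, a job that requests the resource, for example with bsub -R "select[LargeWork]", should only be eligible to run on hosts that provide it.&lt;/P&gt;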
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="ph filepath"&gt;And to second Juan's observation: do youreally&amp;nbsp; have a metadata server that doubles as a grid compute node? &lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="ph filepath"&gt;And as far as your metadata server being a single point of failure: have a look at clustering it over multiple hosts. It was a life saver for us many times.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="ph filepath"&gt;Regards,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="ph filepath"&gt;- Jan.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 02 Apr 2018 09:23:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/450312#M12779</guid>
      <dc:creator>jklaverstijn</dc:creator>
      <dc:date>2018-04-02T09:23:10Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot push job to specific host in grid environment</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/466265#M13278</link>
      <description>&lt;P&gt;Thanks Jan, I appreciate your input. It makes perfect sense.&lt;/P&gt;</description>
      <pubDate>Wed, 30 May 2018 21:46:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Cannot-push-job-to-specific-host-in-grid-environment/m-p/466265#M13278</guid>
      <dc:creator>woo</dc:creator>
      <dc:date>2018-05-30T21:46:55Z</dc:date>
    </item>
  </channel>
</rss>

