<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic SAS GRID Controller is failed and nodes are not taking control in Administration and Deployment</title>
    <link>https://communities.sas.com/t5/Administration-and-Deployment/SAS-GRID-Controller-is-failed-and-nodes-are-not-taking-control/m-p/844299#M25500</link>
    <description>&lt;P&gt;Hi team ,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The Grid Control Server(26) is failed due to &amp;nbsp;a physical hardware/Network/Power failure . The rest of installation is available ( including the shared file system ) are up and running.&lt;/P&gt;
&lt;P&gt;The common answer is the other nodes would take over. &lt;STRONG&gt;But in our case when are trying to make node as controller it is not becoming . &lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;LSF services on nodes are running fine but when we tried to login as egosh services ,it is failing with below error.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Error:- &lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;[sasinst@lxxxxt27 ~]$ egosh user logon -u Admin -x Admin&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Cannot contact the master host. If you cannot start the cluster&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;successfully, refer to VEMKD and LIM log files for troubleshooting&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;information&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;I have started the lsadmin&amp;nbsp; and bsadmin services on 27 28 29 (rest of the node ) and then tried to login to egosh services but it is giving above error .&lt;/P&gt;</description>
    <pubDate>Tue, 15 Nov 2022 08:51:51 GMT</pubDate>
    <dc:creator>MG18</dc:creator>
    <dc:date>2022-11-15T08:51:51Z</dc:date>
    <item>
      <title>SAS GRID Controller is failed and nodes are not taking control</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/SAS-GRID-Controller-is-failed-and-nodes-are-not-taking-control/m-p/844299#M25500</link>
      <description>&lt;P&gt;Hi team ,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The Grid Control Server(26) is failed due to &amp;nbsp;a physical hardware/Network/Power failure . The rest of installation is available ( including the shared file system ) are up and running.&lt;/P&gt;
&lt;P&gt;The common answer is the other nodes would take over. &lt;STRONG&gt;But in our case when are trying to make node as controller it is not becoming . &lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;LSF services on nodes are running fine but when we tried to login as egosh services ,it is failing with below error.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Error:- &lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;[sasinst@lxxxxt27 ~]$ egosh user logon -u Admin -x Admin&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Cannot contact the master host. If you cannot start the cluster&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;successfully, refer to VEMKD and LIM log files for troubleshooting&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;information&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;I have started the lsadmin&amp;nbsp; and bsadmin services on 27 28 29 (rest of the node ) and then tried to login to egosh services but it is giving above error .&lt;/P&gt;</description>
      <pubDate>Tue, 15 Nov 2022 08:51:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/SAS-GRID-Controller-is-failed-and-nodes-are-not-taking-control/m-p/844299#M25500</guid>
      <dc:creator>MG18</dc:creator>
      <dc:date>2022-11-15T08:51:51Z</dc:date>
    </item>
    <item>
      <title>Re: SAS GRID Controller is failed and nodes are not taking control</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/SAS-GRID-Controller-is-failed-and-nodes-are-not-taking-control/m-p/844363#M25501</link>
      <description>&lt;P&gt;The lsf.conf file and the ego.conf file have parameters for the list of master hosts (LSF_MASTER_LIST &amp;amp; EGO_MASTER_LIST). The value of these variables is a space separated list of host names in order of priority.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If the dead master machine is the only host listed by these variables, LSF does not know what to do.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For example, if you have EGO_MASTER_LIST="myhost1.mydomain.com" then when myhost1 goes down, LSF does not know who to make the master. If, on the other hand, EGO_MASTER_LIST="myhost1.mydomain.com myhost2.mydomain.com", when myhost1 goes down, myhost2 becomes the master.&lt;/P&gt;</description>
      <pubDate>Tue, 15 Nov 2022 12:46:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/SAS-GRID-Controller-is-failed-and-nodes-are-not-taking-control/m-p/844363#M25501</guid>
      <dc:creator>doug_sas</dc:creator>
      <dc:date>2022-11-15T12:46:57Z</dc:date>
    </item>
  </channel>
</rss>

