Hi Folks,
We are using SAS 9.4M6 with having multiple tier architecture (3 meta, 6 compute and 2 web nodes - all in Linux). We have been facing issues frequently with the Web Infrastructure services getting down again and again, as due to High Availability the system tries to start the services in other compute nodes than the existing (it may be due a network glitch that the server might not be providing the correct status of service's availability). Once it tries to start the WIP services in another compute node with a new PID, this locks/corrupts the WIP database as it does not match with the existing PID added in the postmaster.pid file. So, every time we have to run the pg_resetxlog to fix the problem.
Is there any suggestion how this issue can be fixed so that we do not need to reset the lock file every time to make the WIP services and hence the web services (SAS Studio) working?