02-28-2018 07:21 AM
I am using RSUBMIT method to distribute the jobs parallel. I have scenario where I am able to find the solution.
1) I submitted the RSUBMIT blocks and distributed the jobs to different grids
2) due to some reason middle of the programs execution one of the grid node is unavailable/failed
Is there any way to find the unavailable/unresponsive/ Idle grid node?
04-02-2018 06:28 PM
Can you share your log or the error message you're seeing?
Do you have access to SSH to run commands (like bhosts)?