BookmarkSubscribeRSS Feed
kcpython
Obsidian | Level 7

Hello, how can I configure the Grid to automatically kill jobs that run longer than a week? Or am I going to have to write a bash script? 🙂

2 REPLIES 2
JuanS_OCS
Amethyst | Level 16

Hello @kcpython,

 

well, you can very well do this taks with a script, but LSF can handle orphan jobs as well:

 

https://www.ibm.com/support/knowledgecenter/SSETD4_9.1.3/lsf_admin/orphan_job_termination.html

 

LSF administrator enables the feature at the cluster level 
by defining a cluster-wide termination grace period
with the parameter ORPHAN_JOB_TERM_GRACE_PERIOD in the lsb.params file.
The cluster-wide termination grace period applies to all dependent jobs in the cluster.

You can also read:

 

http://sunray2.mit.edu/kits/platform-lsf/7.0.6/1/guides/kit_lsf_guide_source/admin/job_ops.html 

kcpython
Obsidian | Level 7

What defines a job as an orphan? Or is setting a run time wall a more appropriate way of handling this?

suga badge.PNGThe SAS Users Group for Administrators (SUGA) is open to all SAS administrators and architects who install, update, manage or maintain a SAS deployment. 

Join SUGA 

Get Started with SAS Information Catalog in SAS Viya

SAS technical trainer Erin Winters shows you how to explore assets, create new data discovery agents, schedule data discovery agents, and much more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1355 views
  • 0 likes
  • 2 in conversation