This is more of a runbook question: which problems can arise and how to deal with them. Different sites run into different problems depending on how SAS is consumed (batch, end users, solution-based deployments and so on), so I don't think there can be a one-size-fits-all document covering platform-specific risks. A few common ones are certainly the underlying host resources and disk space.

If there are a lot of SAS users at your site, I think avoiding such scenarios takes a combination of alerting rules, thresholds and user education. For SAS Grid you can have rules like:

- No process can use more than 500G of SASWORK; if it needs additional space, that has to be requested from the platform team
- A limit on the number of jobs submitted
- Usage of queues based on priority
- Specific nodes assigned to departments/groups
- High-volume jobs executed at night or on weekends
- User education on how to plan and use the SAS platform when submitting jobs that may take a lot of resources
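As a rough illustration of the SASWORK rule, here is a minimal Python sketch of a threshold check that could feed an alert. It rests on assumptions that are not from your site: that every immediate subdirectory of the SASWORK root belongs to one SAS process, and that the root path you pass in is where your SASWORK lives. The function names `dir_size` and `over_limit` are made up for this example.

```python
import os
from pathlib import Path

# 500G per-process limit, taken from the rule above (binary gigabytes assumed).
LIMIT_BYTES = 500 * 1024**3


def dir_size(path):
    """Return the total size in bytes of all files below path."""
    total = 0
    # Ignore directories that vanish or are unreadable while we scan.
    for root, _dirs, files in os.walk(path, onerror=lambda err: None):
        for name in files:
            try:
                total += os.lstat(os.path.join(root, name)).st_size
            except OSError:
                # Work files are often deleted mid-scan; skip them.
                pass
    return total


def over_limit(saswork_root, limit_bytes=LIMIT_BYTES):
    """List (directory name, bytes) for work directories above the limit.

    Assumption: each immediate subdirectory of saswork_root is the
    work area of a single SAS process.
    """
    offenders = []
    for entry in Path(saswork_root).iterdir():
        if entry.is_dir():
            size = dir_size(entry)
            if size > limit_bytes:
                offenders.append((entry.name, size))
    # Largest consumers first, so the worst offender heads the alert.
    return sorted(offenders, key=lambda item: item[1], reverse=True)
```

Something like this could run from a scheduler every few minutes, with the result of `over_limit` passed to whatever alerting tool the platform team already uses. It only reports usage; enforcing the limit or contacting the user stays a platform team decision.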