in Viya 4 2024.09 LTS on AWS EKS I get sporadic "sas-files" pod failure due to OOM. While running the "Kubectl get logs" for the previous sas-files pod (to check the failure reason) : "{"version":1,"timeStamp":"2025-03-14T13:28:12.117Z","level":"warn","source":"sas-files","message":"Reached the maximum number of URI tags for 'http.client.requests'. Are you using 'uriVariables'?","properties":{"logger":"org.springframework.boot.actuate.autoconfigure.metrics.OnlyOnceLoggingDenyMeterFilter","thread":"https-jsse-nio-8080-exec-4","__session":"88884b07-ff5f-49e5-bea6-49d1b1cea81f","username":"John.Smith@somesascompany.com"}} At this point it hits the OOM . then pod logs additional entries : {"version":1,"timeStamp":"2025-03-14T18:16:58.564Z","level":"error","source":"sas-files","message":"com.sas.commons.rest.exceptions.ResourceException: An error occurred. The request failed.","properties":{"logger":"com.sas.commons.rest.ExceptionLog","thread":"https-jsse-nio-8080-exec-7","username":"sas.catalog"},"messageParameters":{"0":"com.sas.commons.rest.exceptions.ResourceException","1":"An error occurred. The request failed."}} {"version":1,"timeStamp":"2025-03-14T18:22:22.522Z","level":"error","source":"sas-files","message":"ERROR: large object 3607475 does not exist Exception : org.postgresql.util.PSQLException: ERROR: large object 3607475 does not exist","properties":{"logger":"com.sas.svcs.file.domain.PostgresContentDAOImpl","thread":"https-jsse-nio-8080-exec-3","username":"sas.searchIndex"}} {"version":1,"timeStamp":"2025-03-14T18:22:22.522Z","level":"error","source":"sas-files","message":"An error occurred. The request failed. Exception : ResourceException: An error occurred. The request failed. Http Status: 500 INTERNAL_SERVER_ERROR","properties":{"logger":"com.sas.svcs.file.domain.PostgresContentDAOImpl","thread":"https-jsse-nio-8080-exec-3","username":"sas.searchIndex"}} {"version":1,"timeStamp":"2025-03-14T18:22:22.531Z","level":"error","source":"sas-files","message":"com.sas.commons.rest.exceptions.ResourceException: An error occurred. The request failed.","properties":{"logger":"com.sas.commons.rest.ExceptionLog","thread":"https-jsse-nio-8080-exec-3","username":"sas.searchIndex"},"messageParameters":{"0":"com.sas.commons.rest.exceptions.ResourceException","1":"An error occurred. The request failed."}} {"version":1,"timeStamp":"2025-03-14T18:22:22.55Z","level":"error","source":"sas-files","message":"ERROR: large object 3607475 does not exist Exception : org.postgresql.util.PSQLException: ERROR: large object 3607475 does not exist","properties":{"logger":"com.sas.svcs.file.domain.PostgresContentDAOImpl","thread":"https-jsse-nio-8080-exec-1","username":"sas.catalog"}} {"version":1,"timeStamp":"2025-03-14T18:22:22.551Z","level":"error","source":"sas-files","message":"An error occurred. The request failed. Exception : ResourceException: An error occurred. The request failed. Http Status: 500 INTERNAL_SERVER_ERROR","properties":{"logger":"com.sas.svcs.file.domain.PostgresContentDAOImpl","thread":"https-jsse-nio-8080-exec-1","username":"sas.catalog"}} {"version":1,"timeStamp":"2025-03-14T18:22:22.552Z","level":"error","source":"sas-files","message":"com.sas.commons.rest.exceptions.ResourceException: An error occurred. The request failed.","properties":{"logger":"com.sas.commons.rest.ExceptionLog","thread":"https-jsse-nio-8080-exec-1","username":"sas.catalog"},"messageParameters":{"0":"com.sas.commons.rest.exceptions.ResourceException","1":"An error occurred. The request failed."}} One simple solution would be to increase the pod/deploy memory, but I assume this user might generate too high memory but I am not clear of the process, can you please help to troubleshoot for the root cause? Thanks a lot!
... View more