<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Recursive Segmentation Violations - system exiting error while  executing sas command from LSF in Administration and Deployment</title>
    <link>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900109#M27419</link>
    <description>&lt;P&gt;Hello, experts!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have a problem with executing scheduled job from Management console using LSF.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A .sh script is executed from LSF. It is a shell over the execution of the sas command. The required deployed stp (deployed job) and sas system options are passed to the .sh script as parameters.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The sas command inside .sh script looks as follows:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;/sas/SAS94/SASFoundation/9.4/sas -xcmd -noterminal -nosyntaxcheck -autoexec ./autoexec.sas -sysin ./scripts/$SCRIPT.sas -nolog -noprint -altlog $LOG_FILE&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When the command crashes, there are no entries in altlog. When trying to redirect the error output stream to a file, we get a certain error:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;ERROR: Recursive Segmentation Violations - system exiting&lt;/STRONG&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This error appears every day with multiple jobs.&amp;nbsp;&lt;STRONG&gt;The problem is that it is not related to any particular job&lt;/STRONG&gt;. Today a job can work successfully, and tomorrow it will fall into an error.&lt;/P&gt;
&lt;P&gt;Some of the jobs (most of them) run successfully, some of them do not. In case of detecting a problem with the start of a job, restarting the job manually helps.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There are no problems with RAM utilization on the server.&lt;BR /&gt;Could you please tell me what this problem may be related to?&lt;/P&gt;</description>
    <pubDate>Thu, 26 Oct 2023 10:29:55 GMT</pubDate>
    <dc:creator>John_Wick</dc:creator>
    <dc:date>2023-10-26T10:29:55Z</dc:date>
    <item>
      <title>Recursive Segmentation Violations - system exiting error while  executing sas command from LSF</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900109#M27419</link>
      <description>&lt;P&gt;Hello, experts!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have a problem with executing scheduled job from Management console using LSF.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A .sh script is executed from LSF. It is a shell over the execution of the sas command. The required deployed stp (deployed job) and sas system options are passed to the .sh script as parameters.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The sas command inside .sh script looks as follows:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;/sas/SAS94/SASFoundation/9.4/sas -xcmd -noterminal -nosyntaxcheck -autoexec ./autoexec.sas -sysin ./scripts/$SCRIPT.sas -nolog -noprint -altlog $LOG_FILE&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When the command crashes, there are no entries in altlog. When trying to redirect the error output stream to a file, we get a certain error:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;ERROR: Recursive Segmentation Violations - system exiting&lt;/STRONG&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This error appears every day with multiple jobs.&amp;nbsp;&lt;STRONG&gt;The problem is that it is not related to any particular job&lt;/STRONG&gt;. Today a job can work successfully, and tomorrow it will fall into an error.&lt;/P&gt;
&lt;P&gt;Some of the jobs (most of them) run successfully, some of them do not. In case of detecting a problem with the start of a job, restarting the job manually helps.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There are no problems with RAM utilization on the server.&lt;BR /&gt;Could you please tell me what this problem may be related to?&lt;/P&gt;</description>
      <pubDate>Thu, 26 Oct 2023 10:29:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900109#M27419</guid>
      <dc:creator>John_Wick</dc:creator>
      <dc:date>2023-10-26T10:29:55Z</dc:date>
    </item>
    <item>
      <title>Re: Recursive Segmentation Violations - system exiting error while  executing sas command from LSF</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900121#M27420</link>
      <description>&lt;P&gt;I would try to find the similarities in the SEGVs.&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;If you have multiple machines, do the SEGVs happen on a specific machine?&lt;/LI&gt;
&lt;LI&gt;Does it happen for a specific user?&lt;/LI&gt;
&lt;LI&gt;Does it happen for a specific SAS program?&lt;/LI&gt;
&lt;LI&gt;Does it happen during a specific time of the day?&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Thu, 26 Oct 2023 12:38:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900121#M27420</guid>
      <dc:creator>doug_sas</dc:creator>
      <dc:date>2023-10-26T12:38:28Z</dc:date>
    </item>
    <item>
      <title>Re: Recursive Segmentation Violations - system exiting error while  executing sas command from LSF</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900131#M27422</link>
      <description>1) Yes, we have multiple machines. SEGVs happen on a specific machine&lt;BR /&gt;2) This happens for user "lsfadmin", because the scheduled job is running under that user&lt;BR /&gt;3) No, it may happen for different sas programs&lt;BR /&gt;4) It most often occurs in a specific time frame that is fairly spread out over a period of time</description>
      <pubDate>Thu, 26 Oct 2023 13:50:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900131#M27422</guid>
      <dc:creator>John_Wick</dc:creator>
      <dc:date>2023-10-26T13:50:20Z</dc:date>
    </item>
    <item>
      <title>Re: Recursive Segmentation Violations - system exiting error while  executing sas command from LSF</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900140#M27423</link>
      <description>&lt;P&gt;If SEGVs occur on one specific machine, my guess would be that something is out of sync due to a incomplete or failed hotfix installation. Assuming all your machines are at the same version/maintenance level with the same hotfixes installed, compare the files in the &amp;lt;SASROOT&amp;gt;/SASFoundation/9.4 directory on a working machine to the ones on the failing machine to see if there are differences, specifically in shared library files *.so (UNIX/Linux) or *.dll (Windows).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Also, the SEGVs may occur in multiple SAS programs, but only when executing something similar (a specific PROC or data step function). If you can find out what it was executing when SEGV occurred, maybe that would help narrow it down.&lt;/P&gt;</description>
      <pubDate>Thu, 26 Oct 2023 14:16:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900140#M27423</guid>
      <dc:creator>doug_sas</dc:creator>
      <dc:date>2023-10-26T14:16:48Z</dc:date>
    </item>
    <item>
      <title>Re: Recursive Segmentation Violations - system exiting error while  executing sas command from LSF</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900161#M27425</link>
      <description>The problem is that the error occurs at the stage of running the command itself, which I specified in the problem description.</description>
      <pubDate>Thu, 26 Oct 2023 15:44:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900161#M27425</guid>
      <dc:creator>John_Wick</dc:creator>
      <dc:date>2023-10-26T15:44:04Z</dc:date>
    </item>
    <item>
      <title>Re: Recursive Segmentation Violations - system exiting error while  executing sas command from LSF</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900213#M27428</link>
      <description>&lt;P&gt;I'd suggest opening a track with SAS Tech Support for this. Given it is a machine-specific problem that introduces the possibility this is a hardware-related problem. Is it a physical or virtual server? Hardware issues are more likely with physical servers. It's possible that more diagnostics are needed to identify the problem.&lt;/P&gt;</description>
      <pubDate>Thu, 26 Oct 2023 20:08:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900213#M27428</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2023-10-26T20:08:40Z</dc:date>
    </item>
    <item>
      <title>Re: Recursive Segmentation Violations - system exiting error while  executing sas command from LSF</title>
      <link>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900325#M27435</link>
      <description>&lt;P&gt;The cause of this error could not be found. But we managed to use the following workaround:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;Using &lt;STRONG&gt;bhist -all&lt;/STRONG&gt;, determine the error status in LSF with which the jobs are falling (in our case Exit code=102. Also with SIGSEGV, Exit code=139 is possible)&lt;/LI&gt;
&lt;LI&gt;Define the queue in which the jobs are executed (in our case &lt;STRONG&gt;QUEUEUE_NAME=normal&lt;/STRONG&gt;. The queue name is also specified when using &lt;STRONG&gt;bhist -all&lt;/STRONG&gt;)&lt;/LI&gt;
&lt;LI&gt;In the file &lt;BR /&gt;&lt;STRONG&gt;&amp;lt;LSF-root-directory&amp;gt;/conf/lsbatch/sas_cluster/configdir/lsb.queues&lt;/STRONG&gt;&lt;BR /&gt;Add the REQUEUE_EXIT_VALUES and MAX_JOB_REQUEUE parameters:&lt;BR /&gt;...&lt;BR /&gt;Begin Queue&lt;BR /&gt;QUEUEUE_NAME=normal&lt;BR /&gt;...&lt;BR /&gt;&lt;STRONG&gt;REQUEUE_EXIT_VALUES=139 102&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;MAX_JOB_REQUEUE=3&lt;/STRONG&gt;&lt;BR /&gt;...&lt;BR /&gt;End Queue&lt;BR /&gt;...&lt;/LI&gt;
&lt;LI&gt;Save the changes&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;This will allow LSF to restart jobs in case of errors 139 and 102. The number of restarts is adjusted in MAX_JOB_REQUEUE. In our case it is 3 times.&lt;/P&gt;</description>
      <pubDate>Fri, 27 Oct 2023 12:48:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Administration-and-Deployment/Recursive-Segmentation-Violations-system-exiting-error-while/m-p/900325#M27435</guid>
      <dc:creator>John_Wick</dc:creator>
      <dc:date>2023-10-27T12:48:43Z</dc:date>
    </item>
  </channel>
</rss>

