BookmarkSubscribeRSS Feed
Go
Quartz | Level 8 Go
Quartz | Level 8

Hi All , can some one assit me with below error ? we have 2 node lsf/grid, only 2 app servers, they are not able to communicate between them, below is the error I see, any idea ?

 

 

/sasgrid/BCUS_SASGRID/lsf/log>tail -f lim.log.dpds1335a

Jun 17 18:05:35 2019 18744562 3 3.4.0 probeMasterTcp: Last master is  UNKNOWN

Jun 17 18:05:35 2019 18744562 3 3.4.0 isHighestCandidate: Attempting to probe master candidate dpds1336a timeout is 2

Jun 17 18:05:50 2019 18744562 3 3.4.0 probeMasterTcp: Last master is  UNKNOWN

Jun 17 18:05:50 2019 18744562 3 3.4.0 isHighestCandidate: Attempting to probe master candidate dpds1336a timeout is 2

Jun 17 18:05:50 2019 18744562 3 3.4.0 probeMasterTcp: Last master is  UNKNOWN

Jun 17 18:05:50 2019 18744562 3 3.4.0 isHighestCandidate: Attempting to probe master candidate dpds1336a timeout is 2

Jun 17 18:06:05 2019 18744562 3 3.4.0 probeMasterTcp: Last master is  UNKNOWN

Jun 17 18:06:05 2019 18744562 3 3.4.0 isHighestCandidate: Attempting to probe master candidate dpds1336a timeout is 2

Jun 17 18:06:05 2019 18744562 3 3.4.0 probeMasterTcp: Last master is  UNKNOWN

Jun 17 18:06:05 2019 18744562 3 3.4.0 isHighestCandidate: Attempting to probe master candidate dpds1336a timeout is 2

Jun 17 18:06:20 2019 18744562 3 3.4.0 probeMasterTcp: Last master is  UNKNOWN

Jun 17 18:06:20 2019 18744562 3 3.4.0 isHighestCandidate: Attempting to probe master candidate dpds1336a timeout is 2

2 REPLIES 2
SASKiwi
PROC Star

Have you checked that all SAS services on both app servers are up and running? You could also try rebooting all of your SAS servers in the correct sequence.

 

Also raise a track with SAS Tech Support as they can get you going faster.

doug_sas
SAS Employee

Lots of things could cause this problem.

  1. DNS changes such that the dpds1335a cannot resolve dpds1336a
  2. Firewall changes on dpds1335a that prevents dpds1336a from connecting
  3. Network equipment breakdown.
  4. LIM is hung on dpds1335a
  5. License has expired.

So you can do the following:

  • Make sure LIM is running and responding on dpds1335a. Run 'lsid' on dpds1335a
  • Make sure you can ping dpds1335a from dpds1336a.
  • You could also try to telnet from dpds1336a to dpds1335a on the LIM port.

suga badge.PNGThe SAS Users Group for Administrators (SUGA) is open to all SAS administrators and architects who install, update, manage or maintain a SAS deployment. 

Join SUGA 

Get Started with SAS Information Catalog in SAS Viya

SAS technical trainer Erin Winters shows you how to explore assets, create new data discovery agents, schedule data discovery agents, and much more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 969 views
  • 3 likes
  • 3 in conversation