[Trillian-users] Trillian is on hold again

Mark Maciolek Mark.Maciolek at unh.edu
Mon May 22 13:18:52 EDT 2017


Hi,

 

Restarting PBS and ALPS had no effect on the job queue, so I will be rebooting Trillian at 1:30 PM EDT today.

 

Mark

 

From: Trillian-users [mailto:trillian-users-bounces at lists.sr.unh.edu] On Behalf Of Maciolek, Mark
Sent: Monday, May 22, 2017 11:52 AM
To: Joseph Jensen <jbj1 at wildcats.unh.edu>
Cc: trillian-users at lists.sr.unh.edu
Subject: Re: [Trillian-users] Trillian is on hold again

From: Joseph Jensen [mailto:jbj1 at wildcats.unh.edu] 
Sent: Monday, May 22, 2017 11:43 AM
To: Maciolek, Mark <Mark.Maciolek at unh.edu>
Subject: Trillian is on hold again

 

Hi Mark 

I am not sure if you are already aware, but Trillian is on hold, and all the programs that are currently running have been going for longer than 3 days.

 

Joseph B. Jensen

 

 

Hi,

 

On Sunday we reached the max open files limit again:

 

2017-05-21 09:29:50: [8057] ------------------------------------------ resvconfirm msg

2017-05-21 09:29:50: [8057] type confirm uid 33040 gid 1000 apid 0 pagg 0 resId 0 numCmds 1

2017-05-21 09:29:50: [8057] File new reservation resId 38 pagg 0 flags 0x200

2017-05-21 09:29:50: [8057] Confirmed apid 125869 resId 38 pagg 0 flags 0x200 nids: 199

2017-05-21 09:29:50: [8057] openSocket:665: socket: Too many open files

2017-05-21 09:29:50: [8057] main:1683: parseXml error: ret 'TCP socket open failed' (timeout 0)
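For anyone who wants to check a log for the same failure, here is a minimal Python sketch that picks out the "Too many open files" lines. The line layout (timestamp, a bracketed number, then the message) is assumed from the excerpt above, and treating the bracketed number as a process ID is a guess. The function name find_fd_exhaustion is made up for illustration.

```python
import re

# Layout assumed from the excerpt: "<date> <time>: [<number>] <message>".
# The bracketed number is presumed to be a process ID (an assumption).
LOG_LINE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}): \[(\d+)\] (.*)$")


def find_fd_exhaustion(lines):
    """Return (timestamp, number, message) for lines reporting fd exhaustion."""
    hits = []
    for line in lines:
        match = LOG_LINE.match(line.strip())
        if match and "Too many open files" in match.group(3):
            hits.append((match.group(1), int(match.group(2)), match.group(3)))
    return hits


# Two lines copied from the excerpt above.
sample = [
    "2017-05-21 09:29:50: [8057] Confirmed apid 125869 resId 38 pagg 0 flags 0x200 nids: 199",
    "2017-05-21 09:29:50: [8057] openSocket:665: socket: Too many open files",
]

for timestamp, number, message in find_fd_exhaustion(sample):
    print(timestamp, number, message)
```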

 

 

I have increased the limit to 10000 and will watch it, but if jobs don’t start in the next 30 minutes I will restart the ap scheduler.
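As a rough illustration of the kind of limit involved, the Python sketch below reads the per-process open-file limit (RLIMIT_NOFILE) and raises the soft limit toward a target. It is not how the limit was changed on Trillian; it only shows the general mechanism for the current process on Linux, and the helper names are made up.

```python
import os
import resource


def nofile_limits():
    """Return the (soft, hard) open-file limits of the current process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def raise_soft_limit(target):
    """Raise the soft limit toward target, never past the hard limit."""
    soft, hard = nofile_limits()
    if soft == resource.RLIM_INFINITY:
        return soft  # already unlimited, nothing to do
    if hard != resource.RLIM_INFINITY:
        target = min(target, hard)
    if target <= soft:
        return soft  # current soft limit is already at least the target
    resource.setrlimit(resource.RLIMIT_NOFILE, (target, hard))
    return target


def open_fd_count(pid="self"):
    """Count open descriptors of a process via /proc (Linux only)."""
    return len(os.listdir(f"/proc/{pid}/fd"))


print("limits before:", nofile_limits())
print("soft limit now:", raise_soft_limit(10000))
if os.path.isdir("/proc/self/fd"):
    print("descriptors in use:", open_fd_count())
```

Comparing the descriptor count against the soft limit over time is one way to see whether a process is creeping toward the limit again.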

 

Mark

 
