[Trillian-users] trillian issue

Maciolek, Mark Mark.Maciolek at unh.edu
Thu Dec 6 10:27:11 EST 2018


Hi,

Jobs submitted on trillian are being queued and not running since yesterday morning.

qstat -s shows this reason

Not Running: Insufficient amount of resource arch

The only logs are from PBS mom_logs showing this:

20181205:12/05/2018 08:00:01;0080;pbs_mom;Node;alps_engine_query;ALPS ENGINE query failed with BASIL version 1.1.
20181205:12/05/2018 08:00:01;0002;pbs_mom;Node;alps_inventory;ALPS inventory request failed.
20181205:12/05/2018 08:10:01;0080;pbs_mom;Node;alps_engine_query;ALPS ENGINE query failed with BASIL version 1.1.
20181205:12/05/2018 08:10:01;0002;pbs_mom;Node;alps_inventory;ALPS inventory request failed.

The alps logs don't show any obvious issue. 

I restarted pbs on trillian, which had the effect of cancelling some jobs and restarting others.

Will keep an eye on the logs for now.

Mark

--Mark Maciolek
Network Administrator
Morse Hall Rm 338
http://www.unh.edu/research/support-units/research-computing-center




More information about the Trillian-users mailing list