Exclusive job requeue

You can configure automatic job requeue so that a failed job is not rerun on the same host.

Limitations

  • If mbatchd is restarted, this feature might not work properly, since LSF forgets which hosts have been excluded. If a job ran on a host and exited with an exclusive exit code before mbatchd was restarted, the job could be dispatched to the same host again after mbatchd is restarted.

  • Exclusive job requeue does not work for MultiCluster jobs or parallel jobs

  • A job terminated by a signal is not requeued