Enable fast job dispatch

Procedure

  1. Log in to the LSF master host as the root user.
  2. Set LSB_MAX_JOB_DISPATCH_PER_SESSION = Min(Max(300, Total CPUs), 3000).
  3. Set MAX_SBD_CONNS equal to the number of hosts in the cluster plus 2*LSB_MAX_JOB_DISPATCH_PER_SESSION plus a buffer of 200.
    Note:

    The system has automatically set this for you. If not suitable, you can manually adjust it.

  4. In lsf.conf, set the parameter LSB_MAX_JOB_DISPATCH_PER_SESSION to a value greater than 300 and less than or equal to one-half the value of MAX_SBD_CONNS. Total File Descriptors = Max (Available FDs, MAX_SBD_CONNS+100)
    Note:

    The system has automatically set this for you. If not suitable, you can still manually adjust it.

  5. In lsf.conf, define the parameter LSF_SERVER_HOSTS to decrease the load on the master LIM.
  6. In the shell you used to increase the file descriptor limit, shut down the LSF batch daemons on the master host:

    badmin hshutdown

  7. Run badmin mbdrestart to restart the LSF batch daemons on the master host.
  8. Run badmin hrestart all to restart every sbatchd in the cluster:
    Note:

    When you shut down the batch daemons on the master host, all LSF services are temporarily unavailable, but existing jobs are not affected. When mbatchd is later started by sbatchd, its previous status is restored and job scheduling continues.