Specifying mandatory application-level checkpoint and restart executables that apply to all checkpointable batch jobs in the cluster
Specifying the directory that contains customized application-level checkpoint and restart executables
Saving standard output and standard error to files in the checkpoint directory
Automatically checkpointing jobs before suspending or terminating them
For Cray systems only, copying all open job files to the checkpoint directory
Configuration file |
Parameter and syntax |
Behavior |
---|---|---|
lsf.conf |
LSB_ECHKPNT_METHOD= "echkpnt_application" |
|
Configuration file |
Parameter and syntax |
Behavior |
---|---|---|
lsf.conf |
LSB_ECHKPNT_METHOD_DIR=path |
|
Configuration file |
Parameter and syntax |
Behavior |
---|---|---|
lsf.conf |
LSB_ECHKPNT_KEEP_OUTPUT=Y | y |
|
Configuration file |
Parameter and syntax |
Behavior |
---|---|---|
lsb.queues |
JOB_CONTROLS=SUSPEND CHKPNT TERMINATE |
|
Configuration file |
Parameter and syntax |
Behavior |
---|---|---|
lsb.hosts |
|
|