IBM

queueInfoEnt Struct Reference

queueInfoEnt queue information entry.


Data Fields

char * queue
char * description
int priority
short nice
char * userList
char * hostList
char * hostStr
int nIdx
float * loadSched
float * loadStop
int userJobLimit
float procJobLimit
char * windows
int rLimits [LSF_RLIM_NLIMITS]
char * hostSpec
int qAttrib
int qStatus
int maxJobs
int numJobs
int numPEND
int numRUN
int numSSUSP
int numUSUSP
int mig
int schedDelay
int acceptIntvl
char * windowsD
char * nqsQueues
char * userShares
char * defaultHostSpec
int procLimit
char * admins
char * preCmd
char * postCmd
char * requeueEValues
int hostJobLimit
char * resReq
int numRESERVE
int slotHoldTime
char * sndJobsTo
char * rcvJobsFrom
char * resumeCond
char * stopCond
char * jobStarter
char * suspendActCmd
char * resumeActCmd
char * terminateActCmd
int sigMap [LSB_SIG_NUM]
char * preemption
int maxRschedTime
int numOfSAccts
struct shareAcctInfoEnt * shareAccts
char * chkpntDir
int chkpntPeriod
int imptJobBklg
int defLimits [LSF_RLIM_NLIMITS]
int chunkJobSize
int minProcLimit
int defProcLimit
char * fairshareQueues
char * defExtSched
char * mandExtSched
int slotShare
char * slotPool
int underRCond
int overRCond
float idleCond
int underRJobs
int overRJobs
int idleJobs
int warningTimePeriod
char * warningAction
char * qCtrlMsg
char * acResReq
int symJobLimit
char * cpuReq
int proAttr
int lendLimit
int hostReallocInterval
int numCPURequired
int numCPUAllocated
int numCPUBorrowed
int numCPULent
int schGranularity
int symTaskGracePeriod
int minOfSsm
int maxOfSsm
int numOfAllocSlots
char * servicePreemption
int provisionStatus
int minTimeSlice
char * queueGroup
int numApsFactors
struct apsFactorInfo * apsFactorInfoList
struct apsFactorMap * apsFactorMaps
struct apsLongNameMap * apsLongNames
int maxJobPreempt
int maxPreExecRetry
int localMaxPreExecRetry
int maxJobRequeue
int usePam
int cu_type_exclusive
char * cu_str_exclusive
char * resRsvLimit
struct fsFactors fairFactors
int maxSlotsInPool
int usePriorityInPool
int noPreemptInterval
int maxTotalTimePreempt
int qAttrib2
int preemptDelayTime
int imptSlotBklg
int maxSlotsPreempted
char * successEValues
int nDiscreteProcLimits
int * discreteProcLimits
int maxProtocolInstance
char * networkReq
char * globalFairshare
int cpuFrequency
int maxPerJobHostLimit
int * jobSizeList
char * hostPreCmd
char * hostPostCmd
int localMaxPreExecRetryAction

Detailed Description

queueInfoEnt queue information entry.

Field Documentation

The name of the queue.

Describes the typical use of the queue.

Defines the priority of the queue.

This determines the order in which the job queues are searched at job dispatch time: queues with higher priority values are searched first. (This is contrary to UNIX process priority ordering.)
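
The search order described above can be sketched as a descending sort on priority. This is only an illustration: struct queue_prio is a hypothetical stand-in holding two queueInfoEnt fields, and sort_for_dispatch is not an LSF function.

```c
#include <stdlib.h>

/* Hypothetical stand-in for the queue and priority fields of queueInfoEnt. */
struct queue_prio {
    const char *queue;
    int priority;
};

/* qsort comparator: larger priority values sort first. */
static int by_priority_desc(const void *a, const void *b)
{
    const struct queue_prio *qa = a;
    const struct queue_prio *qb = b;
    return (qa->priority < qb->priority) - (qa->priority > qb->priority);
}

/* Reorders queues so that the highest priority value comes first. */
void sort_for_dispatch(struct queue_prio *queues, size_t n)
{
    qsort(queues, n, sizeof queues[0], by_priority_desc);
}
```

Note that qsort() is not stable, so the relative order of queues with equal priority is unspecified in this sketch.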

Defines the nice value at which jobs in this queue will be run.

A blank-separated list of names of users allowed to submit jobs to this queue.

A blank-separated list of names of hosts to which jobs in this queue may be dispatched.

Original HOSTS string in case "-" is used.

The number of load indices in the loadSched and loadStop arrays.

The queue and host loadSched and loadStop arrays control batch job dispatch, suspension, and resumption.

The values in the loadSched array specify thresholds for the corresponding load indices. Only if the current values of all specified load indices of a host are within (below or above, depending on the meaning of the load index) the corresponding thresholds of this queue, will jobs in this queue be dispatched to the host. The same conditions are used to resume jobs dispatched from this queue that have been suspended on the host.

The values in the loadStop array specify the thresholds for job suspension.

If any of the current load index values of a host goes beyond a queue's threshold, jobs from the queue will be suspended.

For an explanation of the fields in the loadSched and loadStop arrays, see lsb_hostinfo.
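
The dispatch and suspension rules above can be sketched as two threshold checks. This is a simplified illustration, not LSF code: the increasing array is a hypothetical per-index flag saying whether a larger value means a busier host, and every index is assumed to have a threshold specified.

```c
#include <stdbool.h>

/* True when a load value has gone beyond its threshold.
 * "increasing" is a hypothetical flag: true if larger values mean more load. */
static bool beyond(float load, float threshold, bool increasing)
{
    return increasing ? load > threshold : load < threshold;
}

/* Dispatch (or resume) only if every index is within its loadSched threshold. */
bool can_dispatch(const float *load, const float *loadSched,
                  const bool *increasing, int nIdx)
{
    for (int i = 0; i < nIdx; i++)
        if (beyond(load[i], loadSched[i], increasing[i]))
            return false;
    return true;
}

/* Suspend as soon as any index goes beyond its loadStop threshold. */
bool should_suspend(const float *load, const float *loadStop,
                    const bool *increasing, int nIdx)
{
    for (int i = 0; i < nIdx; i++)
        if (beyond(load[i], loadStop[i], increasing[i]))
            return true;
    return false;
}
```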

Per-user limit on the number of jobs that can be dispatched from this queue and executed concurrently.

Per-processor limit on the number of jobs that can be dispatched from this queue and executed concurrently.

A blank-separated list of time windows describing the run window of the queue.

When a queue's run window is closed, no job from this queue will be dispatched. When the run window closes, any running jobs from this queue will be suspended until the run window reopens, when they will be resumed. The default is no restriction, or always open (i.e., 24 hours a day, seven days a week).

A time window has the format begin_time-end_time. Time is specified in the format [day:]hour[:minute], where all fields are numbers in their respective legal ranges: 0(Sunday)-6 for day, 0-23 for hour, and 0-59 for minute. The default value for minute is 0 (on the hour); the default value for day is every day of the week. The begin_time and end_time of a window are separated by `-', with no white space (i.e., blank or TAB) in between. Both begin_time and end_time must be present for a time window.

Note that this run window only applies to batch jobs; interactive jobs scheduled by the LSF Load Information Manager (LIM) are controlled by another set of run windows.
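
The time window format described above can be parsed along the following lines. This is a sketch under stated assumptions: struct time_point and both functions are hypothetical names, and a two-field time such as 8:30 is assumed to mean hour:minute rather than day:hour.

```c
#include <ctype.h>
#include <stdbool.h>
#include <stdio.h>
#include <string.h>

struct time_point {
    int day;     /* 0 (Sunday) to 6, or -1 for every day of the week */
    int hour;    /* 0 to 23 */
    int minute;  /* 0 to 59 */
};

/* Parses one [day:]hour[:minute] value of length len starting at s. */
static bool parse_time_point(const char *s, size_t len, struct time_point *tp)
{
    char buf[9];
    int a = 0, b = 0, c = 0, colons = 0;

    if (len == 0 || len >= sizeof buf)
        return false;
    for (size_t i = 0; i < len; i++) {
        if (s[i] == ':')
            colons++;
        else if (!isdigit((unsigned char)s[i]))
            return false;              /* rejects blanks, TABs, and signs */
    }
    memcpy(buf, s, len);
    buf[len] = '\0';

    int n = sscanf(buf, "%d:%d:%d", &a, &b, &c);
    if (n < 1 || n != colons + 1)
        return false;                  /* empty field or too many fields */

    tp->day = -1;                      /* default: every day */
    tp->minute = 0;                    /* default: on the hour */
    if (n == 3) { tp->day = a; tp->hour = b; tp->minute = c; }
    else if (n == 2) { tp->hour = a; tp->minute = b; }  /* assumed hour:minute */
    else { tp->hour = a; }

    return tp->day >= -1 && tp->day <= 6 &&
           tp->hour >= 0 && tp->hour <= 23 &&
           tp->minute >= 0 && tp->minute <= 59;
}

/* Parses a single begin_time-end_time window with no white space. */
bool parse_time_window(const char *s, struct time_point *begin,
                       struct time_point *end)
{
    const char *dash = strchr(s, '-');
    if (dash == NULL)
        return false;                  /* both times must be present */
    return parse_time_point(s, (size_t)(dash - s), begin) &&
           parse_time_point(dash + 1, strlen(dash + 1), end);
}
```

A caller would first split the windows string on blanks and pass each window to parse_time_window() separately.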

int queueInfoEnt::rLimits[LSF_RLIM_NLIMITS]

The per-process UNIX hard resource limits for all jobs submitted to this queue (see getrlimit() and lsb.queues).

The default values for the resource limits are unlimited, indicated by -1. The constants used to index the rLimits array and the corresponding resource limits are listed below.
LSF_RLIMIT_CPU (CPULIMIT)
LSF_RLIMIT_FSIZE (FILELIMIT)
LSF_RLIMIT_DATA (DATALIMIT)
LSF_RLIMIT_STACK (STACKLIMIT)
LSF_RLIMIT_CORE (CORELIMIT)
LSF_RLIMIT_RSS (MEMLIMIT)
LSF_RLIMIT_RUN (RUNLIMIT)
LSF_RLIMIT_PROCESS (PROCESSLIMIT)
LSF_RLIMIT_SWAP (SWAPLIMIT)
LSF_RLIMIT_THREAD
LSF_RLIMIT_NOFILE
LSF_RLIMIT_OPENMAX
LSF_RLIMIT_VMEM
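
Because -1 marks an unlimited entry, code that displays rLimits values typically needs to special-case it. A minimal sketch follows; EX_UNLIMITED and format_limit are hypothetical names, not part of the LSF API.

```c
#include <stdio.h>

#define EX_UNLIMITED (-1)   /* hypothetical name for the "unlimited" marker */

/* Writes a printable form of one limit entry into buf and returns buf. */
const char *format_limit(int value, char *buf, size_t size)
{
    if (value == EX_UNLIMITED)
        snprintf(buf, size, "unlimited");
    else
        snprintf(buf, size, "%d", value);
    return buf;
}
```

In real code the value passed in would be an entry such as rLimits[LSF_RLIMIT_CPU], with the index constants taken from the LSF headers.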

A host name or host model name.

If the queue CPULIMIT or RUNLIMIT gives a host specification, hostSpec will be that specification. Otherwise, if defaultHostSpec (see below) is not NULL, hostSpec will be defaultHostSpec. Otherwise, if DEFAULT_HOST_SPEC is defined in the lsb.params file (see lsb.params), hostSpec will be this value. Otherwise, hostSpec will be the name of the host with the largest CPU factor in the cluster.
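
The fallback order above amounts to picking the first value that is set. A sketch, assuming each candidate is passed as a string that is NULL when not configured; resolve_host_spec and its parameter names are hypothetical.

```c
#include <stddef.h>

/* Returns the first non-NULL candidate, in the documented order:
 * limitSpec      - host specification given with CPULIMIT or RUNLIMIT
 * queueDefault   - defaultHostSpec of the queue
 * clusterDefault - DEFAULT_HOST_SPEC from lsb.params
 * fastestHost    - host with the largest CPU factor in the cluster */
const char *resolve_host_spec(const char *limitSpec,
                              const char *queueDefault,
                              const char *clusterDefault,
                              const char *fastestHost)
{
    if (limitSpec != NULL)
        return limitSpec;
    if (queueDefault != NULL)
        return queueDefault;
    if (clusterDefault != NULL)
        return clusterDefault;
    return fastestHost;
}
```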

The attributes of the queue.

The status of the queue.

The maximum number of jobs dispatched by the queue and not yet finished.

Number of jobs in the queue, including pending, running, and suspended jobs.

Number of pending jobs in the queue.

Number of running jobs in the queue.

Number of system suspended jobs in the queue.

Number of user suspended jobs in the queue.

The queue migration threshold in minutes.

The number of seconds that a new job waits, before being scheduled.

A value of zero (0) means the job is scheduled without any delay.

The number of seconds to wait after dispatching a job to a host before dispatching a second job to the same host.

A blank-separated list of time windows describing the dispatch window of the queue.

When a queue's dispatch window is closed, no job from this queue will be dispatched. The default is no restriction, or always open (i.e., 24 hours a day, seven days a week).

For the time window format, see windows (above).

A blank-separated list of queue specifiers.

Each queue specifier is of the form queue@host where host is an NQS host name and queue is the name of a queue on that host.

A blank-separated list of user shares.

Each share is of the form [user, share] where user is a user name, a user group name, the reserved word default or the reserved word others, and share is the number of shares the user gets.
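
A userShares string in the [user, share] form described above could be split as follows. This is a sketch, not LSF code: struct user_share, parse_user_shares, and shares_for are hypothetical, names are assumed to be shorter than 64 characters, and the reserved words default and others are stored like any other name.

```c
#include <stdio.h>
#include <string.h>

struct user_share {
    char user[64];   /* user name, group name, "default", or "others" */
    int shares;
};

/* Parses up to max "[user, share]" entries; returns how many were read. */
int parse_user_shares(const char *s, struct user_share *out, int max)
{
    int count = 0;
    while (count < max) {
        int used = 0;
        /* The scanset stops the name at ']', ',' or a blank. */
        int n = sscanf(s, " [ %63[^], ] , %d ]%n",
                       out[count].user, &out[count].shares, &used);
        if (n != 2 || used == 0)
            break;               /* end of input or malformed entry */
        s += used;
        count++;
    }
    return count;
}

/* Returns the shares recorded for an exact name, or -1 if it is absent. */
int shares_for(const struct user_share *list, int n, const char *user)
{
    for (int i = 0; i < n; i++)
        if (strcmp(list[i].user, user) == 0)
            return list[i].shares;
    return -1;
}
```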

The value of DEFAULT_HOST_SPEC in the Queue section for this queue in the lsb.queues file.

An LSF resource limit used to limit the number of job slots (processors) a (parallel) job in the queue will use.

A job submitted to this queue must specify a number of processors not greater than this limit.

A list of administrators of the queue.

The users whose names are here are allowed to operate on the jobs in the queue and on the queue itself.

Queue's pre-exec command.

The command is executed before the real batch job is run on the execution host (or on the first host selected for a parallel batch job).

Queue's post-exec command.

The command is run when a job terminates.

Jobs that exit with these values are automatically requeued.

The maximum number of job slots a host can process from this queue, including job slots of dispatched jobs which have not finished yet and reserved slots for some PEND jobs.

This limit controls the number of jobs sent to each host, regardless of whether the host is a uniprocessor or a multiprocessor. The default value for this limit is infinity.

Resource requirement string used to determine eligible hosts for a job.

Number of reserved job slots for pending jobs.

The time used to hold the reserved job slots for a PEND job in this queue.

Remote MultiCluster send-jobs queues to forward jobs to.

Remote MultiCluster receive-jobs queues that can forward to this queue.

Resume threshold conditions for a suspended job in this queue.

Stop threshold conditions for a running job in this queue.

Job starter command for a running job in this queue.

Command configured for the SUSPEND action.

Command configured for the RESUME action.

Command configured for the TERMINATE action.

int queueInfoEnt::sigMap[LSB_SIG_NUM]

Configurable signal mapping.

Preemptive scheduling and preemption policy specified for the queue.

Time period for a remote cluster to schedule a job.

MultiCluster job forwarding model only. Determines how long a MultiCluster job stays pending in the execution cluster before returning to the submission cluster. The remote timeout limit in seconds is:

  • timeout = MAX_RSCHED_TIME * MBD_SLEEP_TIME
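
As a worked example of this formula, the sketch below multiplies the two settings. remote_timeout_seconds is a hypothetical helper; the MBD_SLEEP_TIME value is not a queueInfoEnt field and would have to be supplied by the caller.

```c
/* Remote timeout in seconds: MAX_RSCHED_TIME multiplied by MBD_SLEEP_TIME.
 * The result is widened to long to reduce the risk of int overflow. */
long remote_timeout_seconds(int maxRschedTime, int mbdSleepTime)
{
    return (long)maxRschedTime * (long)mbdSleepTime;
}
```

For instance, with illustrative values of 20 for maxRschedTime and 10 for MBD_SLEEP_TIME, the timeout would be 200 seconds.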

Number of share accounts in the queue.

Only used for queues with a fairshare policy. A share account vector capturing the fairshare information of the users using the queue.

The storage for the array of queueInfoEnt structures will be reused by the next call.

The directory where the checkpoint files are created.

The checkpoint period in minutes.

MultiCluster job forwarding model only.

Specifies the MultiCluster pending job limit for a receive-jobs queue. This represents the maximum number of MultiCluster import jobs that can be pending in the queue; once the limit has been reached, the queue stops accepting jobs from remote clusters.

int queueInfoEnt::defLimits[LSF_RLIM_NLIMITS]

The default (soft) resource limits for all jobs submitted to this queue (see getrlimit() and lsb.queues).

The maximum number of jobs allowed to be dispatched together in one job chunk.

Must be a positive integer greater than 1.

The minimum number of job slots (processors) that a job in the queue will use.

The default (soft) limit on the number of job slots (processors) that a job in the queue will use.

The list of queues for cross-queue fairshare.

Default external scheduling for the queue.

Mandatory external scheduling options for the queue.

Share of job slots for queue-based fairshare.

Represents the percentage of running jobs (job slots) in use from the queue. SLOT_SHARE must be greater than zero (0) and less than or equal to 100.

The sum of SLOT_SHARE for all queues in the pool does not need to be 100%. It can be more or less, depending on your needs.

Name of the pool of job slots the queue belongs to for queue-based fairshare.

A queue can only belong to one pool. All queues in the pool must share the same set of hosts.

Specify any ASCII string up to 60 chars long. You can use letters, digits, underscores (_) or dashes (-). You cannot use blank spaces.
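
The constraints above on SLOT_SHARE and on the pool name can be checked as in the following sketch. Both function names are hypothetical, and rejecting an empty pool name is an assumption not stated in the text.

```c
#include <ctype.h>
#include <stdbool.h>
#include <string.h>

/* SLOT_SHARE must be greater than 0 and at most 100. */
bool slot_share_is_valid(int slotShare)
{
    return slotShare > 0 && slotShare <= 100;
}

/* Pool names: at most 60 characters drawn from letters, digits,
 * underscores, and dashes. An empty name is rejected (assumption). */
bool slot_pool_name_is_valid(const char *name)
{
    size_t len = strlen(name);
    if (len == 0 || len > 60)
        return false;
    for (size_t i = 0; i < len; i++) {
        unsigned char c = (unsigned char)name[i];
        if (!isalnum(c) && c != '_' && c != '-')
            return false;
    }
    return true;
}
```

isalnum() follows the current locale; in the default "C" locale it accepts only ASCII letters and digits.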

Specifies a threshold for job underrun exception handling.

If a job exits before the specified number of minutes, LSF invokes LSF_SERVERDIR/eadmin to trigger the action for a job underrun exception.

Specifies a threshold for job overrun exception handling.

If a job runs longer than the specified run time, LSF invokes LSF_SERVERDIR/eadmin to trigger the action for a job overrun exception.

Specifies a threshold for idle job exception handling.

The value should be a number between 0.0 and 1.0 representing CPU time/runtime. If the job idle factor is less than the specified threshold, LSF invokes LSF_SERVERDIR/eadmin to trigger the action for a job idle exception.
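
The idle test above compares CPU time divided by run time against the threshold. A minimal sketch, assuming both times are in the same unit; job_is_idle is a hypothetical helper, and treating a job with no run time as not idle is an assumption.

```c
#include <stdbool.h>

/* Idle factor as described above: CPU time divided by run time. */
double idle_factor(double cpuTime, double runTime)
{
    return cpuTime / runTime;
}

/* True when the idle factor falls below the idleCond threshold. */
bool job_is_idle(double cpuTime, double runTime, float idleCond)
{
    if (runTime <= 0.0)
        return false;   /* assumption: no run time yet, so not idle */
    return idle_factor(cpuTime, runTime) < idleCond;
}
```

For example, a job that has used 6 seconds of CPU time over 600 seconds of run time has an idle factor of 0.01, which is below a threshold of 0.02.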

The number of underrun jobs in the queue.

The number of overrun jobs in the queue.

The number of idle jobs in the queue.

Specifies the amount of time before a job control action occurs that a job warning action is to be taken.

For example, 2 minutes before the job reaches its run time limit or termination deadline, or before the queue's run window closes, a URG signal is sent to the job.

Job action warning time is not normalized.

A job action warning time must be specified with a job warning action in order for job warning to take effect.

Specifies the job action to be taken before a job control action occurs.

For example, 2 minutes before the job reaches its run time limit or termination deadline, or before the queue's run window closes, a URG signal is sent to the job.

A job warning action must be specified with a job action warning time in order for job warning to take effect.

If specified, LSF sends the warning action to the job before the actual control action is taken. This allows the job time to save its result before being terminated by the job control action.

You can specify actions similar to the JOB_CONTROLS queue level parameter: send a signal, invoke a command, or checkpoint the job.

AdminAction - queue control message.

Accept resource request.

Limit of running session scheduler jobs.

cpu_req for the service partition of the session scheduler.

Indicates whether the queue is willing to donate or borrow.

The maximum number of hosts to lend.

The grace period to lend/return idle hosts.

Number of CPUs required by CPU provision.

Number of CPUs actually allocated.

Number of CPUs borrowed.

Number of CPUs lent.

Scheduling granularity, in milliseconds.

The grace period for stopping session scheduler tasks.

Minimum number of SSMs.

Maximum number of SSMs.

Number of allocated slots.

Service preemption policy.

Dynamic CPU provision status.

The minimum time for preemption and backfill, in seconds.

List of queues defined in a queue group for absolute priority scheduling (APS) across multiple queues.

The number of calculation factors for absolute priority scheduling (APS).

List of calculation factors for absolute priority scheduling (APS).

The mapping of factors to subfactors for absolute priority scheduling (APS).

The mapping of factors to their long names for absolute priority scheduling (APS).

Maximum number of times a job can be preempted.

Maximum number of pre-exec retries.

Maximum number of pre-exec retries for the local cluster.

Maximum number of times a job can be requeued.

Use Linux-PAM.

Compute unit type.

A string specified in EXCLUSIVE=CU[<string>].

Resource reservation limit.

struct fsFactors queueInfoEnt::fairFactors

Fairshare factors.

Used together with automatic scanning through the list of hosts to calculate total usable slots for one slot pool.

The smaller of the two values takes effect and is then used in the slot distribution calculation.

When enabled, LSF first dispatches jobs strictly according to the rate defined for each queue in the slot pool, then dispatches more jobs with FCFS policy if slots remain in the slot pool.

Uninterrupted running time (minutes) before a job can be preempted.

Maximum accumulated preemption time (minutes).

Second queue attributes field.

The grace period before preemption.

This variable has a similar meaning to imptJobBklg, except that the limit is imposed on the number of pending slots rather than the number of pending jobs.

Maximum number of slots that can be preempted.

Jobs with these exit values are treated as done successfully.

Stores the count of discrete proclimit values specified in the queue.

Stores the discrete proclimit values specified in the queue.

Maximum number of window instances allowed for a job.

POE job network requirement string.

If the queue participates in a global fairshare policy, this variable is the name of the global fairshare policy.

Otherwise this variable is NULL.

Target CPU frequency.

An LSF resource limit used to limit the number of hosts a (parallel) job in the queue will use.

A blank-separated list of job sizes to which a parallel job in this queue may be dispatched.

The number of items in the JOB_SIZE_LIST.

Queue's host-based pre-exec command.

Queue's host-based post-exec command.

The action the queue takes when a job's pre-exec retry count reaches the limit.



     
Date Modified: 16 Jul 2014

Copyright © 1994-2014 International Business Machines Corp.
US Government Users Restricted Rights - Use, duplication or disclosure
restricted by GSA ADP Schedule Contract with IBM Corp.