queueInfoEnt Struct Reference

queueInfoEnt queue information entry.


Data Fields
char *	queue
char *	description
int	priority
short	nice
char *	userList
char *	hostList
char *	hostStr
int	nIdx
float *	loadSched
float *	loadStop
int	userJobLimit
float	procJobLimit
char *	windows
int	rLimits [LSF_RLIM_NLIMITS]
char *	hostSpec
int	qAttrib
int	qStatus
int	maxJobs
int	numJobs
int	numPEND
int	numRUN
int	numSSUSP
int	numUSUSP
int	mig
int	schedDelay
int	acceptIntvl
char *	windowsD
char *	nqsQueues
char *	userShares
char *	defaultHostSpec
int	procLimit
char *	admins
char *	preCmd
char *	postCmd
char *	requeueEValues
int	hostJobLimit
char *	resReq
int	numRESERVE
int	slotHoldTime
char *	sndJobsTo
char *	rcvJobsFrom
char *	resumeCond
char *	stopCond
char *	jobStarter
char *	suspendActCmd
char *	resumeActCmd
char *	terminateActCmd
int	sigMap [LSB_SIG_NUM]
char *	preemption
int	maxRschedTime
int	numOfSAccts
struct shareAcctInfoEnt *	shareAccts
char *	chkpntDir
int	chkpntPeriod
int	imptJobBklg
int	defLimits [LSF_RLIM_NLIMITS]
int	chunkJobSize
int	minProcLimit
int	defProcLimit
char *	fairshareQueues
char *	defExtSched
char *	mandExtSched
int	slotShare
char *	slotPool
int	underRCond
int	overRCond
float	idleCond
int	underRJobs
int	overRJobs
int	idleJobs
int	warningTimePeriod
char *	warningAction
char *	qCtrlMsg
char *	acResReq
int	symJobLimit
char *	cpuReq
int	proAttr
int	lendLimit
int	hostReallocInterval
int	numCPURequired
int	numCPUAllocated
int	numCPUBorrowed
int	numCPULent
int	schGranularity
int	symTaskGracePeriod
int	minOfSsm
int	maxOfSsm
int	numOfAllocSlots
char *	servicePreemption
int	provisionStatus
int	minTimeSlice
char *	queueGroup
int	numApsFactors
struct apsFactorInfo *	apsFactorInfoList
struct apsFactorMap *	apsFactorMaps
struct apsLongNameMap *	apsLongNames
int	maxJobPreempt
int	maxPreExecRetry
int	localMaxPreExecRetry
int	maxJobRequeue
int	usePam
int	cu_type_exclusive
char *	cu_str_exclusive
char *	resRsvLimit
struct fsFactors	fairFactors
int	maxSlotsInPool
int	usePriorityInPool
int	noPreemptInterval
int	maxTotalTimePreempt
int	qAttrib2
int	preemptDelayTime
int	imptSlotBklg
int	maxSlotsPreempted
char *	successEValues
int	nDiscreteProcLimits
int *	discreteProcLimits
int	maxProtocolInstance
char *	networkReq
char *	globalFairshare
int	cpuFrequency
int	maxPerJobHostLimit
int *	jobSizeList
char *	hostPreCmd
char *	hostPostCmd
int	localMaxPreExecRetryAction

Detailed Description

queueInfoEnt queue information entry.

Field Documentation

char* queueInfoEnt::queue

The name of the queue.

char* queueInfoEnt::description

Describes the typical use of the queue.

int queueInfoEnt::priority

Defines the priority of the queue.

This determines the order in which the job queues are searched at job dispatch time: queues with higher priority values are searched first. (This is contrary to UNIX process priority ordering.)
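
As an illustration of this ordering, the following is a minimal sketch that sorts queues so that higher priority values come first. The struct queue_prio type and the by_priority_desc comparator are hypothetical stand-ins for this example; they are not part of the LSF API.

```c
#include <stdlib.h>

/* Hypothetical stand-in holding only the two fields this sketch needs. */
struct queue_prio {
    const char *queue; /* queue name */
    int priority;      /* queue priority; higher is searched first */
};

/* qsort comparator: orders entries by descending priority value. */
static int by_priority_desc(const void *a, const void *b)
{
    const struct queue_prio *qa = a;
    const struct queue_prio *qb = b;

    /* Negative when qa has the higher priority, so it sorts first. */
    return (qa->priority < qb->priority) - (qa->priority > qb->priority);
}
```

Calling qsort(queues, n, sizeof queues[0], by_priority_desc) on an array of such entries leaves the queue with the largest priority value at index 0.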

short queueInfoEnt::nice

Defines the nice value at which jobs in this queue will be run.

char* queueInfoEnt::userList

A blank-separated list of names of users allowed to submit jobs to this queue.

char* queueInfoEnt::hostList

A blank-separated list of names of hosts to which jobs in this queue may be dispatched.

char* queueInfoEnt::hostStr

Original HOSTS string in case "-" is used.

int queueInfoEnt::nIdx

The number of load indices in the loadSched and loadStop arrays.

float* queueInfoEnt::loadSched

The queue and host loadSched and loadStop arrays control batch job dispatch, suspension, and resumption.

The values in the loadSched array specify thresholds for the corresponding load indices. Only if the current values of all specified load indices of a host are within (below or above, depending on the meaning of the load index) the corresponding thresholds of this queue, will jobs in this queue be dispatched to the host. The same conditions are used to resume jobs dispatched from this queue that have been suspended on the host.

float* queueInfoEnt::loadStop

The values in the loadStop array specify the thresholds for job suspension.

If any of the current load index values of a host goes beyond a queue's threshold, jobs from the queue will be suspended.

For an explanation of the fields in the loadSched and loadStop arrays, see lsb_hostinfo.
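
The dispatch and suspension rules described for loadSched and loadStop can be sketched roughly as follows. This is an illustration only: the load_dir enum and the helper names are hypothetical, the caller is assumed to know the direction of each load index, and every threshold is assumed to be specified (handling of unspecified thresholds is not shown).

```c
/* Direction of a load index. For an increasing index (such as a run-queue
 * length) a higher value means a busier host; for a decreasing index (such
 * as free memory) a lower value means a busier host. */
enum load_dir { LOAD_INCREASING, LOAD_DECREASING };

/* Returns 1 if value has gone beyond threshold in the "busier" direction. */
static int beyond(float value, float threshold, enum load_dir dir)
{
    return dir == LOAD_INCREASING ? value > threshold : value < threshold;
}

/* Dispatch or resume: every index must be within its loadSched threshold. */
static int can_dispatch(const float *load, const float *loadSched,
                        const enum load_dir *dir, int nIdx)
{
    for (int i = 0; i < nIdx; i++)
        if (beyond(load[i], loadSched[i], dir[i]))
            return 0;
    return 1;
}

/* Suspend: a single index beyond its loadStop threshold is enough. */
static int should_suspend(const float *load, const float *loadStop,
                          const enum load_dir *dir, int nIdx)
{
    for (int i = 0; i < nIdx; i++)
        if (beyond(load[i], loadStop[i], dir[i]))
            return 1;
    return 0;
}
```

Note the asymmetry: dispatch requires all indices to be within their thresholds, while suspension is triggered by any one index.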

int queueInfoEnt::userJobLimit

Per-user limit on the number of jobs that can be dispatched from this queue and executed concurrently.

float queueInfoEnt::procJobLimit

Per-processor limit on the number of jobs that can be dispatched from this queue and executed concurrently.

char* queueInfoEnt::windows

A blank-separated list of time windows describing the run window of the queue.

When a queue's run window is closed, no job from this queue will be dispatched. When the run window closes, any running jobs from this queue will be suspended until the run window reopens, when they will be resumed. The default is no restriction, or always open (i.e., 24 hours a day, seven days a week).

A time window has the format begin_time-end_time. Time is specified in the format [day:]hour[:minute], where all fields are numbers in their respective legal ranges: 0(Sunday)-6 for day, 0-23 for hour, and 0-59 for minute. The default value for minute is 0 (on the hour); the default value for day is every day of the week. The begin_time and end_time of a window are separated by `-', with no white space (i.e., blank or TAB) in between. Both begin_time and end_time must be present for a time window.

Note that this run window applies only to batch jobs; interactive jobs scheduled by the LSF Load Information Manager (LIM) are controlled by another set of run windows.
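
The begin_time-end_time format described above could be parsed along the following lines. This is a sketch, not LSF's own parser: struct win_time, parse_win_time and parse_window are hypothetical names, and the code assumes that a two-field time such as 8:30 means hour:minute rather than day:hour.

```c
#include <stdio.h>
#include <string.h>

/* One endpoint of a time window. */
struct win_time {
    int day;    /* 0 (Sunday) to 6, or -1 when omitted (every day) */
    int hour;   /* 0 to 23 */
    int minute; /* 0 to 59; 0 when omitted */
};

/* Parse "[day:]hour[:minute]". Returns 0 on success, -1 on error. */
static int parse_win_time(const char *s, struct win_time *t)
{
    int a = 0, b = 0, c = 0, n;

    t->day = -1;
    t->minute = 0;
    n = 0;
    if (sscanf(s, "%d:%d:%d%n", &a, &b, &c, &n) == 3 && n > 0 && s[n] == '\0') {
        t->day = a; t->hour = b; t->minute = c;
    } else {
        n = 0;
        if (sscanf(s, "%d:%d%n", &a, &b, &n) == 2 && n > 0 && s[n] == '\0') {
            t->hour = a; t->minute = b; /* assumed to be hour:minute */
        } else {
            n = 0;
            if (sscanf(s, "%d%n", &a, &n) == 1 && n > 0 && s[n] == '\0')
                t->hour = a;
            else
                return -1;
        }
    }
    if (t->day < -1 || t->day > 6 || t->hour < 0 || t->hour > 23 ||
        t->minute < 0 || t->minute > 59)
        return -1;
    return 0;
}

/* Parse one "begin_time-end_time" window; both halves must be present. */
static int parse_window(const char *s, struct win_time *begin,
                        struct win_time *end)
{
    char buf[32];
    const char *dash = strchr(s, '-');
    size_t len;

    if (dash == NULL)
        return -1;
    len = (size_t)(dash - s);
    if (len == 0 || len >= sizeof buf)
        return -1;
    memcpy(buf, s, len);
    buf[len] = '\0';
    if (parse_win_time(buf, begin) != 0)
        return -1;
    return parse_win_time(dash + 1, end);
}
```

To handle a whole windows string, a caller would first split it on blanks and pass each piece to parse_window.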

int queueInfoEnt::rLimits[LSF_RLIM_NLIMITS]

The per-process UNIX hard resource limits for all jobs submitted to this queue (see getrlimit() and lsb.queues).

The default values for the resource limits are unlimited, indicated by -1. The constants used to index the rLimits array and the corresponding resource limits are listed below.
LSF_RLIMIT_CPU (CPULIMIT)
LSF_RLIMIT_FSIZE (FILELIMIT)
LSF_RLIMIT_DATA (DATALIMIT)
LSF_RLIMIT_STACK (STACKLIMIT)
LSF_RLIMIT_CORE (CORELIMIT)
LSF_RLIMIT_RSS (MEMLIMIT)
LSF_RLIMIT_RUN (RUNLIMIT)
LSF_RLIMIT_PROCESS (PROCESSLIMIT)
LSF_RLIMIT_SWAP (SWAPLIMIT)
LSF_RLIMIT_THREAD
LSF_RLIMIT_NOFILE
LSF_RLIMIT_OPENMAX
LSF_RLIMIT_VMEM
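
Because -1 stands for "unlimited", code that displays these limits usually needs to special-case it. The helper below is a minimal sketch of that idea; format_limit is a hypothetical name and not part of the LSF API.

```c
#include <stdio.h>

/* Format one entry of a resource limit array into buf.
 * A value of -1 is rendered as "unlimited"; any other value is printed
 * as a plain integer. Returns buf for convenience. */
static const char *format_limit(int value, char *buf, size_t size)
{
    if (value == -1)
        snprintf(buf, size, "unlimited");
    else
        snprintf(buf, size, "%d", value);
    return buf;
}
```

Given a queueInfoEnt pointer q, a call such as format_limit(q->rLimits[LSF_RLIMIT_CPU], buf, sizeof buf) would then yield the text to display for the CPU limit.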

char* queueInfoEnt::hostSpec

A host name or host model name.

If the queue CPULIMIT or RUNLIMIT gives a host specification, hostSpec will be that specification. Otherwise, if defaultHostSpec (see below) is not NULL, hostSpec will be defaultHostSpec. Otherwise, if DEFAULT_HOST_SPEC is defined in the lsb.params file, (see lsb.params), hostSpec will be this value. Otherwise, hostSpec will be the name of the host with the largest CPU factor in the cluster.
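
The fallback chain above amounts to taking the first value that is set. The sketch below expresses only that precedence; resolve_host_spec and its parameter names are hypothetical, and each argument is assumed to be NULL when the corresponding setting is absent.

```c
#include <stddef.h>

/* Returns the first non-NULL candidate, in the documented order:
 * 1. host specification given with CPULIMIT or RUNLIMIT,
 * 2. the queue's defaultHostSpec,
 * 3. DEFAULT_HOST_SPEC from lsb.params,
 * 4. the host with the largest CPU factor in the cluster. */
static const char *resolve_host_spec(const char *limitSpec,
                                     const char *queueDefault,
                                     const char *paramsDefault,
                                     const char *fastestHost)
{
    if (limitSpec != NULL)
        return limitSpec;
    if (queueDefault != NULL)
        return queueDefault;
    if (paramsDefault != NULL)
        return paramsDefault;
    return fastestHost;
}
```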

int queueInfoEnt::qAttrib

The attributes of the queue.

int queueInfoEnt::qStatus

The status of the queue.

int queueInfoEnt::maxJobs

The maximum number of jobs dispatched by the queue and not yet finished.

int queueInfoEnt::numJobs

Number of jobs in the queue, including pending, running, and suspended jobs.

int queueInfoEnt::numPEND

Number of pending jobs in the queue.

int queueInfoEnt::numRUN

Number of running jobs in the queue.

int queueInfoEnt::numSSUSP

Number of system suspended jobs in the queue.

int queueInfoEnt::numUSUSP

Number of user suspended jobs in the queue.

int queueInfoEnt::mig

The queue migration threshold in minutes.

int queueInfoEnt::schedDelay

The number of seconds that a new job waits, before being scheduled.

A value of zero (0) means the job is scheduled without any delay.

int queueInfoEnt::acceptIntvl

The number of seconds to wait after dispatching a job to a host before dispatching a second job to the same host.

char* queueInfoEnt::windowsD

A blank-separated list of time windows describing the dispatch window of the queue.

When a queue's dispatch window is closed, no job from this queue will be dispatched. The default is no restriction, or always open (i.e., 24 hours a day, seven days a week).

For the time window format, see windows (above).

char* queueInfoEnt::nqsQueues

A blank-separated list of queue specifiers.

Each queue specifier is of the form queue@host where host is an NQS host name and queue is the name of a queue on that host.

char* queueInfoEnt::userShares

A blank-separated list of user shares.

Each share is of the form [user, share] where user is a user name, a user group name, the reserved word default or the reserved word others, and share is the number of shares the user gets.
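
A string in this form could be split into its entries as sketched below. The struct user_share type and the parse_user_shares function are hypothetical, and the exact spacing that LSF emits inside the brackets is an assumption, so the parser tolerates optional blanks around the comma.

```c
#include <stdio.h>
#include <string.h>

/* One parsed [user, share] entry. */
struct user_share {
    char user[64]; /* user name, group name, "default" or "others" */
    int share;     /* number of shares */
};

/* Parses up to max entries from s into out.
 * Returns the number of entries parsed, or -1 on malformed input.
 * Entries beyond max are left unparsed. */
static int parse_user_shares(const char *s, struct user_share *out, int max)
{
    int count = 0;

    while (count < max) {
        int n = 0;

        s += strspn(s, " \t"); /* skip blanks between entries */
        if (*s == '\0')
            break;
        /* %n is only reached if the closing bracket matched. */
        if (sscanf(s, "[ %63[^], \t] , %d ]%n",
                   out[count].user, &out[count].share, &n) != 2 || n == 0)
            return -1;
        s += n;
        count++;
    }
    return count;
}
```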

char* queueInfoEnt::defaultHostSpec

The value of DEFAULT_HOST_SPEC in the Queue section for this queue in the lsb.queues file.

int queueInfoEnt::procLimit

An LSF resource limit used to limit the number of job slots (processors) a (parallel) job in the queue will use.

A job submitted to this queue must specify a number of processors not greater than this limit.

char* queueInfoEnt::admins

A list of administrators of the queue.

The users whose names are here are allowed to operate on the jobs in the queue and on the queue itself.

char* queueInfoEnt::preCmd

Queue's pre-exec command.

The command is executed before the real batch job is run on the execution host (or on the first host selected for a parallel batch job).

char* queueInfoEnt::postCmd

Queue's post-exec command.

The command is run when a job terminates.

char* queueInfoEnt::requeueEValues

Jobs that exit with these values are automatically requeued.

int queueInfoEnt::hostJobLimit

The maximum number of job slots a host can process from this queue, including job slots of dispatched jobs which have not finished yet and reserved slots for some PEND jobs.

This limit controls the number of jobs sent to each host, regardless of whether the host is a uniprocessor or a multiprocessor host. The default value for this limit is infinity.

char* queueInfoEnt::resReq

Resource requirement string used to determine eligible hosts for a job.

int queueInfoEnt::numRESERVE

Number of reserved job slots for pending jobs.

int queueInfoEnt::slotHoldTime

The time used to hold the reserved job slots for a PEND job in this queue.

char* queueInfoEnt::sndJobsTo

Remote MultiCluster send-jobs queues to forward jobs to.

char* queueInfoEnt::rcvJobsFrom

Remote MultiCluster receive-jobs queues that can forward to this queue.

char* queueInfoEnt::resumeCond

Resume threshold conditions for a suspended job in this queue.

char* queueInfoEnt::stopCond

Stop threshold conditions for a running job in this queue.

char* queueInfoEnt::jobStarter

Job starter command for a running job in this queue.

char* queueInfoEnt::suspendActCmd

Command configured for the SUSPEND action.

char* queueInfoEnt::resumeActCmd

Command configured for the RESUME action.

char* queueInfoEnt::terminateActCmd

Command configured for the TERMINATE action.

int queueInfoEnt::sigMap[LSB_SIG_NUM]

Configurable signal mapping.

char* queueInfoEnt::preemption

Preemptive scheduling and preemption policy specified for the queue.

int queueInfoEnt::maxRschedTime

Time period for a remote cluster to schedule a job.

MultiCluster job forwarding model only. Determines how long a MultiCluster job stays pending in the execution cluster before returning to the submission cluster. The remote timeout limit in seconds is:

timeout = MAX_RSCHED_TIME * MBD_SLEEP_TIME
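
As a worked instance of this formula, the sketch below multiplies the two settings. The function name remote_timeout_seconds is hypothetical, and the values in the usage note are illustrative, not defaults taken from this reference.

```c
/* Remote timeout in seconds: MAX_RSCHED_TIME multiplied by MBD_SLEEP_TIME.
 * The product is computed as long to avoid int overflow on large inputs. */
static long remote_timeout_seconds(int maxRschedTime, int mbdSleepTime)
{
    return (long)maxRschedTime * (long)mbdSleepTime;
}
```

For example, with maxRschedTime set to 20 and an MBD_SLEEP_TIME of 60 seconds, the job would stay pending in the execution cluster for up to 1200 seconds.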

int queueInfoEnt::numOfSAccts

Number of share accounts in the queue.

struct shareAcctInfoEnt* queueInfoEnt::shareAccts

A share account vector capturing the fairshare information of the users using the queue. Only used for queues with a fairshare policy.

The storage for the array of queueInfoEnt structures will be reused by the next call.

char* queueInfoEnt::chkpntDir

The directory where the checkpoint files are created.

int queueInfoEnt::chkpntPeriod

The checkpoint period in minutes.

int queueInfoEnt::imptJobBklg

MultiCluster job forwarding model only.

Specifies the MultiCluster pending job limit for a receive-jobs queue. This represents the maximum number of MultiCluster import jobs that can be pending in the queue; once the limit has been reached, the queue stops accepting jobs from remote clusters.

int queueInfoEnt::defLimits[LSF_RLIM_NLIMITS]

The default (soft) resource limits for all jobs submitted to this queue (see getrlimit() and lsb.queues).

int queueInfoEnt::chunkJobSize

The maximum number of jobs allowed to be dispatched together in one job chunk.

Must be a positive integer greater than 1.

int queueInfoEnt::minProcLimit

The minimum number of job slots (processors) that a job in the queue will use.

int queueInfoEnt::defProcLimit

The default (soft) limit on the number of job slots (processors) that a job in the queue will use.

char* queueInfoEnt::fairshareQueues

The list of queues for cross-queue fairshare.

char* queueInfoEnt::defExtSched

Default external scheduling for the queue.

char* queueInfoEnt::mandExtSched

Mandatory external scheduling options for the queue.

int queueInfoEnt::slotShare

Share of job slots for queue-based fairshare.

Represents the percentage of running jobs (job slots) in use from the queue. SLOT_SHARE must be greater than zero (0) and less than or equal to 100.

The sum of SLOT_SHARE for all queues in the pool does not need to be 100%. It can be more or less, depending on your needs.

char* queueInfoEnt::slotPool

Name of the pool of job slots the queue belongs to for queue-based fairshare.

A queue can only belong to one pool. All queues in the pool must share the same set of hosts.

Specify any ASCII string up to 60 chars long. You can use letters, digits, underscores (_) or dashes (-). You cannot use blank spaces.
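
The naming rule above can be checked with a short validator such as the following sketch. valid_pool_name is a hypothetical helper, and rejecting the empty string is an assumption of this example rather than something stated here.

```c
#include <string.h>

/* Returns 1 if name is 1 to 60 characters long and consists only of
 * ASCII letters, digits, underscores or dashes; returns 0 otherwise. */
static int valid_pool_name(const char *name)
{
    size_t len = strlen(name);

    if (len == 0 || len > 60)
        return 0;
    for (size_t i = 0; i < len; i++) {
        char ch = name[i];
        int ok = (ch >= 'a' && ch <= 'z') || (ch >= 'A' && ch <= 'Z') ||
                 (ch >= '0' && ch <= '9') || ch == '_' || ch == '-';

        if (!ok)
            return 0; /* blanks and all other characters are rejected */
    }
    return 1;
}
```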

int queueInfoEnt::underRCond

Specifies a threshold for job underrun exception handling.

If a job exits before the specified number of minutes, LSF invokes LSF_SERVERDIR/eadmin to trigger the action for a job underrun exception.

int queueInfoEnt::overRCond

Specifies a threshold for job overrun exception handling.

If a job runs longer than the specified run time, LSF invokes LSF_SERVERDIR/eadmin to trigger the action for a job overrun exception.

float queueInfoEnt::idleCond

Specifies a threshold for idle job exception handling.

The value should be a number between 0.0 and 1.0 representing CPU time/runtime. If the job idle factor is less than the specified threshold, LSF invokes LSF_SERVERDIR/eadmin to trigger the action for a job idle exception.
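
The idle test described above can be sketched as a comparison of the CPU time to run time ratio against the threshold. The function is_idle is a hypothetical name, and treating a job with no accumulated run time as not idle is an assumption made here to avoid dividing by zero.

```c
/* Returns 1 if the job idle factor (cpuTime / runTime) is below idleCond,
 * 0 otherwise. Both times must be in the same unit. */
static int is_idle(double cpuTime, double runTime, double idleCond)
{
    if (runTime <= 0.0)
        return 0; /* assumption: no run time yet, so not considered idle */
    return (cpuTime / runTime) < idleCond;
}
```

For instance, a job that has used 6 seconds of CPU over 600 seconds of run time has an idle factor of 0.01, which is below a threshold of 0.02.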

int queueInfoEnt::underRJobs

The number of underrun jobs in the queue.

int queueInfoEnt::overRJobs

The number of overrun jobs in the queue.

int queueInfoEnt::idleJobs

The number of idle jobs in the queue.

int queueInfoEnt::warningTimePeriod

Specifies the amount of time before a job control action occurs that a job warning action is to be taken.

For example, 2 minutes before the job reaches run time limit or termination deadline, or the queue's run window is closed, an URG signal is sent to the job.

Job action warning time is not normalized.

A job action warning time must be specified with a job warning action in order for job warning to take effect.

char* queueInfoEnt::warningAction

Specifies the job action to be taken before a job control action occurs.

For example, 2 minutes before the job reaches run time limit or termination deadline, or the queue's run window is closed, an URG signal is sent to the job.

A job warning action must be specified with a job action warning time in order for job warning to take effect.

If specified, LSF sends the warning action to the job before the actual control action is taken. This allows the job time to save its result before being terminated by the job control action.

You can specify actions similar to the JOB_CONTROLS queue level parameter: send a signal, invoke a command, or checkpoint the job.

char* queueInfoEnt::qCtrlMsg

AdminAction - queue control message.

char* queueInfoEnt::acResReq

Accept resource request.

int queueInfoEnt::symJobLimit

Limit of running session scheduler jobs.

char* queueInfoEnt::cpuReq

The cpu_req for the service partition of the session scheduler.

int queueInfoEnt::proAttr

Indicates whether the queue is willing to donate or borrow.

int queueInfoEnt::lendLimit

The maximum number of hosts to lend.

int queueInfoEnt::hostReallocInterval

The grace period to lend/return idle hosts.

int queueInfoEnt::numCPURequired

Number of CPUs required by CPU provision.

int queueInfoEnt::numCPUAllocated

Number of CPUs actually allocated.

int queueInfoEnt::numCPUBorrowed

Number of CPUs borrowed.

int queueInfoEnt::numCPULent

Number of CPUs lent.

int queueInfoEnt::schGranularity

Scheduling granularity, in milliseconds.

int queueInfoEnt::symTaskGracePeriod

The grace period for stopping session scheduler tasks.

int queueInfoEnt::minOfSsm

Minimum number of SSMs.

int queueInfoEnt::maxOfSsm

Maximum number of SSMs.

int queueInfoEnt::numOfAllocSlots

Number of allocated slots.

char* queueInfoEnt::servicePreemption

Service preemption policy.

int queueInfoEnt::provisionStatus

Dynamic CPU provision status.

int queueInfoEnt::minTimeSlice

The minimum time for preemption and backfill, in seconds.

char* queueInfoEnt::queueGroup

List of queues defined in a queue group for absolute priority scheduling (APS) across multiple queues.

int queueInfoEnt::numApsFactors

The number of calculation factors for absolute priority scheduling (APS).

struct apsFactorInfo* queueInfoEnt::apsFactorInfoList

List of calculation factors for absolute priority scheduling (APS).

struct apsFactorMap* queueInfoEnt::apsFactorMaps

The mapping of factors to subfactors for absolute priority scheduling (APS).

struct apsLongNameMap* queueInfoEnt::apsLongNames

The mapping of factors to their long names for absolute priority scheduling (APS).

int queueInfoEnt::maxJobPreempt

Maximum number of times a job can be preempted.

int queueInfoEnt::maxPreExecRetry

Maximum number of times the pre-exec command is retried.

int queueInfoEnt::localMaxPreExecRetry

Maximum number of times the pre-exec command is retried in the local cluster.

int queueInfoEnt::maxJobRequeue

Maximum number of times a job can be requeued.

int queueInfoEnt::usePam

Use Linux-PAM.

int queueInfoEnt::cu_type_exclusive

Compute unit type.

char* queueInfoEnt::cu_str_exclusive

A string specified in EXCLUSIVE=CU[<string>].

char* queueInfoEnt::resRsvLimit

Resource reservation limit.

struct fsFactors queueInfoEnt::fairFactors

Fairshare factors.

int queueInfoEnt::maxSlotsInPool

Used together with automatic scanning through the list of hosts to calculate total usable slots for one slot pool.

The smaller of the two values takes effect and is then used for the slot distribution calculation.

int queueInfoEnt::usePriorityInPool

When enabled, LSF first dispatches jobs strictly according to the rate defined for each queue in the slot pool, then dispatches more jobs under the FCFS policy if slots are left in the slot pool.

int queueInfoEnt::noPreemptInterval

Uninterrupted running time (minutes) before a job can be preempted.

int queueInfoEnt::maxTotalTimePreempt

Maximum accumulated preemption time (minutes).

int queueInfoEnt::qAttrib2

Second queue attributes field.

int queueInfoEnt::preemptDelayTime

The grace period before preemption.

int queueInfoEnt::imptSlotBklg

This variable is similar in meaning to imptJobBklg, except that the limit applies to the number of pending slots rather than the number of pending jobs.

int queueInfoEnt::maxSlotsPreempted

Maximum number of slots that can be preempted.