The lsf.shared file contains common definitions that are shared by all load sharing clusters defined by lsf.cluster.cluster_name files. This includes lists of cluster names, host types, host models, the special resources available, and external load indices.
o Cluster Section
o HostType Section
o HostModel Section
o Resource Section
Cluster Section Structure
Cluster Section Example
HostType Section Structure
HostType Section Example
HostModel Section Structure
HostModel Section Example
Resource Section Structure
Resource Section Example
(Required) Lists the cluster names recognized by the openlava system
The first line must contain the mandatory keyword ClusterName. The other keyword is optional.
Each subsequent line defines one cluster.
(Required) Defines all cluster names recognized by the openlava system
All cluster names referenced anywhere in the openlava system must be defined here. The file names of cluster-specific configuration files must end with the associated cluster name.
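The naming rule above can be checked mechanically. The following is a minimal sketch, assuming that the cluster name follows a "." separator as in lsf.cluster.cluster_name; the function name and the sample cluster and file names are hypothetical.

```python
def undefined_cluster_files(cluster_names, file_names):
    """Return the file names that do not end with any defined cluster name."""
    return [
        name for name in file_names
        # Assumption: the cluster name is the final dot-separated component.
        if not any(name.endswith("." + cluster) for cluster in cluster_names)
    ]


# Hypothetical cluster names and configuration file names.
clusters = ["cluster1", "cluster2"]
files = ["lsf.cluster.cluster1", "lsf.cluster.cluster2", "lsf.cluster.test"]
print(undefined_cluster_files(clusters, files))
```

Any file reported by this check refers to a cluster that is not listed in the Cluster section.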
By default, the first ten hosts listed in the Host section of lsf.cluster.cluster_name are available to LIMs in remote clusters.
This parameter is useful when LSF_CONFDIR is not shared or replicated.
(Required) Lists the valid host types in the cluster
The first line consists of the mandatory keyword TYPENAME.
Subsequent lines name valid host types.
Host type names are usually based on a combination of the hardware name and operating system. If your site already has a system for naming host types, you can use the same names for openlava.
(Required) Lists models of machines and gives the relative CPU scaling factor for each model
openlava uses the relative CPU scaling factor to normalize the CPU load indices so that jobs are more likely to be sent to faster hosts. The CPU factor affects the calculation of job execution time limits and accounting. Using large or inaccurate values for the CPU factor can cause confusing results when CPU time limits or accounting are used.
The first line consists of the mandatory keywords MODELNAME, CPUFACTOR, and ARCHITECTURE.
Subsequent lines define a model and its CPU factor.
(Reserved for system use only) Indicates automatically detected host models that correspond to the model names.
Though it is not required, you would typically assign a CPU factor of 1.0 to the slowest machine model in your system and higher numbers for the others. For example, for a machine model that executes at twice the speed of your slowest model, a factor of 2.0 should be assigned.
Generally, you first need to identify the distinct host types in your system, such as MIPS and SPARC, and then the machine models within each, such as SparcIPC, Sparc1, Sparc2, and Sparc10.
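As a worked example of choosing CPU factors, the sketch below scales a set of benchmark scores so that the slowest model serves as the baseline with factor 1.0 and a model twice as fast receives 2.0. The function name, model names, and scores are hypothetical and are not measured data.

```python
def cpu_factors(relative_speed, baseline=1.0):
    """Scale each model's speed so the slowest model gets `baseline`."""
    slowest = min(relative_speed.values())
    return {
        model: round(baseline * speed / slowest, 2)
        for model, speed in relative_speed.items()
    }


# Hypothetical benchmark scores (higher is faster).
scores = {"ModelA": 50.0, "ModelB": 100.0, "ModelC": 175.0}
print(cpu_factors(scores))
```

Here ModelB runs at twice the speed of ModelA, the slowest model, so its factor is twice the baseline.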
MODELNAME  CPUFACTOR  ARCHITECTURE
PC400      13.0       (i86pc_400 i686_400)
PC450      13.2       (i86pc_450 i686_450)
Sparc5F    3.0        (SUNWSPARCstation5_2.0_sparc)
Sparc20    4.7        (SUNWSPARCstation20_151_sparc)
Ultra5S    10.3       (SUNWUltra5_270_sparcv9 SUNWUltra510_270_sparcv9)
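To illustrate how the three columns relate, here is a minimal sketch that splits one HostModel line into its model name, CPU factor, and list of architecture strings. The regular expression and function name are illustrative assumptions; this is not the parser openlava itself uses.

```python
import re

# One model name, one CPU factor, then a parenthesized architecture list.
MODEL_LINE = re.compile(r"^(\S+)\s+(\S+)\s+\((.*)\)\s*$")


def parse_host_model(line):
    """Split one HostModel line into (model name, CPU factor, architectures)."""
    match = MODEL_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"not a HostModel line: {line!r}")
    name, factor, archs = match.groups()
    return name, float(factor), archs.split()


lines = [
    "PC400 13.0 (i86pc_400 i686_400)",
    "Sparc5F 3.0 (SUNWSPARCstation5_2.0_sparc)",
]
for line in lines:
    print(parse_host_model(line))
```

A single model name can map to several architecture strings, as the PC400 line shows.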
(Optional) Defines resources.
The first line consists of the keywords. RESOURCENAME and DESCRIPTION are mandatory. The other keywords are optional. Subsequent lines define resources.
The name you assign to the new resource. An arbitrary character string.
o A resource name cannot begin with a number.
o A resource name cannot contain any of the following characters:
: . ( ) [ + - * / ! & | < > @ =
o A resource name cannot be any of the following reserved names:
cpu cpuf io logins ls idle maxmem maxswp maxtmp type model status it mem ncpus ndisks pg r15m r15s r1m swap swp tmp ut
o Resource names are case sensitive
o Resource names can be up to 29 characters in length
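The naming rules listed above can be expressed as a small checker. This is a minimal sketch with hypothetical function and constant names. It compares reserved names exactly because resource names are case sensitive; whether openlava accepts an upper-case variant of a reserved name is not confirmed here.

```python
# Characters and reserved names copied from the rules above.
FORBIDDEN_CHARS = set(":.()[+-*/!&|<>@=")
RESERVED_NAMES = set(
    "cpu cpuf io logins ls idle maxmem maxswp maxtmp type model status it mem "
    "ncpus ndisks pg r15m r15s r1m swap swp tmp ut".split()
)
MAX_LENGTH = 29


def resource_name_problems(name):
    """Return a list of rule violations; an empty list means the name passes."""
    problems = []
    if name[:1].isdigit():
        problems.append("begins with a number")
    bad = sorted(set(name) & FORBIDDEN_CHARS)
    if bad:
        problems.append("contains forbidden characters: " + " ".join(bad))
    if name in RESERVED_NAMES:  # exact, case-sensitive comparison
        problems.append("is a reserved name")
    if len(name) > MAX_LENGTH:
        problems.append(f"longer than {MAX_LENGTH} characters")
    return problems


for candidate in ["static_sh1", "1fast", "big-mem", "swap"]:
    print(candidate, resource_name_problems(candidate))
```

Returning a list of problems rather than a single true or false value lets a caller report every violated rule at once.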
The type of resource:
o Boolean--Resources that have a value of 1 on hosts that have the resource and 0 otherwise.
o Numeric--Resources that take numerical values, such as all the load indices, number of processors on a host, or host CPU factor.
o String--Resources that take string values, such as host type, host model, or host status.
If TYPE is not given, the default type is Boolean.
Brief description of the resource.
The information defined here will be returned by the ls_info() API call or printed out by the lsinfo command as an explanation of the meaning of the resource.
Applies to numeric resources only.
If a larger value means greater load, INCREASING should be defined as Y. If a smaller value means greater load, INCREASING should be defined as N.
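The effect of INCREASING on how a value is read can be sketched as follows. The function, host names, and values are hypothetical, and the sketch only illustrates the rule above; it does not show how openlava schedules jobs.

```python
def least_loaded(values, increasing):
    """Pick the host with the lightest load for one numeric resource.

    `values` maps host name to the resource value. `increasing` is the
    INCREASING setting: "Y" if larger values mean greater load, "N" if
    smaller values mean greater load.
    """
    pick = min if increasing == "Y" else max
    return pick(values, key=values.get)


# Hypothetical run-queue lengths: larger means more load (INCREASING = Y).
print(least_loaded({"hostA": 0.2, "hostB": 1.5}, "Y"))
# Hypothetical free memory in MB: smaller means more load (INCREASING = N).
print(least_loaded({"hostA": 512, "hostB": 2048}, "N"))
```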
Optional. Applies to dynamic resources only.
Defines the time interval (in seconds) at which the resource is sampled by the ELIM.
If INTERVAL is defined for a numeric resource, it becomes an external load index.
If INTERVAL is not given, the resource is considered static.
Applies to numeric shared resources only, such as floating licenses.
Controls whether openlava releases the resource when a job using the resource is suspended. When a job using a shared resource is suspended, the resource is held or released by the job depending on the configuration of this parameter.
Specify N to hold the resource, or specify Y to release the resource.
RESOURCENAME  TYPE     INTERVAL  INCREASING  RELEASE  DESCRIPTION
mips          Boolean  ()        ()          ()       (MIPS architecture)
dec           Boolean  ()        ()          ()       (DECStation system)
sparc         Boolean  ()        ()          ()       (SUN SPARC)
bsd           Boolean  ()        ()          ()       (BSD unix)
hpux          Boolean  ()        ()          ()       (HP-UX UNIX)
aix           Boolean  ()        ()          ()       (AIX UNIX)
solaris       Boolean  ()        ()          ()       (SUN SOLARIS)
myResource    String   ()        ()          ()       (MIPS architecture)
static_sh1    Numeric  ()        N           ()       (static)
external_1    Numeric  15        Y           ()       (external)
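The example lines can be read back into structured records to see how the TYPE and INTERVAL rules combine. The sketch below assumes that "()" marks a field left unset, as the example suggests. The regular expression, function name, and dictionary keys are illustrative, and this is not openlava's own parser.

```python
import re

# Five whitespace-separated fields, then a parenthesized description.
RESOURCE_LINE = re.compile(
    r"^(\S+)\s+(\S+)\s+(\S+)\s+(\S+)\s+(\S+)\s+\((.*)\)\s*$"
)


def blank(field):
    """Treat "()" as an unset field (an assumption based on the example)."""
    return None if field == "()" else field


def parse_resource(line):
    """Parse one Resource line and classify it as static or dynamic."""
    match = RESOURCE_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"not a Resource line: {line!r}")
    name, rtype, interval, increasing, release, description = match.groups()
    rtype = blank(rtype) or "Boolean"  # TYPE defaults to Boolean
    interval = blank(interval)
    return {
        "name": name,
        "type": rtype,
        "interval": int(interval) if interval is not None else None,
        "increasing": blank(increasing),
        "release": blank(release),
        "description": description,
        # No INTERVAL means the resource is considered static.
        "static": interval is None,
        # A numeric resource with an INTERVAL is an external load index.
        "external_load_index": rtype == "Numeric" and interval is not None,
    }


for line in [
    "static_sh1 Numeric () N () (static)",
    "external_1 Numeric 15 Y () (external)",
]:
    print(parse_resource(line))
```

Under these assumptions, static_sh1 is classified as static, and external_1 as an external load index sampled every 15 seconds.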
lsf.shared(5)    openlava Version 4.2 - June 2001