Knowledge Center Previous Next Index |
Charting Resources with Windows Performance Monitor
LSF integrates with Windows Performance Monitor, so you can chart LSF cluster, host, queue, and job performance information. Windows Performance Monitor can also be used to trigger external commands when specified thresholds are exceeded.
A service called LSF Monitor passes information from LSF to the Windows Performance Monitor. LSF Monitor must be installed separately. On UNIX, you can use xlsmon to chart load information or you can use Platform SiteAssure to manage LSF resources on UNIX. To chart historical data, you can use Platform Analyzer.
This chapter provides instructions for installing, configuring, and using LSF Monitor.
Contents
- LSF Monitor statistics
- Installing LSF Monitor
- Configuring LSF Monitor
- Using LSF Monitor
- Uninstalling LSF Monitor
LSF Monitor statistics
Once installed, LSF Monitor automatically sends information to the Windows Performance Monitor. Use the Windows Performance Monitor to chart LSF performance information.
The host, queue, and job objects support multiple instances.
The following LSF information is available:
Cluster information
- Number of available servers
- Number of unavailable servers
- Number of servers where an LSF daemon (sbatchd or RES service) is down
- Number of unlicensed servers
- Number of pending jobs in the cluster
- Number of running jobs in the cluster
- Number of suspended jobs in the cluster
- Number of sick jobs (jobs submitted with no password, jobs with job dependency never satisfied, and jobs pending more than 3 days)
- Response time of LIM (as measured by the time to make an ls_load call)
- Response time of mbatchd (as measured by the time to make an lsb_queueinfo call)
Host information
- Load indices: r15s, r15m, mem, swap, pg, ut
- Number of running jobs
- Number of suspended jobs
- Number of reserved job slots
- External load Indices
Queue information
- Number of pending jobs
- Number of running jobs
- Number of suspended jobs
- Number of reserved job slots
Job information
- CPU time used by the job
- Memory used by the job (for jobs running on UNIX only)
- Swap space used by the job (for jobs running on UNIX only)
External information
- Values of one or two external load indices (configured by the LSF administrator)
Installing LSF Monitor
Requirements
LSF version
You must have a cluster running LSF version 4.0 or higher.
Host
Install LSF Monitor on any LSF server or client host running Windows.
Cluster
The cluster can include UNIX hosts.
Account
The LSF Monitor service runs under the account of an LSF cluster administrator. You will need to provide an LSF cluster administrator account and password.
Install LSF Monitor
The LSF Monitor setup program is installed with LSF. Use lsfmon -install to actually install the LSF Monitor service:
- Log on to a Windows host as an LSF user in an existing LSF cluster.
- In a command prompt, type:
lsfmon -install
LSF Monitor is installed.
- On the Windows Control Panel, click Services.
The Services window opens.
- Right-click LSF Monitor and click Properties.
- In the Log On As section, deselect System Account, select This Account, and specify an LSF cluster administrator account (such as Administrator).
- Type in the password twice and click OK.
- In the Services window, select LSF Monitor and click Start to start the service.
Configuring LSF Monitor
You can configure sample intervals for host, queue and job information along with external load indices.
In this section
Configure sample intervals
Overview
LSF Monitor periodically samples information from LSF and updates the Windows Performance Monitor.
Default sample intervals
By default, information is sampled at the following intervals:
- Host information = 30 seconds
- Queue information = 45 seconds
- Job information = 60 seconds
Change sample intervals
To change the sample intervals for LSF host, job, or queue information, modify the Windows Registry settings.
Back up your registry before you make any changes.
To modify the registry:
- Select the Registry subkey:
HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\LSFMonitor- Edit the appropriate value, and specify the new sample interval in seconds:
- SampleIntervalHost
- SampleIntervalJob
- SampleIntervalQueue
Configure external load indices
You can use the Windows Performance Monitor to chart up to 2 external load indices, which you specify.
To configure LSF Monitor to monitor external load indices, modify the Windows Registry settings. Update your backups before you modify the Registry.
- Go to the Registry subkey :HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\LSFMonitor
- Specify the appropriate value, and type the name of an external load index that is configured in your cluster:
- ExternalLoadIndex1
- ExternalLoadIndex2
Using LSF Monitor
Start and stop LSF Monitor
Use the Windows Control Panel to start or stop the LSF Monitor service.
View information about LSF Monitor
Errors related to LSF API calls and the operation of LSF services will be logged to the Windows event log. You can use the Windows Event Viewer to view this information.
Uninstalling LSF Monitor
Use lsfmon -remove to uninstall LSF Monitor. This command will first stop the LSF Monitor service if it is running, then remove it and remove related information from the Windows Registry.
Uninstall LSF Monitor
To uninstall LSF Monitor:
- In a DOS window, type:
lsfmon -remove
Platform Computing Inc.
www.platform.com |
Knowledge Center Previous Next Index |