Installing Platform LSF in a Mixed Cluster

Set up a Linux cluster with Windows compute nodes

Complete the following steps to set up a Linux cluster with Windows compute nodes.

  1. Install the Linux cluster on a shared file system.
  2. Configure the Linux cluster.
  3. Restart the LSF cluster.
  4. Install the Windows compute node.
  5. Start the Windows compute node.

Install the Linux cluster on a shared file system

Install the Linux cluster as described in the Linux install guide, with exceptions to allow for Windows compute nodes.

Edit install.conf and specify the following:

  1. Enable dynamic hosts.

    Enable or add the following line:

    ENABLE_DYNAMIC_HOSTS=Y

  2. Optional. Allow EGO to control the LSF daemons.

    Enable or add the following line:

    EGO_DAEMON_CONTROL=Y

  3. Specify the cluster administrator.

    LSF_ADMIN=user_account

    For example:

    LSF_ADMIN=lsfadmin

  4. Specify the installation directory.

    LSF_TOP=directory

    For example:

    LSF_TOP=$SHARE/LSF_8.0

Configure the Linux cluster

  1. Optional. If you allowed EGO to control the LSF daemons, add Windows compute node information to the LSF service configuration files.
    1. Edit LSF_TOP/conf/ego/cluster_name/eservice/esc/conf/services/res.xml
    2. Navigate to the section with the correct Windows host type.

      For 64-bit compute hosts, navigate to NTX64. For IA hosts, navigate to NTIA64. For other Windows compute hosts, navigate to NTX86.

    3. Add the proper Windows compute node information for the Command and ExecutionUser tags.

      For example:

      <sc:ActivityDescription>
        <ego:Attribute name="hostType" type="xsd:string">NTX86</ego:Attribute>
        <ego:ActivitySpecification>
        <ego:Command>C:\LSF_8.0\8.0\etc\res.exe -3</ego:Command>
        <ego:ExecutionUser>LSF\lsfadmin</ego:ExecutionUser>
        <ego:EnvironmentVariable name="LSF_ENVDIR">C:\LSF_8.0\conf</ego:EnvironmentVariable>
        ...
    4. Edit LSF_TOP/conf/ego/cluster_name/eservice/esc/conf/services/ sbatchd.xml
    5. Navigate to the section with the correct Windows host type.

      For 64-bit compute hosts, navigate to NTX64. For IA hosts, navigate to NTIA64. For other Windows compute hosts, navigate to NTX86.

    6. Add the proper Windows compute node information for the Command and ExecutionUser tags.

      For example:

      <sc:ActivityDescription>
        <ego:Attribute name="hostType" type="xsd:string">NTX86</ego:Attribute>
        <ego:ActivitySpecification>
        <ego:Command>C:\LSF_8.0\8.0\etc\sbatchd.exe -3</ego:Command>
        <ego:ExecutionUser>LSF\lsfadmin</ego:ExecutionUser>
        ...
  2. Add the Windows cluster administrator account to the your cluster file.
    1. Edit LSF_CONFDIR/lsf.cluster.cluster_name
    2. In the ClusterAdmins section, add LSF\lsfadmin to the Administrators list.

      For example:

      Begin ClusterAdmins
      Administrators = lsfadmin LSF\lsfadmin
      End ClusterAdmins
  3. Add the LSF user domain to the lsf.conf file.
    1. Edit LSF_TOP/LSF8.0/conf/lsf.conf
    2. Add the LSF user domain.

      LSF_USER_DOMAIN=lsf_user_domain

      For example:

      LSF_USER_DOMAIN=LSF

  4. Register the Windows execution password to your Linux cluster.
    1. If not using EGO to control the LSF daemons:
      • Register Windows user passwords to your cluster password file for all users submitting jobs to LSF.

        lspasswd ‑u "domain\admin" ‑p password

        For example:

        lspasswd ‑u "LSF\lsfadmin" ‑p lsfpasswd

        Passwords must be 31 characters or less.

    2. If using EGO to control the LSF daemons:
      • Log on to any host in the cluster as egoadmin.

      • Log on to EGO as the cluster administrator.

        For example:

        egosh user logon ‑u Admin ‑x mypasswd

      • Register Windows user passwords to your cluster password file for all users submitting jobs to LSF.

        egosh ego execpasswd ‑u "domain\admin" ‑x password -noverify

        For example:

        egosh ego execpasswd ‑u "LSF\lsfadmin" ‑x lsfpasswd -noverify

        The password must be 31 characters or less.

        The -noverify option is required since only a Windows host can verify the password for a Windows user.

Restart the Platform LSF cluster

Restart the LSF cluster.

lsfstartup

Install the Windows compute node

  1. Install the Windows compute node as described in the Windows install guide, with exceptions to be part of a Linux cluster.

    Specify the following options during installation:

    1. Specify the Linux master host as your master host name.

      Master_Name=linux_master_name

    2. If you allowed EGO to control the LSF daemons in your cluster master host, allow EGO to control the LSF daemons in your compute node.

      EGO_DAEMON_CONTROL=Y

    3. Specify the same port number as that of the Linux master host.

      Port_Number=base_port_number

    4. Specify the cluster administrator to be the same as on your master host.

      LSF_ADMIN=domain\user_account

      For example:

      LSF_ADMIN=LSF\lsfadmin

    5. Specify the installation directory.

      LSF_TOP=directory

      For example:

      LSF_TOP=C:\LSF_8.0

  2. Register the Windows execution user password to your cluster password file.

    lspasswd ‑u "domain\admin" ‑p password

    For example:

    lspasswd ‑u "LSF\lsfadmin" ‑p lsfpasswd

    The password must be 31 characters or less.

Start the Windows compute node

  1. Start the Windows compute node:

    lsadmin limstartup

  2. If you did not allow EGO to control the LSF daemons, manually start the LSF services:

    lsfadmin resstartup

    badmin hstartup

Set up a Windows cluster with Linux compute nodes

Complete the following steps to set up a Windows cluster with Linux compute nodes if Ego is not controlling the LSF daemons.

  1. Install the Windows cluster master host as described in the Windows installation guide.
    1. Add the LSF user domain to the lsf.conf file.
      • Edit C:\LSF8.0\conf\lsf.conf

      • Add the LSF user domain.

        LSF_USER_DOMAIN=lsf_user_domain

        For example:

        LSF_USER_DOMAIN=LSF

    2. Restart the Windows cluster.

      lsfrestart

    3. Register Windows execution user passwords to your cluster password file for all users submitting jobs to LSF.

      lspasswd ‑u "domain\admin" ‑p password

      For example:

      lspasswd ‑u "LSF\lsfadmin" ‑p lsfpasswd

      Passwords must be 31 characters or less.

  2. Install the Linux compute nodes as described in the Linux installation guide.
    1. Specify the Windows master host as your master host name.

      Master_Name=windows_master_name

    2. Start the Linux compute nodes

Set up a Windows cluster with Linux compute nodes and EGO controlling Platform LSF daemons

Complete the following steps to set up a Windows cluster with Linux compute nodes and EGO controlling the LSF daemons.

  1. Install the Windows cluster master host as described in the Windows installation guide and allow EGO to control the LSF daemons.
    1. Specify the following option:

      EGO_DAEMON_CONTROL=Y

    2. Add the LSF user domain to the lsf.conf file.
      • Edit C:\LSF8.0\conf\lsf.conf

      • Add the LSF user domain.

        LSF_USER_DOMAIN=lsf_user_domain

        For example:

        LSF_USER_DOMAIN=LSF

  2. Install the Linux compute node as described in the Linux installation guide and allow EGO to control the LSF daemons.

    Edit install.conf and add the following line:

    EGO_DAEMON_CONTROL=Y

  3. Restart the Windows cluster.

    egosh ego restart