Qlogic QME7342 User Manual

View online or download the User Manual for Qlogic QME7342 software. Qlogic QME7342 User Guide.

Content Summary

Page 1 - Release 1.5.4

IB0054606-02 A OFED+ Host Software Release 1.5.4 User Guide

Page 2 - Document Revision History

OFED+ Host Software Release 1.5.4 User Guide (x). Open MPI Troubleshooting

Page 3 - Table of Contents

5–Using Other MPIs: MVAPICH2 (5-4). Here is an example of a simple mpirun command running with four processes: $ mpirun -np 4 -hostfile mpihos

Page 4

5–Using Other MPIs: Managing MVAPICH, and MVAPICH2 with the mpi-selector Utility (5-5). Running MVAPICH2 Applications. By default, the MVAPICH2

Page 5 - IB0054606-02 A v

5–Using Other MPIs: Platform MPI 8 (5-6). The example shell scripts mpivars.sh and mpivars.csh, for registering with mpi-selector, are provid

Page 6 - 5 Using Other MPIs

5–Using Other MPIs: Intel MPI (5-7). to, MPI_ICMOD_PSM__PSM_PATH = "^". Compiling Platform MPI 8 Applications. As with Open MPI, QLogic

Page 7

5–Using Other MPIs: Intel MPI (5-8). Installation. Follow the instructions for download and installation of Intel MPI from the Intel web site. S

Page 8

5–Using Other MPIs: Intel MPI (5-9). Using DAPL 2.0.
$ rpm -qa | grep dapl
dapl-devel-static-2.0.19-1
compat-dapl-1.2.14-1
dapl-2.0.19-1
dapl-debu

Page 9

5–Using Other MPIs: Intel MPI (5-10). Substitute bin if using 32-bit. Compiling Intel MPI Applications. As with Open MPI, QLogic recommended tha

Page 10 - G Commands and Files

5–Using Other MPIs: Intel MPI (5-11). uDAPL 1.2: -genv I_MPI_DEVICE rdma:OpenIB-cma; uDAPL 2.0: -genv I_MPI_DEVICE rdma:ofa-v2-ib. To help with d

Page 11 - H Recommended Reading

5–Using Other MPIs: Improving Performance of Other MPIs Over IB Verbs (5-12). Performance of

Page 12 - List of Figures

6 SHMEM Description and Configuration: Overview (6-1). QLogic SHMEM is a user-level communications library for one-sided operations. It imple

Page 13 - List of Tables

OFED+ Host Software Release 1.5.4 User Guide (xi). iba_packet_capture, G-21; ibhosts

Page 14 - User Guide

6–SHMEM Description and Configuration: Installation (6-2). The -qlc suffix denotes that this is the QLogic PSM version. MVAPICH version 1.2.

Page 15 - Documentation Conventions

6–SHMEM Description and Configuration: SHMEM Programs (6-3). By default, QLogic SHMEM is installed with a prefix of /usr/shmem/qlogic into the

Page 16 - License Agreements

6–SHMEM Description and Configuration: SHMEM Programs (6-4). The % character in the previous example is used to indicate the shell prompt and

Page 17 - Technical Support

6–SHMEM Description and Configuration: SHMEM Programs (6-5). -Wl,--export-dynamic,--allow-shlib-undefined -L $SHMEM_DIR/lib64/default -lqlogic_

Page 18 - Knowledge Database

6–SHMEM Description and Configuration: SHMEM Programs (6-6). By default, mpirun is picked up from the path and is assumed to be called mpirun.

Page 19 - 1 Introduction

6–SHMEM Description and Configuration: QLogic SHMEM Relationship with MPI (6-7). QLogic SHMEM requires the

Page 20 - Overview

6–SHMEM Description and Configuration: Slurm Integration (6-8). QLogic SHMEM relies on an MPI implementation to provide a ru

Page 21 - Interoperability

6–SHMEM Description and Configuration: Sizing Global Shared Memory (6-9). The salloc allocates 16 nodes and runs one copy of shmemrun on the

Page 22 - 1-4 IB0054606-02 A

6–SHMEM Description and Configuration: Sizing Global Shared Memory (6-10). The QLogic SHMEM library pre-allocates room in the virtual address

Page 23 - 2 Step-by-Step Cluster Setup

6–SHMEM Description and Configuration: Progress Model (6-11). Alternatively, if $SHMEM_SHMALLOC_BASE_ADDR is specified as 0, then each SHMEM

Page 24 - Using MPI

OFED+ Host Software Release 1.5.4 User Guide (xii). List of Figures: 3-1 QLogic OFED+ Software Structure

Page 25 - 3 InfiniBand

6–SHMEM Description and Configuration: Progress Model (6-12). Active Progress. In the active progress mode, SHMEM progress is achieved when the

Page 26 - Installed Layout

6–SHMEM Description and Configuration: Environment Variables (6-13). SHMEM's long message protocol is disabled. This is because the lo

Page 27

6–SHMEM Description and Configuration: Environment Variables (6-14). $SHMEM_SHMALLOC_CHECK on Shared memory consistency checks set for 0 to

Page 28

6–SHMEM Description and Configuration: Implementation Behavior (6-15). The command shmemrun automatically propagates SHMEM* environment varia

Page 29 - IPoIB Administration

6–SHMEM Description and Configuration: Implementation Behavior (6-16). For a put operation, these descriptions use the terms "local comp

Page 30 - IB Bonding

6–SHMEM Description and Configuration: Application Programming Interface (6-17). 8 byte put to a sync location. Target side: Wait for the

Page 31 - Red Hat EL5 and EL6

6–SHMEM Description and Configuration: Application Programming Interface (6-18). Table 6-3. SHMEM Application Programming Interface Calls. Oper

Page 32

6–SHMEM Description and Configuration: Application Programming Interface (6-19). shmem_put, shmem_put32, shmem_put64, shmem_put128, shmem_putmem. Non-b

Page 33

6–SHMEM Description and Configuration: Application Programming Interface (6-20). shmem_short_iput. Indexed Put Operations: shmem_ixput, shmem_ixpu

Page 34 - Subnet Manager Configuration

6–SHMEM Description and Configuration: Application Programming Interface (6-21). shmem_get32, shmem_get64, shmem_get128, shmem_getmem. Non-blocking G

Page 35

OFED+ Host Software Release 1.5.4 User Guide (xiii). List of Tables: 3-1 ibmtu Values

Page 36 - 3-12 IB0054606-02 A

6–SHMEM Description and Configuration: Application Programming Interface (6-22). Indexed Get Operations: shmem_ixget, shmem_ixget32, shmem_ixget64

Page 37 - Default Configuration

6–SHMEM Description and Configuration: Application Programming Interface (6-23). shmem_short_swap, shmem_int_swap, shmem_long_swap, shmem_longlong_

Page 38 - 3-14 IB0054606-02 A

6–SHMEM Description and Configuration: Application Programming Interface (6-24). shmem_long_fadd, shmem_longlong_fadd. Reductions: shmem_int_and_t

Strona 39 - [I

6–SHMEM Description and ConfigurationApplication Programming InterfaceIB0054606-02 A 6-25shmem_short_max_to_allshmem_complexd_sum_to_allcomplex colle

Strona 40 - [

6–SHMEM Description and ConfigurationApplication Programming Interface6-26 IB0054606-02 Ashmem_clear_lockshmem_test_lockEvents clear_eventset_eventwa

Page 41 - IB0054606-02 A 3-17

6–SHMEM Description and Configuration: SHMEM Benchmark Programs (6-27). The following SHMEM micro-benchmark programs

Page 42 - ScanFrequency

6–SHMEM Description and Configuration: SHMEM Benchmark Programs (6-28). Additional SHMEM micro-benchmark programs are included to measure get

Page 43 - Other Settings

6–SHMEM Description and Configuration: SHMEM Benchmark Programs (6-29). QLogic SHMEM all-to-all benchmark. shmem-alltoall: all-to-all put bench

Page 44 - Changing the MTU Size

6–SHMEM Description and Configuration: SHMEM Benchmark Programs (6-30). QLogic SHMEM barrier benchmark. shmem-barrier: barrier benchmark. Usage:

Page 45 - Managing the ib_qib Driver

6–SHMEM Description and Configuration: SHMEM Benchmark Programs (6-31). Table 6-8. QLogic SHMEM reduce benchmark options. Option, Description: -b

Page 46

OFED+ Host Software Release 1.5.4 User Guide (xiv).

Page 47

6–SHMEM Description and Configuration: SHMEM Benchmark Programs (6-32).

Page 48

7 Virtual Fabric support in PSM: Introduction (7-1). Performance Scaled Messaging (PSM) provides support for full Virtual Fabric (vFabric) in

Page 49 - Performance Tuning

7–Virtual Fabric support in PSM: Virtual Fabric Support (7-2). Virtual Fabric (vFabric) in PSM is supported with the QL

Page 50 - Default Parameter Settings

7–Virtual Fabric support in PSM: Using Service ID (7-3). Full vFabric integration with PSM is available, allowing the user t

Page 51

7–Virtual Fabric support in PSM: Verifying SL2VL tables on QLogic 7300 Series Adapters (7-4).

Page 52 - Intel CPU Systems

8 Dispersive Routing (8-1). InfiniBand® uses deterministic routing that is keyed from the Destination LID (DLID) of a port. The Fabric Mana

Page 53

8–Dispersive Routing (8-2). Internally, PSM utilizes dispersive routing differently for small and large messages. Large messages are any me

Page 54

8–Dispersive Routing (8-3). Static_Dest: The path selection is based on the CPU index of the destination process. Multiple paths can be u

Page 55 - IB0054606-02 A 3-31

8–Dispersive Routing (8-4).

Page 56

9 gPXE (9-1). gPXE is an open source (GPL) network bootloader. It provides a direct replacement for proprietary PXE ROMs. See http://etherb

Page 57

Preface (xv). The QLogic OFED+ Host Software User Guide shows end users how to use the installed software to set up the fabric. End users in

Page 58

9–gPXE: Preparing the DHCP Server in Linux (9-2). A Linux install image like kickstart, which then installs software to the local hard driv

Page 59 - Compute Nodes

9–gPXE: Preparing the DHCP Server in Linux (9-3). Installing DHCP. gPXE requires that the DHCP server runs on a machine that supports IP over I

Page 60 - Test Execution

9–gPXE: Preparing the DHCP Server in Linux (9-4). Configuring DHCP. 1. From the client host, find the GUID of the HCA by using p1info or look a

Page 61 - Homogeneous Nodes

9–gPXE: Netbooting Over IB (9-5). 4. Restart the DHCP server. Netbooting Over IB. The following procedures are an example of netbooting over IB,

Page 62 - Adapter and Other Settings

9–gPXE: Netbooting Over IB (9-6). 1. Install Apache. 2. Create an images.conf file and a kernels.conf file and place them in the /etc/httpd/co

Page 63 - Remove Unneeded Services

9–gPXE: Netbooting Over IB (9-7). To add an IB driver into the initrd file, the IB modules need to be copied to the diskless image. The host

Page 64 - Configuring for ssh

9–gPXE: Netbooting Over IB (9-8). b. The infinipath rpm will install the file /usr/share/infinipath/gPXE/gpxe-qib-modify-initrd with contents

Page 65

9–gPXE: Netbooting Over IB (9-9).
# extract previous contents
gunzip -dc ../initrd-ib-${kern}.img | cpio --quiet -id
# add infiniband modules
mk

Page 66

9–gPXE: Netbooting Over IB (9-10). IFS=' ' v6cmd='/sbin/insmod /lib/'${xfrm}'.ko' "$v6cmd" cryp

Page 67

9–gPXE: Netbooting Over IB (9-11).
/sbin/insmod /lib/ib/ib_sa.ko
/sbin/insmod /lib/ib/ib_cm.ko
/sbin/insmod /lib/ib/ib_uverbs.ko
/sbin/insmod /l

Page 68 - Process Limitation with ssh

Preface: License Agreements (xvi). Text in blue font indicates a hyperlink (jump) to a figure, table, or section in this guide, and links

Page 69

9–gPXE: Netbooting Over IB (9-12).
# and show the differences.
echo -e '\nChanges in files in initrd image\n'
diff Orig-listing New-l

Page 70

9–gPXE: Netbooting Over IB (9-13). The following is an example of a uniboot.php file: <? header ( 'Content-type: text/plain' ); fun

Page 71

9–gPXE: HTTP Boot Setup (9-14). This is the kernel that will boot. This file can be copied from any machine that has RHEL5.3 installed. 2. Star

Page 72 - 3-48 IB0054606-02 A

9–gPXE: HTTP Boot Setup (9-15). 5. Create an images.conf file and a kernels.conf file using the examples in Step 2 of Boot Server Setup and p

Page 73 - 4 Running MPI on QLogic

9–gPXE: HTTP Boot Setup (9-16).

Page 74

A Benchmark Programs (A-1). Several MPI performance measurement programs are installed by default with the MPIs you choose to install (such

Page 75 - Running Open MPI Applications

A–Benchmark Programs: Benchmark 1: Measuring MPI Latency Between Two Nodes (A-2). The program osu_latency, from Ohio State University, measur

Page 76

A–Benchmark Programs: Benchmark 1: Measuring MPI Latency Between Two Nodes (A-3). -H (or --hosts) allows the specification of the host list o

Page 77 - To Use Another Compiler

A–Benchmark Programs: Benchmark 2: Measuring MPI Bandwidth Between Two Nodes (A-4). The

Page 78 - Table 4-3. Intel

A–Benchmark Programs: Benchmark 2: Measuring MPI Bandwidth Between Two Nodes (A-5). Typical output might look like: # OSU MPI Bandwidth Test v

Page 79 - Process Allocation

Preface: Technical Support (xvii). Customers should contact their authorized maintenance provider for technical support of t

Page 80

A–Benchmark Programs: Benchmark 3: Messaging Rate Microbenchmarks (A-6). OSU Multiple Bandwidth /

Page 81 - IB0054606-02 A 4-9

A–Benchmark Programs: Benchmark 3: Messaging Rate Microbenchmarks (A-7). This was run on 12-core compute nodes, so we used Open MPI's -n

Page 82 - PSM_SHAREDCONTEXTS=NO

A–Benchmark Programs: Benchmark 3: Messaging Rate Microbenchmarks (A-8). N/2 is dynamically calculated at the end of the run. You can use

Page 83 - Running in Shared Memory Mode

A–Benchmark Programs: Benchmark 3: Messaging Rate Microbenchmarks (A-9). The benchmark has been updated with code to dynamically determine wh

Page 84

A–Benchmark Programs: Benchmark 3: Messaging Rate Microbenchmarks (A-10). Note the improved message rate at small message sizes of ~25 millio

Page 85 - Using Open MPI’s mpirun

A–Benchmark Programs: Benchmark 3: Messaging Rate Microbenchmarks (A-11). The higher peak bi-directional messaging rate of 34.6 million messa

Page 86

A–Benchmark Programs: Benchmark 3: Messaging Rate Microbenchmarks (A-12).

Page 87 - Environment for Node Programs

B SRP Configuration: SRP Configuration Overview (B-1). SRP stands for SCSI RDMA Protocol. It allows the SCSI protocol to run over IB for Stor

Page 88

B–SRP Configuration: QLogic SRP Configuration (B-2). An SRP Initiator Extension is a 64-bit numeric value that is appended to the port GUID

Page 89 - Setting MCA Parameters

B–SRP Configuration: QLogic SRP Configuration (B-3). Stopping, Starting and Restarting the SRP Driver. To stop the qlgc_srp driver, use the fol

Page 90

Preface: Technical Support (xviii). Knowledge Database. The QLogic knowledge database is an extensive collection of QLogic product information

Page 91 - Name Description

B–SRP Configuration: QLogic SRP Configuration (B-4). 1. By the port GUID of the IOC, or 2. By the IOC profile string that is created by the VI

Page 92

B–SRP Configuration: QLogic SRP Configuration (B-5). The system returns output similar to the following: st187:~/qlgc-srp-1_3_0_0_1 # ib_qlgc_s

Page 93 - is used by other

B–SRP Configuration: QLogic SRP Configuration (B-6). 0x0000494353535250 service 3 : name SRP.T10:0000000000000004 id 0x0000494353535250

Page 94 - Debugging MPI Programs

B–SRP Configuration: QLogic SRP Configuration (B-7). Enter ib_qlgc_srp_build_cfg. The system provides output similar to the following: # qlgc_

Page 95 - The TotalView

B–SRP Configuration: QLogic SRP Configuration (B-8).
noverify: 0
description: "SRP Virtual HBA 0"
end
The ib_qlgc_srp_build_cfg co

Page 96 - 4-24 IB0054606-02 A

B–SRP Configuration: QLogic SRP Configuration (B-9). Specifying a SRP Target Port. The SRP target can be specified in two different ways. To co

Page 97

B–SRP Configuration: QLogic SRP Configuration (B-10). Specifying a SRP Target Port of a Session by IOCGUID. The following example specifies a t

Page 98

B–SRP Configuration: QLogic SRP Configuration (B-11). Restarting the SRP Module. For changes to take effect, including changes to the SRP map o

Page 99

B–SRP Configuration: QLogic SRP Configuration (B-12). When the qlgc_srp module encounters an adapter command, that adapter is assigned all pr

Page 100 - MVAPICH2

B–SRP Configuration: QLogic SRP Configuration (B-13).
end
adapter
begin
description: "Test Device 1"
end
Configuring Fibre Channel Failo

Page 101 - Running MVAPICH2 Applications

1 Introduction: How this Guide is Organized (1-1). The QLogic OFED+ Host Software User Guide is organized into these sections: Section 1, pr

Page 102 - Platform MPI 8

B–SRP Configuration: QLogic SRP Configuration (B-14). Failover Configuration File 1: Failing over from one SRP Initiator port to another. In th

Page 103 - Intel MPI

B–SRP Configuration: QLogic SRP Configuration (B-15).
adapterIODepth: 1000
lunIODepth: 16
adapterMaxIO: 128
adapterMaxLUNs: 512
adapter

Page 104 - Installation

B–SRP Configuration: QLogic SRP Configuration (B-16). On the VIO hardware side, the following needs to be ensured: The target device is disc

Page 105

B–SRP Configuration: QLogic SRP Configuration (B-17). On the VIO hardware side, the following need to be ensured on each FVIC involved in the

Page 106

B–SRP Configuration: QLogic SRP Configuration (B-18). The target device is discovered and configured through the appropriate FC port. The S

Page 107

B–SRP Configuration: QLogic SRP Configuration (B-19). 2 Adapter Ports and 2 Ports on a Single VIO Module. In this example, traffic is load bala

Page 108 - 5-12 IB0054606-02 A

B–SRP Configuration: QLogic SRP Configuration (B-20). Using the roundrobinmode Parameter. In this example, the two sessions use different VIO h

Page 109 - 6 SHMEM Description and

B–SRP Configuration: QLogic SRP Configuration (B-21). Configuring SRP for Native IB Storage. 1. Review ib_qlgc_srp_query. QLogic Corporation. Vi

Page 110

B–SRP Configuration: QLogic SRP Configuration (B-22). 2. Edit /etc/sysconfig/qlgc_srp.cfg to add this information. # service : name SRP.T10:0

Page 111 - SHMEM Programs

B–SRP Configuration: QLogic SRP Configuration (B-23). roundrobinmode: 0 # set to 1 if you do not want target connectivity verification no

Page 112 - Compiling SHMEM Programs

OFED+ Host Software Release 1.5.4 User Guide (ii). Information furnished in this manual is believed to be accurate and reliable. However, QL

Page 113 - Running SHMEM Programs

1–Introduction: Overview (1-2). Appendix C, describes two methods the administrator can use to allow users to submit MPI jobs through batch

Page 114

B–SRP Configuration: OFED SRP Configuration (B-24). Additional Details. All LUNs found are reported to the Linux SCSI mid-layer. Linux may n

Page 115

B–SRP Configuration: OFED SRP Configuration (B-25). 3. Choose the device you want to use, and run the command again with the -c option (as a

Page 116 - Slurm Integration

B–SRP Configuration: OFED SRP Configuration (B-26). Notes

Page 117 - Sizing Global Shared Memory

C Integration with a Batch Queuing System (C-1). Most cluster systems use some kind of batch queuing system as an orderly way to provide us

Page 118 - 6-10 IB0054606-02 A

C–Integration with a Batch Queuing System: Clean-up PSM Shared Memory Files (C-2). This command displays a list of processes using InfiniPath

Page 119 - Progress Model

C–Integration with a Batch Queuing System: Clean-up PSM Shared Memory Files (C-3).
#!/bin/sh
files=`/bin/ls /dev/shm/psm_shm.* 2> /dev/null

Page 120 - Passive Progress

C–Integration with a Batch Queuing System: Clean-up PSM Shared Memory Files (C-4).

Page 121 - Environment Variables

D Troubleshooting (IB6054601-00 F Preliminary, Using LEDs to Check the State of the Adapter-1). This appendix describes some of the tools you can use to di

Page 122

D–Troubleshooting: BIOS Settings (D-2). This section covers issues related to BIOS settings. The most important setting is Advanc

Page 123 - Implementation Behavior

D–Troubleshooting: Kernel and Initialization Issues (D-3). Driver Load Fails Due to Unsupported Kernel. If you try to load the InfiniPath drive

Page 124 - 6-16 IB0054606-02 A

1–Introduction: Interoperability (1-3). An embedded subnet manager can be used in one or more managed switches. QLogic offers the QLogic Em

Page 125 - IB0054606-02 A 6-17

D–Troubleshooting: Kernel and Initialization Issues (D-4). A zero count in all CPU columns means that no InfiniPath interrupts have been deli

Page 126 - Operation Calls

D–Troubleshooting: Kernel and Initialization Issues (D-5). InfiniPath ib_qib Initialization Failure. There may be cases where ib_qib was not pr

Page 127

D–Troubleshooting: OpenFabrics and InfiniPath Issues (D-6). MPI Job Failures Due to Initialization Problems. If one or more nodes do not have t

Page 128

D–Troubleshooting: OpenFabrics and InfiniPath Issues (D-7). Manual Shutdown or Restart May Hang if NFS in Use. If you are using NFS over IPoIB

Page 129

D–Troubleshooting: System Administration Troubleshooting (D-8). ibsrpdm Command Hangs when Two Host Channel Adapters are Installed but Only U

Page 130

D–Troubleshooting: Performance Issues (D-9). Broken Intermediate Link. Sometimes message traffic passes through the fabric while other traffic

Page 131

D–Troubleshooting: Performance Issues (D-10). Erratic Performance. Sometimes erratic performance is seen on applications that use interrupts. A

Page 132

D–Troubleshooting: Performance Issues (D-11). This method is not the first choice because, on some systems, there may be two rows of ib_qib o

Page 133

D–Troubleshooting: Open MPI Troubleshooting (D-12). Performance Warning if ib_qib Shares Interrupts with eth0. When ib_qib shares interrupts wi

Page 134

E ULP Troubleshooting: Troubleshooting VirtualNIC and VIO Hardware Issues (E-1). To verify that an IB host can access an Ethernet system thro

Page 135 - SHMEM Benchmark Programs

1–Introduction: Interoperability (1-4).

Page 136

E–ULP Troubleshooting: Troubleshooting VirtualNIC and VIO Hardware Issues (E-2). Verify that the proper VirtualNIC driver is running. Check tha

Page 137

E–ULP Troubleshooting: Troubleshooting VirtualNIC and VIO Hardware Issues (E-3). Verifying that the host can communicate with the I/O Control

Page 138 - QLogic SHMEM reduce benchmark

E–ULP Troubleshooting: Troubleshooting VirtualNIC and VIO Hardware Issues (E-4). ID: Chassis 0x00066A00010003F2, Slot 1, IOC 3

Page 139 - Option Description

E–ULP Troubleshooting: Troubleshooting VirtualNIC and VIO Hardware Issues (E-5). If the host cannot see applicable IOCs, there are two thing

Page 140 - 6-32 IB0054606-02 A

E–ULP Troubleshooting: Troubleshooting VirtualNIC and VIO Hardware Issues (E-6). Another reason why the host might not be able to see the nec

Page 141 - 7 Virtual Fabric support in

E–ULP Troubleshooting: Troubleshooting VirtualNIC and VIO Hardware Issues (E-7).
DEVICE=eioc1
BOOTPROTO=static
IPADDR=172.26.48.132
BROADCAST=17

Page 142 - Using SL and PKeys

E–ULP Troubleshooting: Troubleshooting VirtualNIC and VIO Hardware Issues (E-8). There are up to 6 IOC GUIDs on each VIO hardware module (6 f

Page 143 - Using Service ID

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-9). ib_qlgc_srp_stats showing session in disconnected state. Pro

Page 144 - Adapters

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-10). : 0x0000000000000000 Completed Receives : 0x00000000000002c0 | Receive Error

Page 145 - 8 Dispersive Routing

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-11). Solution: Perhaps an interswitch cable has been disconnected, or the VIO hardware i

Page 146 - 8-2 IB0054606-02 A

2 Step-by-Step Cluster Setup and MPI Usage Checklists (2-1). This section describes how to set up your cluster to run high-performance Mess

Page 147 - IB0054606-02 A 8-3

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-12). Following is an example: SCSI Host # : 17 | Mode :

Page 148 - 8-4 IB0054606-02 A

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-13). SWUs : 0x00000000000003e8 Busy SWUs : 0x0000000000000000 | SRP Req

Page 149 - IB0054606-02 A 9-1

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-14). Solution 1: The host initiator has not been configured as an SRP initiator on the VI

Page 150 - Required Steps

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-15). Solution: This indicates a problem in the path between the VIO hardware and the targ

Page 151 - Installing DHCP

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-16). Solution 2: Make certain that all sessions have a map to the same disk defined. The

Page 152 - Configuring DHCP

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-17). In a failover configuration, if everything is configured correctly, one session wil

Page 153 - Netbooting Over IB

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-18). The system displays information similar to the following: st106:~ # ibv_devinfo -i 1

Page 154

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-19). Need to determine the SRP driver version. Solution: To determine the SRP driver versi

Page 155

E–ULP Troubleshooting: Troubleshooting SRP Issues (E-20).

Page 156

F Write Combining: Introduction (F-1). Write Combining improves write bandwidth to the QLogic driver by writing multiple words in a single bu

Page 157

2–Step-by-Step Cluster Setup and MPI Usage Checklists: Using MPI (2-2). 8. Set up the host environment to use ssh. Two methods are discussed

Page 158

F–Write Combining: MTRR Mapping and Write Combining (F-2). Revert to using MTRR-only behavior by following one of the two suggestions in MTRR

Page 159

F–Write Combining: Verify Write Combining is Working (F-3). The test results will list any problems, if they exist, and provide suggestions o

Page 160

F–Write Combining: Verify Write Combining is Working (F-4). Notes

Page 161

G Commands and Files (G-1). The most useful commands and files for debugging, and common tasks, are presented in the following sections. Ma

Page 162 - HTTP Boot Setup

G–Commands and Files: Restarting InfiniPath (G-2). When the driver status appears abnormal on any node, you can try rest

Page 163 - /etc/init.d/httpd start

G–Commands and Files: Summary and Descriptions of Commands (G-3). ibtracert [a]: Determines the path that IB packets travel between two nodes. ibv_d

Page 164 - 9-16 IB0054606-02 A

G–Commands and Files: Summary and Descriptions of Commands (G-4). dmesg. dmesg prints out bootup messages. It is useful for checking for initia

Page 165 - A Benchmark Programs

G–Commands and Files: Summary and Descriptions of Commands (G-5). -S/--sgid GID: Source GID. (Can be in GID (“0x########:0x########”) or ine

Page 166

G–Commands and Files: Summary and Descriptions of Commands (G-6). Sample output: # iba_opp_query --slid 0x31 --dlid 0x75 --sid 0x107 Query Para

Page 167

G–Commands and Files: Summary and Descriptions of Commands (G-7). resv2 0x0 resv3 0x0. Explanation o

Page 168 - Between Two Nodes

3 InfiniBand® Cluster Setup and Administration (3-1). This section describes what the cluster administrator needs to know about the QLogic

Page 169

G–Commands and Files: Summary and Descriptions of Commands (G-8). Examples: Query by LID and SID: iba_opp_query -s 0x31 -d 0x75 -i 0x107 iba_opp

Page 170 - (osu_mbw_mr)

G–Commands and Files: Summary and Descriptions of Commands (G-9). iba_hca_rev. This command scans the system and reports hardware and firmware

Page 171

G–Commands and Files: Summary and Descriptions of Commands (G-10).
[ADAPTER]
PSID = MT_0D80120009
pcie_gen2_speed_supported = true
adapter_dev_i

Page 172 - A-8 IB0054606-02 A

G–Commands and Files: Summary and Descriptions of Commands (G-11).
port1_sd2_ob_preemp_pre_qdr = 0x0
port2_sd2_ob_preemp_pre_qdr = 0x0
port1_sd

Page 173

G–Commands and Files: Summary and Descriptions of Commands (G-12).
port2_sd3_muxmain_qdr = 0x1f
mellanox_qdr_ib_support = true
mellanox_ddr_ib_

Page 174

G–Commands and Files: Summary and Descriptions of Commands (G-13).
auto_ddr_option_4.tx_preemp = 0x0
auto_ddr_option_5.tx_preemp_pre = 0x5
auto

Page 175 - IB0054606-02 A A-11

G–Commands and Files: Summary and Descriptions of Commands (G-14).
auto_ddr_option_11.tx_preemp_msb = 0x0
auto_ddr_option_11.tx_preemp_post =

Page 176 - A-12 IB0054606-02 A

G–Commands and Files: Summary and Descriptions of Commands (G-15).
auto_ddr_option_6.rx_offs_lowpass_en = 0x0
auto_ddr_option_7.rx_offs_lowpas

Page 177 - B SRP Configuration

G–Commands and Files: Summary and Descriptions of Commands (G-16).
auto_ddr_option_5.rx_main = 0xe
auto_ddr_option_6.rx_main = 0xf
auto_ddr_opt

Page 178 - QLogic SRP Configuration

G–Commands and Files: Summary and Descriptions of Commands (G-17).
auto_ddr_option_11.rx_muxeq = 0x04
auto_ddr_option_11.rx_muxmain = 0x1f
auto

Page 179 - Specifying a Session

3–InfiniBand® Cluster Setup and Administration: Installed Layout (3-2). This section describes the default installed layout f

Page 180 - B-4 IB0054606-02 A

G–Commands and Files: Summary and Descriptions of Commands (G-18).
lbist_shift_freq = 3
pll_stabilize = 0x13
flash_div = 0x3
lbist_array_bypass

Page 181

G–Commands and Files: Summary and Descriptions of Commands (G-19). FW image verification succeeded. Image is bootable. ###################### i

Page 182 - script as follows:

G–Commands and Files: Summary and Descriptions of Commands (G-20). linkwidth (link width supported) – use -i for integer value (1=1X, 2=4X, 3

Page 183

G–Commands and Files: Summary and Descriptions of Commands (G-21). Example: iba_manage_switch -t 0x00066a00e3001234 -f QLogic_12000_V1_firmware

Page 184 - Port Indexes

G–Commands and Files: Summary and Descriptions of Commands (G-22). -a alarm – number of seconds for alarm trigger to dump capture and exit; -s

Page 185 - Specifying a SRP Target Port

G–Commands and Files: Summary and Descriptions of Commands (G-23). Following is a sample output for the DDR adapters: # ibstatus Infiniband dev

Page 186 - Specifying an Adapter

G–Commands and Files: Summary and Descriptions of Commands (G-24). ibv_devinfo. This program displays information about IB devices, including v

Page 187 - Restarting the SRP Module

G–Commands and Files: Summary and Descriptions of Commands (G-25). If the /lib/modules/OS_version/updates directory is not present, then the

Page 188

G–Commands and Files: Summary and Descriptions of Commands (G-26). ipath_checkout performs the following seven tests on the cluster: 1. Execut

Page 189

G–Commands and Files, Summary and Descriptions of Commands, IB0054606-02 A G-27
In most cases of failure, the script suggests recommended actions. Also re

Page 190 - SRP Initiator port to another

3–InfiniBand® Cluster Setup and Administration, IB and OpenFabrics Driver Overview, IB0054606-02 A 3-3
IB and OpenFabrics Driver Overview
The ib_qib module

Page 191

G–Commands and Files, Summary and Descriptions of Commands, G-28 IB0054606-02 A
Here is sample usage and output:
% ipath_control -i
$Id: QLogic OFED Release

Page 192

G–Commands and Files, Summary and Descriptions of Commands, IB0054606-02 A G-29
MTRR is used by the InfiniPath driver to enable write combining to the QLo

Page 193

G–Commands and Files, Summary and Descriptions of Commands, G-30 IB0054606-02 A
• Test the IB link and bandwidth between two InfiniPath IB adapters. Usin

Page 194 - FVIC in Chassis

G–Commands and Files, Summary and Descriptions of Commands, IB0054606-02 A G-31
mpirun
mpirun determines whether the program is being run against a QLogic

Page 195

G–Commands and Files, Common Tasks and Commands, G-32 IB0054606-02 A
This option poisons receive buffers at initialization and after each receive; pre-ini

Page 196

G–Commands and Files, Common Tasks and Commands, IB0054606-02 A G-33
Table G-3. Common Tasks and Commands Summary (columns: Function, Command)
Check the system state i

Page 197 - 1. Review ib_qlgc_srp_query

G–Commands and Files, Summary and Descriptions of Useful Files, G-34 IB0054606-02 A
Summary and Descriptions of Useful Files
Useful files are summarized in

Page 198 - Native IB Storage SRP Driver

G–Commands and Files, Summary and Descriptions of Useful Files, IB0054606-02 A G-35
This information is useful for reporting problems to Technical Support

Page 199

G–Commands and Files, Summary of Configuration Files, G-36 IB0054606-02 A
This same directory contains other files with information related to status. The

Page 200 - OFED SRP Configuration

G–Commands and Files, Summary of Configuration Files, IB0054606-02 A G-37
Table G-7. Configuration Files (columns: Configuration File Name, Description)
/etc/modprobe

Page 201

3–InfiniBand® Cluster Setup and Administration, IPoIB Network Interface Configuration, 3-4 IB0054606-02 A
This example assumes that no hosts files exist,

Page 202 - B-26 IB0054606-02 A

G–Commands and Files, Summary of Configuration Files, G-38 IB0054606-02 A

Page 203 - C Integration with a Batch

IB0054606-02 A H-1
H Recommended Reading
Reference material for further reading is provided in this appendix.
References for MPI
The MPI Standard specifi

Page 204

H–Recommended Reading, OpenFabrics, H-2 IB0054606-02 A
OpenFabrics
Information about the OpenFabrics Alliance (OFA) is located at:
http://www.openfabrics.or

Page 206 - C-4 IB0054606-02 A

Corporate Headquarters: QLogic Corporation, 26650 Aliso Viejo Parkway, Aliso Viejo, CA 92656, 949.389.6000, www.qlogic.com
International Office

Page 207 - D Troubleshooting

3–InfiniBand® Cluster Setup and Administration, IPoIB Administration, IB0054606-02 A 3-5
IPoIB Administration
Administering IPoIB
Stopping, Starting and Res

Page 208 - BIOS Settings

IB0054606-02 A iii
Table of Contents
Preface
Intended Audience

Page 209

3–InfiniBand® Cluster Setup and Administration, IB Bonding, 3-6 IB0054606-02 A
NAME field specified in the CREATE block. The following is an example of th

Page 210

3–InfiniBand® Cluster Setup and Administration, IB Bonding, IB0054606-02 A 3-7
Red Hat EL5 and EL6
The following is an example for bond0 (master). The file

Page 211

3–InfiniBand® Cluster Setup and Administration, IB Bonding, 3-8 IB0054606-02 A
SuSE Linux Enterprise Server (SLES) 10 and 11
The following is an example fo

Page 212 - InfiniPath

3–InfiniBand® Cluster Setup and Administration, IB Bonding, IB0054606-02 A 3-9
Verify the following line is set to the value of yes in /etc/sysconfig/boot

Page 213 - SDP Module Not Loading

3–InfiniBand® Cluster Setup and Administration, Subnet Manager Configuration, 3-10 IB0054606-02 A
Example of ifconfig output:
st2169:/etc/sysconfig # ifcon

Page 214

3–InfiniBand® Cluster Setup and Administration, Subnet Manager Configuration, IB0054606-02 A 3-11
OpenSM is a component of the OpenFabrics project that pr

Page 215 - Performance Issues

3–InfiniBand® Cluster Setup and Administration, QLogic Distributed Subnet Administration, 3-12 IB0054606-02 A
QLogic Distributed Subnet Administration
As I

Page 216 - Erratic Performance

3–InfiniBand® Cluster Setup and Administration, QLogic Distributed Subnet Administration, IB0054606-02 A 3-13
Virtual Fabrics and the Distributed SA
The IB

Page 217

3–InfiniBand® Cluster Setup and Administration, QLogic Distributed Subnet Administration, 3-14 IB0054606-02 A
If you are using the QLogic Fabric Manager i

Page 218 - Open MPI Troubleshooting

3–InfiniBand® Cluster Setup and Administration, QLogic Distributed Subnet Administration, IB0054606-02 A 3-15
Figure 3-4. Distributed SA Multiple Virtual

Page 219

iv IB0054606-02 A, OFED+ Host Software Release 1.5.4 User Guide
Subnet Manager Configuration

Page 220

3–InfiniBand® Cluster Setup and Administration, QLogic Distributed Subnet Administration, 3-16 IB0054606-02 A
Figure 3-6. Virtual Fabrics with PSM_MPI Vir

Page 221

3–InfiniBand® Cluster Setup and Administration, QLogic Distributed Subnet Administration, IB0054606-02 A 3-17
Second, the Distributed SA handles overlaps

Page 222

3–InfiniBand® Cluster Setup and Administration, QLogic Distributed Subnet Administration, 3-18 IB0054606-02 A
SID
The SID is the primary configuration sett

Page 223

3–InfiniBand® Cluster Setup and Administration, QLogic Distributed Subnet Administration, IB0054606-02 A 3-19
Dbg
This parameter controls how much logging

Page 224

3–InfiniBand® Cluster Setup and Administration, Changing the MTU Size, 3-20 IB0054606-02 A
Changing the MTU Size
The Maximum Transfer Unit (MTU) size enabl

Page 225 - IB0054606-02 A E-7

3–InfiniBand® Cluster Setup and Administration, Managing the ib_qib Driver, IB0054606-02 A 3-21
Managing the ib_qib Driver
The startup script for ib_qib is

Page 226

3–InfiniBand® Cluster Setup and Administration, Managing the ib_qib Driver, 3-22 IB0054606-02 A
See the ib_qib man page for more details.
Configure the ib_

Page 227 - Troubleshooting SRP Issues

3–InfiniBand® Cluster Setup and Administration, Managing the ib_qib Driver, IB0054606-02 A 3-23
You can check to see if opensmd is configured to autostart

Page 228

3–InfiniBand® Cluster Setup and Administration, More Information on Configuring and Loading Drivers, 3-24 IB0054606-02 A
/ipathfs/1/counter_names
/ipathfs/

Page 229

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, IB0054606-02 A 3-25
Performance Tuning
Tuning compute or storage

Page 230 - Following is an example:

IB0054606-02 A v, OFED+ Host Software Release 1.5.4 User Guide
Configuring for ssh

Page 231

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, 3-26 IB0054606-02 A
If cpuspeed or powersaved are being used as

Page 232 - E-14 IB0054606-02 A

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, IB0054606-02 A 3-27
Increasing the number of kernel receive queu

Page 233 - IB0054606-02 A E-15

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, 3-28 IB0054606-02 A
In the rare case that the node has more than

Page 234 - E-16 IB0054606-02 A

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, IB0054606-02 A 3-29
For setting all C-States to 0 where there is

Page 235 - IB0054606-02 A E-17

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, 3-30 IB0054606-02 A
High Risk Tuning for Intel Harpertown CPUs
Fo

Page 236

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, IB0054606-02 A 3-31
Additional Driver Module Parameter Tunings A

Page 237 - Solution:

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, 3-32 IB0054606-02 A
• value is the parameter value for the parti

Page 238 - E-20 IB0054606-02 A

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, IB0054606-02 A 3-33
This command lets the driver automatically d

Page 239

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, 3-34 IB0054606-02 A
For example:
# cat /etc/modprobe.d/ib_ipoib.c

Page 240 - # ipath_mtrr

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, IB0054606-02 A 3-35
The values picked for the various checks and

Page 241

vi IB0054606-02 A, OFED+ Host Software Release 1.5.4 User Guide
Debugging MPI Programs

Page 242 - F-4 IB0054606-02 A

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, 3-36 IB0054606-02 A
AUTOMATIC vs. INTERACTIVE MODE
The tool perfo

Page 243

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, IB0054606-02 A 3-37
Applying the Results
Automatic mode versus in

Page 244 - Restarting InfiniPath

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, 3-38 IB0054606-02 A
• rpm (see “rpm” on page G-32)
• strings (see

Page 245 - Program Name Function

3–InfiniBand® Cluster Setup and Administration, Performance Settings and Management Tips, IB0054606-02 A 3-39
Remove Unneeded Services
The cluster administ

Page 246

3–InfiniBand® Cluster Setup and Administration, Host Environment Setup for MPI, 3-40 IB0054606-02 A
Other services may be required by your batch queuing s

Page 247 - IB0054606-02 A G-5

3–InfiniBand® Cluster Setup and Administration, Host Environment Setup for MPI, IB0054606-02 A 3-41
“Configuring for ssh Using ssh-agent” on page 3-43 sho

Page 248 - Sample output:

3–InfiniBand® Cluster Setup and Administration, Host Environment Setup for MPI, 3-42 IB0054606-02 A
3. On each of the IB node systems, create or edit the

Page 249

3–InfiniBand® Cluster Setup and Administration, Host Environment Setup for MPI, IB0054606-02 A 3-43
At this point, any end user should be able to login to

Page 250

3–InfiniBand® Cluster Setup and Administration, Checking Cluster and Software Status, 3-44 IB0054606-02 A
5. Finally, test by logging into the front end n

Page 251

3–InfiniBand® Cluster Setup and Administration, Checking Cluster and Software Status, IB0054606-02 A 3-45
iba_opp_query
iba_opp_query is used to check the

Page 252

IB0054606-02 A vii, OFED+ Host Software Release 1.5.4 User Guide
Running SHMEM Programs

Page 253

3–InfiniBand® Cluster Setup and Administration, Checking Cluster and Software Status, 3-46 IB0054606-02 A
mtu 0x4 rate

Page 254

3–InfiniBand® Cluster Setup and Administration, Checking Cluster and Software Status, IB0054606-02 A 3-47
ibv_devinfo
ibv_devinfo queries RDMA devices. Us

Page 255

3–InfiniBand® Cluster Setup and Administration, Checking Cluster and Software Status, 3-48 IB0054606-02 A

Page 256

IB0054606-02 A 4-1
4 Running MPI on QLogic Adapters
This section provides information on using the Message-Passing Interface (MPI) on QLogic IB HCAs. E

Page 257

4–Running MPI on QLogic Adapters, Open MPI, 4-2 IB0054606-02 A
Installation
Follow the instructions in the QLogic Fabric Software Installation Guide for in

Page 258

4–Running MPI on QLogic Adapters, Open MPI, IB0054606-02 A 4-3
These wrapper scripts pass most options on to the underlying compiler. Use the documentatio

Page 259

4–Running MPI on QLogic Adapters, Open MPI, 4-4 IB0054606-02 A
The first choice will use verbs by default, and any with the _qlc string will use PSM by de

Page 260

4–Running MPI on QLogic Adapters, Open MPI, IB0054606-02 A 4-5
Configuring MPI Programs for Open MPI
When configuring an MPI program (generating header fil

Page 261

4–Running MPI on QLogic Adapters, Open MPI, 4-6 IB0054606-02 A
The easiest way to use other compilers with any MPI that comes with QLogic OFED+ is to use

Page 262 - G-20 IB0054606-02 A

4–Running MPI on QLogic Adapters, Open MPI, IB0054606-02 A 4-7
For Fortran 90 programs:
$ mpif90 -f90=pgf90 -show pi3f90.f90 -o pi3f90
pgf90 -I/usr/include/

Page 263

viii IB0054606-02 A, OFED+ Host Software Release 1.5.4 User Guide
A Benchmark Programs
Benchmark 1: Measuring MPI Latency Between Two Nodes

Page 264

4–Running MPI on QLogic Adapters, Open MPI, 4-8 IB0054606-02 A
• The default hardware context/CPU mappings can be changed on the QDR IB Adapters (QLE734x).

Page 265

4–Running MPI on QLogic Adapters, Open MPI, IB0054606-02 A 4-9
Performance can be improved in some cases by disabling IB hardware contexts when they are n

Page 266

4–Running MPI on QLogic Adapters, Open MPI, 4-10 IB0054606-02 A
To explicitly disable context sharing, set this environment variable in one of the two fol

Page 267 - OS_version/updates

4–Running MPI on QLogic Adapters, Open MPI, IB0054606-02 A 4-11
Context Sharing Error Messages
The error message when the context limit is exceeded is:
No f

Page 268

4–Running MPI on QLogic Adapters, Open MPI, 4-12 IB0054606-02 A
mpihosts File Details
As noted in “Create the mpihosts File” on page 4-3, a hostfile (also

Page 269

4–Running MPI on QLogic Adapters, Open MPI, IB0054606-02 A 4-13
• The command line option -hostfile can be used as shown in the following command line:
$mp

Page 270

4–Running MPI on QLogic Adapters, Open MPI, 4-14 IB0054606-02 A
This option spawns n instances of program-name. These instances are called node programs.
G

Page 271

4–Running MPI on QLogic Adapters, Open MPI, IB0054606-02 A 4-15
Open MPI directs UNIX standard output and error from remote nodes to the node that invoked

Page 272 - # modprobe -v ib_qib

4–Running MPI on QLogic Adapters, Open MPI, 4-16 IB0054606-02 A
Open MPI adds the base-name of the current node’s bindir (the directory where Open MPI’s e

Page 273

4–Running MPI on QLogic Adapters, Open MPI, IB0054606-02 A 4-17
Setting MCA Parameters
The -mca switch allows the passing of parameters to various Modular

Page 274 - Common Tasks and Commands

IB0054606-02 A ix, OFED+ Host Software Release 1.5.4 User Guide
Configuring SRP for Native IB Storage

Page 275 - Function Command

4–Running MPI on QLogic Adapters, Open MPI, 4-18 IB0054606-02 A
Environment Variables
Table 4-6 contains a summary of the environment variables that are re

Page 276 - Table G-4. Useful Files

4–Running MPI on QLogic Adapters, Open MPI, IB0054606-02 A 4-19
IPATH_NO_CPUAFFINITY When set to 1, the PSM library will skip trying to set processor affi

Page 277

4–Running MPI on QLogic Adapters, Open MPI, 4-20 IB0054606-02 A
Job Blocking in Case of Temporary IB Link Failures
By default, as controlled by mpirun’s qu

Page 278

4–Running MPI on QLogic Adapters, Open MPI and Hybrid MPI/OpenMP Applications, IB0054606-02 A 4-21
Open MPI and Hybrid MPI/OpenMP Applications
Open MPI sup

Page 279

4–Running MPI on QLogic Adapters, Debugging MPI Programs, 4-22 IB0054606-02 A
Debugging MPI Programs
Debugging parallel programs is substantially more diff

Page 280 - G-38 IB0054606-02 A

4–Running MPI on QLogic Adapters, Debugging MPI Programs, IB0054606-02 A 4-23
NOTE: The TotalView® debugger can be used with the Open MPI supplied in this r

Page 281

4–Running MPI on QLogic Adapters, Debugging MPI Programs, 4-24 IB0054606-02 A

Page 282 - Other Software Packages

IB0054606-02 A 5-1
5 Using Other MPIs
This section provides information on using other MPI implementations. Detailed information on using Open MPI is p

Page 283

5–Using Other MPIs, Installed Layout, 5-2 IB0054606-02 A
These MPI implementations run on multiple interconnects, and have their own mechanisms for select

Page 284

5–Using Other MPIs, Open MPI, IB0054606-02 A 5-3
Open MPI
Open MPI is an open source MPI-2 implementation from the Open MPI Project. Pre-compiled versions
