HP StorageWorks Fabric OS 5.
Legal and notice information ©Copyright 2005 Hewlett-Packard Development Company, L.P. ©Copyright2005BrocadeCommunicationsSystems,Incorporated. Hewlett-Packard Company makes no warranty of any kind with regard to this material, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose.
Contents About this guide . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Intended audience . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Related documentation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Document conventions and symbols . . . . . . . . . . . . . . . . . . . . . . . . . HP technical support . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3 Activating and accessing Fabric Watch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 Activating Fabric Watch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Activating with telnet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Activating with Advanced Web Tools. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
14 15 16 17 18 19 fwFruCfg configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Configuring Fabric Watch using SNMP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Enabling Fabric Watch Traps in SNMP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Configuring SNMP management host IP address . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
About this guide This administrator guide provides information about: • Setting up HP StorageWorks Fabric Watch software • Managing your SAN via HP StorageWorks Fabric Watch software Intended audience This guide is intended for: • system administrators responsible for setting up HP StorageWorks Fibre Channel Storage Area Network (SAN) switches • technicians responsible for maintaining the Fabric Operating System (OS) Related documentation Documentation, including white papers and best practices documents,
Document conventions and symbols Table 1 Document conventions Convention Element Medium blue text: Figure 1 Cross-reference links and e-mail addresses Medium blue, underlined text (http://www.hp.
HP technical support Telephone numbers for worldwide technical support are listed on the HP support web site: http://www.hp.com/support/. Collect the following information before calling: • Technical support registration number (if applicable) • Product serial numbers • Product model names and numbers • Applicable error messages • Operating system type and revision level • Detailed, specific questions For continuous quality improvement, calls may be recorded or monitored.
1 An introduction to Fabric Watch This chapter contains the following information: • Fabric Watch overview, page 11 • Introduction to fabric health, page 12 Fabric Watch overview Fabric Watch is an optional Storage Area Network (SAN) monitoring software for B-Series HP StorageWorks switches running Fabric OS 2.2 or higher. It enables each switch to constantly watch its SAN fabric for potential faults and to automatically alert you to problems long before they become costly failures.
Following an event, Fabric Watch adds an entry to the internal port log for an individual switch and freezes the log to ensure that detail-level information is available. • RapiTrap Following an event, Fabric Watch forwards event information to a proxy switch, which then forwards the information to a server to notify you. • Email notification Following an event, Fabric Watch creates and sends an Informational email to a designated recipient. Fabric Watch is designed for rapid deployment.
2 Fabric Watch concepts This chapter contains the following sections: • Fabric watch components, page 13 • Configuring events, page 19 • Port persistence, page 25 • Notification methods, page 25 • Switch policies, page 27 • Interpreting event messages, page 27 Fabric watch components Fabric Watch uses a hierarchical organization to track the network device information it monitors. There is a class, area, and element associated with every monitored behavior.
Table 2 Fabric Watch classes (continued) Class Description Performance Monitor Serves as a tuning tool. Performance Monitor classes group areas that track the source and destination of traffic. Use the Performance Monitor class thresholds and alarms to determine traffic load and flow and to reallocate resources appropriately. The Performance Monitor class is divided into the areas AL_PA Performance Monitor, EE (end-to-end) Performance Monitor, and Filter Performance Monitor.
Fabric class areas Table 4 lists Fabric Watch areas in the Fabric class and describes each area. Table 4 Fabric class sreas Area Description Domain ID Changes Monitors forcible domain ID changes. Forcible domain ID changes occur when there is a conflict of domain IDs in a single fabric and the principal switch has to assign another domain ID to a switch. Fabric Logins Occurs when ports and devices initialize with the fabric.
• HP StorageWorks SAN Switch 2/8V, 2/16V and 2/16N • HP StorageWorks SAN Switch 2/32 • HP StorageWorks SAN Switch 4/32 Performance monitor class areas Table 6 lists Fabric Watch areas in the Performance Monitor class and describes each area. Table 6 Performance monitor class areas Area Indicates Customer Define Relies on performance monitor telnet commands. For more information on this area, refer to the HP StorageWorks Fabric OS 5.x command reference guide.
Table 7 Port class areas (continued) Area Indicates Receive (RX) Performance The percentage of maximum bandwidth consumed in packet receipts. State Changes The state of the port has changed for one of the following reasons: • The port has gone offline. • The port has come online. • The port is testing. • The port is faulty. • The port has become an E_Port. • The port has become an F/FL_Port. • The port has segmented. • The port has become a trunk port.
Table 9 Security class areas (continued) Area Indicates Invalid Signatures If a switch cannot verify the signature of a packet, the switch rejects the packet and the signature becomes invalid. Invalid Timestamps If a time interval becomes too great from the time a packet is sent to the time it is received, the timestamp of the packet becomes invalid and the switch rejects it. Login Violation A login violation occurs when a secure fabric detects a login failure.
Table 10 SFP class areas (continued) Area Description Transmit Power The transmit power area measures the amount of outgoing laser, in µwatts. Use this to determine the condition of the SFP. If the counter often exceeds the threshold, the SFP is deteriorating. Current The current area measures the amount of supplied current to the SFP transceiver. Current area events indicate hardware failures. Supply Voltage The supply voltage area measures the amount of voltage supplied to the SFP.
Triggered event behavior If you do not want notification during each sample period from the port hardware failure to the time of its repair, you can define the event behavior as triggered. When an event behavior is defined as triggered, Fabric Watch sends only one event notification when the fabric meets the criteria for the event. It does not send out any more notifications. For example, when a port fails, Fabric Watch sends you a notification of the failure.
Figure shows an example in which each time a signal crosses the high limit, an event occurs. The blue arrows indicate the area where the event criteria is met. In this case, there is a great deal of fluctuation. Even when the monitor is set to triggered, a number of messages are sent. Figure 1 Threshold monitoring Figure shows how to limit the number of event notifications using a buffer. When you specify a buffer, events cannot occur both above the high threshold and below the low threshold.
Specifying a time base If you specify a time base value other than none (seconds, minute, hour, or day), Fabric Watch does not use the current data value. Instead, it calculates the difference between the current data value and the data value as it existed one time base ago. It compares this difference to the threshold boundary limit. For example, if you specify the time base minute, Fabric Watch calculates the counter value difference between two samples a minute apart.
does not trigger an event even though the absolute value of the counter reaches 4, which is well above the high threshold. Figure 5 Example without an event Event settings This section describes how Fabric Watch compares a fabric element’s data value against a threshold value to determine whether or not to trigger an event. It describes how a specified buffer zone impacts event triggering.
Below event trigger The Below event trigger generates an event when a data value becomes less than the low threshold boundary. When a buffer is defined, the data value must be below the buffer value and the low threshold. Changed event trigger Use the Changed event trigger for an element that requires “rate of change” monitoring. When Fabric Watch detects a change in the counter value between two sample periods (defined by the time base), it triggers an event regardless of high or low threshold settings.
Port persistence In the case of port monitoring, there is an additional factor to consider. The data collected in port monitoring can vary a lot over short time periods. Therefore, the port can become a source of frequent event messages (the data can exceed the threshold range and return to a value within the threshold range). Fabric Watch uses port persistence for a port event that requires the transition of the port into a marginal status.
The trap stores event information but does not actively send alerts. Port changes do not generate SNMP traps. RAPITrap RAPITrap is a Fabric Watch alarm that actively alerts you to events. After you enable RAPITrap, Fabric Watch forwards all event information to a designated proxy switch. The host API automatically configures the proxy switch, based on firmware version. The switch forwards the information to a server and alerts the SAN manager to event activity.
Switch policies Switch policies are a series of rules that define specific states for the overall switch. Fabric OS interacts with Fabric Watch using these policies. Each rule defines the number of types of errors that transitions the overall switch state into a state that is not healthy. For example, you can specify a switch policy so that if a switch has two port failures, it is considered to be in a marginal state; if it has four failures, it is in a down state.
Fabric Watch concepts
3 Activating and accessing Fabric Watch This chapter contains the following sections: • Activating Fabric Watch, page 29 • Accessing Fabric Watch, page 29 Activating Fabric Watch Fabric Watch must be activated on each switch individually before use. Use telnet or Brocade Advanced Web Tools to activate Fabric Watch, as described next. Web Tools offers a user-friendly graphical interface that most users find convenient.
• ”Telnet” on page 30 • ”Advanced Web Tools” on page 30 • ”SNMP-Based enterprise managers” on page 30 • ”Configuration file” on page 32 Telnet Use a telnet session to: • Observe the current monitors on a switch with the fwShow command. • Query and modify threshold and alarm configurations (whether default or customized) with the fwConfigure command. • View and configure the FRU module with the fwFruCfg command.
• Receive alarm notification via SNMP traps. • View and configure the mail database. NOTE: The following instructions apply to the AdvantNet MIB browser. There may be some variation in the procedures when other MIB browsers are used. To configure Fabric Watch with an SNMP-based enterprise manager, begin by connecting to the switch using a MIB browser: 1. Open a MIB browser. 2. If not already done, load the appropriate MIB files.
5. Locate the Fabric Watch OID information per the following screen: Configuration file Use a configuration file to: • Upload a configuration file, make changes in a text editor, and download the file to all switches. • Upload and download the configuration file through a telnet session or with Web Tools. Uploading and downloading a configuration file to multiple switches efficiently populates your SAN with consistent Fabric Watch settings. For details about configuration file usage, see Appendix C.
4 Configuring Fabric Watch This chapter describes the procedures used to configure Fabric Watch and contains the following sections: • Configuring Fabric Watch thresholds, page 33 • Configuring notifications, page 46 • Configuring switch status policy, page 49 • Configuring FRUs, page 51 • Configuring Fabric Watch using Web Tools, page 51 • Configuring Fabric Watch using SNMP, page 52 Configuring Fabric Watch thresholds After it is activated, Fabric Watch starts using a set of default factory settings tha
The fwConfigure menu contains 12 menu items. The first 11 items correspond to the classes available for configuration. Item 12, which is the default, exits the fwConfigure application. 4. From the list displayed, enter the number corresponding to the class that you want to configure. For example, if you enter 5, the menu corresponding to the E-Port class appears.
Table 12 Element listing information - RXPerformance area menu Heading Meaning LastVal The data value of the element at the time of the last event LastState The last detected state of the element See ”Fabric watch components” on page 13 for more details about classes and areas. Step 2: Configure thresholds After you’ve identified and selected the appropriate class and areas, you can configure thresholds for those classes and areas.
Fabric Watch redraws the element table with the selected element disabled. The second row of information about the selected element does not appear any more, and the status of the element is set to disabled, as follows: Select threshold index => : (8..
4. advanced configuration To customize Fabric Watch monitoring to suit to your environment, use the advanced configuration option as follows: 1. Enter 4 at the command prompt. The system generates output similar to the following screen. The output you see varies based on the class and area you select. In the Advanced Configuration menu shown here, the output is based on the E-Port class and RXPerformance area.
Table 13 Element listing information - Advanced Configuration Menu Heading Meaning Index A numeric identifier assigned to the element ThresholdName A string identifier assigned to the element BehaviorType Frequency of alarm notifications BehaviorInt The element behavior interval, in seconds The threshold boundary section of the Advanced Configuration menu includes the threshold information for the selected area.
Fabric Watch displays the units of measurement (Unit), time base (Time base), low threshold (Low), high threshold (High) and buffer size (BufSize) for each column. See the following screen. In this example, a value of 80% is chosen as the custom high value for RXPerformance. The default value is 10.
the value, it does not take effect.
Step 3: Configure alarms Alarms act as a signal or alert that notifies you when a threshold has been crossed. You can configure the following types of notification settings for Fabric Watch: • Triggered A triggered behavior type signals you once, after a threshold has been crossed. Triggered is the default behavior type signal for all class areas. • Continuous A continuous behavior type signals you continuously after a threshold has been crossed.
2. Enter the total at the prompt. See the following example: 1 : change behavior type 11 2 : change behavior interval 12 3 : change threshold boundary level 13 4 : change custom unit 14 5 : change custom time base 15 6 : change custom low 16 7 : change custom high 17 8 : change custom buffer 18 9 : apply threshold boundary changes 10 : cancel threshold boundary changes Select choice => : (1..
Figure shows how to select the custom settings for the threshold alarm level for the RXPerformance area. The options are either to accept the default settings or provide custom settings.
Table 15 describes the 18 customization options displayed at the end of the Advanced Configuration menu. Table 15 44 Advanced configuration options Option Effect Input information change behavior type Changes the behavior type of a single element to either Triggered or Continuous. The change is volatile because this option is not saved to flash memory. Every time the switch is rebooted, this option is reset.
Table 15 Advanced configuration options (continued) Option Effect Input information change changed alarm Changes the notification method for changed event occurrences for this method, but only affects the custom column. The required notification methods change above alarm Changes the notification method for above event occurrences for this method, but only affects the custom column.
A port is not considered disabled if one of the port thresholds is still enabled. To enable all the thresholds for a port, at the command prompt enter: swd77:admin> fwconfigure --enable --port 9 Configuring notifications You can be notified of an alarm condition through a notification.
Configuring email notifications In environments where it is critical that you are notified about errors quickly, you might want to use email notifications. With email notifications, you can be notified of serious errors via email or a pager, so you can react quickly. To configure email notifications in a telnet session, enter the fwMailcfg command at the prompt. The fwMailcfg menu, shown in Figure , appears.
The following confirmation message appears: Email Alert is disabled! The system returns to the main fwMailCfg menu. 3: Enable Email Alert 1. Enter 3 in the fwMailCfg menu (shown in Figure ) to enable email alert for a specific class. The Config Show menu (shown in Figure ) appears. 2. Select a class for which Fabric Watch should enable email alerts.
Enter the email address of the person responsible for the specific class of alerts. Fabric Watch uses the default value, located between the brackets in the prompt, as the current email address for the class. A value of NONE indicates that no email address has been provided. NOTE: Email addresses must not exceed 128 characters. The system displays a confirmation message and returns to the main fwMailCfg menu. 6: Quit Enter 6 in the fwMailCfg menu (shown in Figure ) to exit the menu.
Monitor Health factors Fans Fan thresholds, faulty fans. WWN Faulty WWN card (applies to modular switches). CP Switch does not have a redundant CP (applies to modular switches). Blade Faulty blades (applies to modular switches). Flash Flash thresholds. Marginal Ports Port, E-Port, optical port, and copper port thresholds. Whenever these thresholds are persistently high, the port is Marginal. Faulty Ports Hardware-related port faults. Missing SFPs Ports that are missing SFP media.
asked to provide values for each FRU alarm state and alarm action. To accept the default value for each FRU (as shown in Figure ), press Return. After you have configured a FRU alarm state and alarm action, the values apply to all FRUs of that type. For example, the values specified for a slot FRU will apply to all slots in the enclosure.
2. Load the appropriate MIB files. First, load the Brocade common MIB file (BRCD_v5_0.mib), followed by the Brocade software MIB file (SW_v5_2.mib). If this is successful, the system displays a screen similar to Figure 2. Figure 15 Configuring Fabric Watch using SNMP In Figure 2, the MIB browser has populated the left side of the screen with a MIB tree that can be navigated. 3.
swd77:admin> agtcfgset Customizing MIB-II system variables ... At each prompt, do one of the following: o to accept current value, o enter the appropriate new value, o to skip the rest of configuration, or o to cancel any change. To correct any input mistake: erases the previous character, erases the whole line, sysDescr: [Fibre Channel Switch.] sysLocation: [End User Premise.] sysContact: [Field Support.
Fabric Watch displays a screen similar to the one shown in Figure . Figure 18 Example OID tree 6. Obtain the specific identifier for the element that will be modified. To get the identifier, click the swFwThresholdTable and swFwThresholdEntry directory, and run a get operation on swFwName. A list of elements appears in which each element is preceded by an identifier. Remember the numeric portion of the identifier, which appears before the “==>” symbol.
In this example, 83.1 is numeric identifier for the element referenced as resFlash000. 7. Traverse the fields beneath swFwClassAreaTable and swFwThresholdTable, appending the numeric identifier from the previous step to each field before performing a get or write operation. For example, to get and modify information specific to the resFlash000 element, select one of the fields and append “83.1” in the Object ID field on the right side of the screen. To modify information, you must define a write community.
Configuring Fabric Watch
A Default threshold values This appendix lists Fabric Watch default threshold values for all classes except the FRU class, which has none. The following tables list all of the default values used for the default Fabric Watch configuration settings when running Fabric OS v5.x. Environment class Table 17 provides default settings for areas in the Environment class. These defaults are hardware-dependent. Check the appropriate switch installation guide for differences in environmental requirements.
NOTE: For the 4/32 SAN Switch, there is no fan default threshold because the fans are not monitored by Fabric Watch. You can use fanShow to view the 4/32 SAN Switch fan status (OK or NOT OK). However, you cannot use fwConfigure to manipulate the threshold or alarm actions against any fans.
Table 17 Environment class threshold defaults (continued) Area Description Default threshold settings Default alarm settings Threshold state Power Supply Monitors power supply condition Unit: 1/0 (OK/FAULTY) Time Base: none Changed: 0 Below: 3 Above: 3 In-Between: 0 Informative Out_of_range In_range Informative Changed: 0 Below: 3 Above: 3 In-Between: 3 Informative Out_of_range Out_of_range In_range The default threshold settings for all platforms are: Low: 1 High: 0 Buffer:0 Temperature Moni
Fabric class Table 18 provides default settings for areas in the Fabric class. These defaults are hardware-dependent.
Table 18 Fabric Class threshold defaults (continued) Area Description Default threshold settings Default alarm settings Threshold state Zoning Changes Monitors changes to currently enabled zoning configurations Unit: zone change(s) Time Base: none Low: 0 High: 0 Buffer: 0 Changed: 0 Below: 0 Above: 0 In-Between: 0 Informative Informative Informative Informative SFP State Changes Monitors SFP state changes Unit: Change(s) Time Base: Low: 0 High: 0 Buffer: 0 Changed: 0 Exceeded: 0 Below: 0 Abov
Table 21 provides default settings for areas in the End-to-End Performance Monitor class.
Table 22 Port Class threshold defaults (continued) Area Description Default threshold settings Default alarm settings Threshold state Primitive Sequence Protocol Error Monitors the number of primitive sequence errors Unit: error(s) Time Base: minute Low: 0 High: 5 Buffer: 0 Changed: 0 Below: 0 Above: 0 In-Between: 0 Informative Informative Out_of_range In_range Receive Performance Monitors receive rate, by percentage Unit: percentage Time Base: minute Low: 0 High: 100 Buffer: 0 Changed: 0 Bel
Table 23 E-Port class threshold defaults (continued) Area Description Default threshold settings Default alarm settings Threshold state Loss of Synchronization Count Monitors the number of loss of synchronization errors Unit: error(s) Time Base: minute Low: 0 High: 5 Buffer: 0 Changed: 0 Below: 0 Above: 0 In-Between: 0 Informative Informative Out_of_range In_range Primitive Sequence Protocol Error Monitors the number of primitive sequence errors Unit: error(s) Time Base: minute Low: 0 High: 5
Table 24 F/FL-Port class threshold defaults (continued) Area Description Default Threshold Settings Default Alarm Settings Threshold State Transmit Performance Monitors the transmit rate, by percentage Unit: percentage Time Base: minute Low: 0 High: 100 Buffer: 0 Changed: 0 Below: 0 Above: 0 In-Between: 0 Informative Informative Informative Informative Informative Invalid CRC Count Monitors the number of CRC errors Unit: error(s) Time Base: minute Low: 1 High: 5 Buffer: 0 Changed: 0 Below: 0
Security class Table 26 provides default settings for areas in the Security class.
Table 26 Security class threshold defaults (continued) Area Description Default threshold settings Default alarm settings Threshold state Login Violations Monitors login violations Unit: violation(s) Time Base: minute Low: 1 High: 2 Buffer: 0 Changed: 0 Below: 0 Above: 3 In-Between: 0 Informative Informative Out_of_range In_range MS Violations Monitors MS violations Unit: violation(s) Time Base: minute Low: 1 High: 2 Buffer: 0 Changed: 0 Below: 0 Above: 3 In-Between: 0 Informative Informativ
Table 26 Security class threshold defaults (continued) Area Description Default threshold settings Default alarm settings Threshold state Telnet Violations Monitors telnet violations Unit: violation(s) Time Base: minute Low: 1 High: 2 Buffer: 0 Changed: 0 Below: 0 Above: 3 In-Between: 0 Informative Informative Out_of_range In_range TS Out of Sync Monitors instances in which the timestamp is out of sync Unit: violation(s) Time Base: minute Low: 1 High: 2 Buffer: 0 Changed: 0 Below: 0 Above: 3
B Basic Fabric Watch configuration guidelines A default Fabric Watch configuration is available for the purpose of saving setup time. As you gain familiarity with Advanced Fabric Watch features, they can be tailored to suit the fabric environment. The custom settings available in Fabric Watch provide an advanced user much needed flexibility of redefining boundary thresholds and alarm notification methods.
Basic Fabric Watch configuration guidelines
C Using Fabric Watch with configuration files When you activate Fabric Watch, the software starts, using the default settings described in Chapter 5. You cannot alter these default settings; if the default values do not suit your specific needs, configure Fabric Watch to use more appropriate settings. When you configure the new settings for Fabric Watch, your switch stores the settings in the configuration file.
Using Fabric Watch with configuration files
Index A above event triggers 23 activating with advanced web tool 29 with telnet 29 activating Fabric Watch 29 Admin View 29 advanced configuration options 44 alarms configuring 41 notifications 46 areas 14 assigning notification methods 26 audience 7 authorized reseller, HP 9 B below event trigger 24 buffer values 20 C email alert 26 environment class areas 14 event behavior types 19 event settings 23 F fabric class areas 15 Fabric Watch components 13 FRU class areas 15 fsconfigure 30 fwclassinit 29 fw
R rack stability, warning 9 RapiTrap 26 related documentation 7 resource class area 17 S security class areas 17 setting time base to none 21 SFP class areas 18 SNMP capabilities 30 SNMP trap 25 specifying a time base 22 Subscriber’s choice, HP 9 switch event (error) log entry 25 switch policies 27 switch status policy 49 symbols in text 8 system requirements 29 T Table 16, 17 technical support, HP 9 telnet capabilities 30 text symbols 8 threshold values 57 threshold values 20 thresholds configuring 35 di