Dell OpenManage Server Administrator Version 7.
Notes and Cautions
NOTE: A NOTE indicates important information that helps you make better use of your computer.
CAUTION: A CAUTION indicates potential damage to hardware or loss of data if instructions are not followed.
Information in this document is subject to change without notice. © 2011 Dell Inc. All rights reserved. Reproduction of these materials in any manner whatsoever without the written permission of Dell Inc. is strictly forbidden.
Contents
1 Introduction
What’s New in this Release
Sample Event Message Text
Viewing Alerts and Event Messages
Viewing Events in Microsoft Windows Server 2008
Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server
Chassis Intrusion Messages
Redundancy Unit Messages
Power Supply Messages
Memory Device Messages
Fan Enclosure Messages
AC Power Cord Messages
Hardware Log Sensor Messages
Processor Sensor Messages
Pluggable Device Messages
4 System Event Log Messages for IPMI Systems
Temperature Sensor Events
Voltage Sensor Events
Fan Sensor Events
Processor Status Events
Power Supply Events
Memory ECC Events
BMC Watchdog Events
Memory Events
1 Introduction Dell OpenManage Server Administrator generates event messages stored primarily in the operating system or Server Administrator event logs and sometimes in Simple Network Management Protocol (SNMP) traps. This document describes the event messages that are created by Server Administrator version 7.0 and displayed in the Server Administrator alert log. Server Administrator creates events in response to sensor status changes and other monitored parameters.
What’s New in this Release
The following new alerts are added:
• 2425 - State change on Physical disk from READY to Non-RAID.
• 2426 - State change on Physical disk from Non-RAID to READY.
• 2429 - Drive Prepared for Removal.
• 2430 - Drive Log Exported.
Messages Not Described in This Guide
This guide describes only event messages logged by Server Administrator and Storage Management that are displayed in the Server Administrator alert log.
Table 1-1. Understanding Event Messages Icon Alert Severity Component Status Warning / Non-critical An event that is not necessarily significant, but may indicate a possible future problem. For example, a Warning/Non-critical alert may indicate that a component (such as a temperature probe in an enclosure) has crossed a warning threshold. Critical / Failure / Error A significant event that indicates actual or imminent loss of data or loss of function.
• Memory Prefailure Sensor: Monitors memory modules by counting the number of Error Correction Code (ECC) memory corrections.
• Fan Enclosure Sensor: Monitors protective fan enclosures by detecting their removal from and insertion into the system, and by measuring how long a fan enclosure is absent from the chassis. This sensor monitors the chassis and any attached systems.
• AC Power Cord Sensor: Monitors the presence of AC power for an AC power cord.
Viewing Alerts and Event Messages An event log is used to record information about important events. Server Administrator generates alerts that are added to the operating system event log and to the Server Administrator alert log. To view these alerts in Server Administrator: 1 Select the System object in the tree view. 2 Select the Logs tab. 3 Select the Alert tab. You can also view the event log using your operating system’s event viewer.
Logging Messages to a Unicode Text File Logging messages to a Unicode text file is optional. By default, the feature is disabled in the Server Administrator. To enable this feature, modify the Event Manager section of the dcemdy.ini configuration file where xx is 32 or 64 bit depending on the operating system, as follows: • On systems running Microsoft Windows operating systems, you can locate the configuration file in the \dataeng\ini directory and set the property UnitextLog.
The System Log window displays a list of recently logged events. 4 To view the details of an event, double-click one of the event items. NOTE: You can also look up the dcsys.xml file, in the \omsa\log directory, to view the separate event log file, where the default install_path is C:\Program Files\Dell\SysMgt and xx is 32 or 64 depending on the operating system that is installed. Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server 1 Log in as root.
chassis intrusion Chassis location: Main System Chassis Previous state was: Critical (Failed) Chassis intrusion state: Closed
Viewing Events in VMware ESX/ESXi
1 Log in to the system running VMware ESX/ESXi with VMware vSphere Client.
2 Click View > Administration > System Logs.
3 Select the Server Log /var/log/messages entry from the drop-down list.
Viewing the Event Information
The event log for each operating system contains some or all of the following information:
• Date: The date the event occurred.
Table 1-2.
Table 1-2. Event Description Reference (continued)
Description Line Item | Explanation
Voltage sensor value (in Volts): | Specifies the voltage sensor value in volts, for example: Voltage sensor value (in Volts): 1.
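The description line items in Table 1-2 follow a "label: value" pattern, so an event description can be split into fields mechanically. The following Python sketch is illustrative only: the function name `parse_description` and the list of known labels are assumptions for this example, not part of Server Administrator, and the sample text is the chassis intrusion example shown earlier.

```python
import re

# Labels taken from the sample messages in this guide. This list is an
# assumption for illustration and is not an exhaustive set.
KNOWN_LABELS = [
    "Sensor location",
    "Chassis location",
    "Previous state was",
    "Chassis intrusion state",
    "Voltage sensor value (in Volts)",
]

def parse_description(text, labels=KNOWN_LABELS):
    """Split an event description into a {label: value} dictionary."""
    # Build one alternation that matches any known label followed by a colon.
    pattern = "|".join(re.escape(label) for label in labels)
    matches = list(re.finditer(r"(%s):" % pattern, text))
    fields = {}
    for i, match in enumerate(matches):
        # A value runs from the end of its label to the start of the next label.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        fields[match.group(1)] = text[match.end():end].strip()
    return fields

sample = (
    "Chassis location: Main System Chassis "
    "Previous state was: Critical (Failed) "
    "Chassis intrusion state: Closed"
)
fields = parse_description(sample)
print(fields["Chassis intrusion state"])  # prints "Closed"
```

Text that precedes the first known label is ignored, and a label missing from `KNOWN_LABELS` is treated as part of the previous value, so the label list needs to match the messages being parsed.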
2 Server Management Messages
The following tables list, in numerical order, each event ID and its corresponding description, along with its severity and cause.
NOTE: For corrective actions, see the appropriate documentation.
Server Administrator General Messages
The messages in Table 2-1 indicate that certain alert systems are up and working.
Table 2-1. Server Administrator General Messages
Event ID | Description | Severity | Cause
0000 | | Information | User cleared the log from Server Administrator.
Table 2-1. Server Administrator General Messages (continued)
Event ID | Description | Severity | Cause
1003 | A previously scheduled system BIOS update has been canceled | Information | The user decides to cancel the flash BIOS update, or an error occurs during the flash.
1004 | Thermal shutdown protection has been initiated | Error | This message is generated when a system is configured for thermal shutdown due to an error event.
Table 2-1. Server Administrator General Messages (continued) Event Description ID Severity Cause 1007 User initiated host system control action Action requested was: Information User requested a host system control action to reboot, power off, or power cycle the system. Alternatively, the user had indicated protective measures to be initiated in the event of a thermal shutdown. 1008 Systems Management Data Manager Started Information Systems Management Data Manager services were started.
Table 2-1. Server Administrator General Messages (continued) Event Description ID Severity Cause 1013 System Peak Power detected new peak value Peak value (in Watts): Information The system peak power sensor detected a new peak value in power consumption. The new peak value in Watts is provided.
Table 2-2. Temperature Sensor Messages Event Description ID Severity Cause 1050 Temperature sensor has failed Error A temperature sensor on the backplane board, system board, or the carrier in the specified system failed. The sensor location, chassis location, previous state, and temperature sensor value are provided.
Table 2-2. Temperature Sensor Messages (continued)
Event ID | Description | Severity | Cause
1052 | Temperature sensor returned to a normal value Sensor location: Chassis location: Previous state was: | Information | A temperature sensor on the backplane board, system board, or drive carrier in the specified system returned to a valid range after crossing a failure threshold.
Table 2-2. Temperature Sensor Messages (continued) Event Description ID Severity Cause 1054 Temperature sensor detected a failure value Error A temperature sensor on the backplane board, system board, or drive carrier in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and temperature sensor value are provided.
Cooling Device Messages The cooling device sensors listed in Table 2-3 monitor how well a fan is functioning. Cooling device messages provide status and warning information for fans in a particular chassis. Table 2-3. Cooling Device Messages Event Description ID Severity Cause 1100 Fan sensor has failed Error A fan sensor in the specified system is not functioning. The sensor location, chassis location, previous state, and fan sensor value information is provided.
Table 2-3. Cooling Device Messages (continued) Event Description ID Severity 1102 Fan sensor returned to a normal value Information A fan sensor reading on the specified system returned to a valid range after crossing a warning threshold. The sensor location, chassis location, previous state, and fan sensor value information is provided.
Table 2-3. Cooling Device Messages (continued) Event Description ID Severity Cause 1105 Fan sensor detected a non-recoverable value Error A fan sensor detected an error from which it cannot recover. The sensor location, chassis location, previous state, and fan sensor value information is provided.
Voltage Sensor Messages The voltage sensors listed in Table 2-4 monitor the number of volts across critical components. Voltage sensor messages provide status and warning information for voltage sensors in a particular chassis. Table 2-4. Voltage Sensor Messages Event Description ID Severity Cause 1150 Voltage sensor has failed Error A voltage sensor in the specified system failed. The sensor location, chassis location, previous state, and voltage sensor value information is provided.
Table 2-4. Voltage Sensor Messages (continued) Event Description ID Severity 1152 Voltage sensor returned to a normal value Information A voltage sensor in the specified system returned to a valid range after crossing a failure threshold. The sensor location, chassis location, previous state, and voltage sensor value information is provided.
Table 2-4. Voltage Sensor Messages (continued) Event Description ID Severity Cause 1154 Voltage sensor detected a failure value Error A voltage sensor in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and voltage sensor value information is provided. Error A voltage sensor in the specified system detected an error from which it cannot recover.
Current Sensor Messages The current sensors listed in Table 2-5 measure the amount of current (in amperes) that is traversing critical components. Current sensor messages provide status and warning information for current sensors in a particular chassis. Table 2-5. Current Sensor Messages Event Description ID Severity Cause 1200 Current sensor has failed Error A current sensor in the specified system failed. The sensor location, chassis location, previous state, and current sensor value are provided.
Table 2-5. Current Sensor Messages (continued) Event Description ID Severity Cause 1201 Current sensor value unknown Error A current sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and a nominal current sensor value information is provided.
Table 2-5. Current Sensor Messages (continued) Event Description ID Severity Cause 1203 Current sensor detected a warning value Warning A current sensor in the specified system exceeded its warning threshold. The sensor location, chassis location, previous state, and current sensor value are provided. Error A current sensor in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and current sensor value are provided.
Table 2-5. Current Sensor Messages (continued) Event Description ID Severity Cause 1205 Current sensor detected a non-recoverable value Error A current sensor in the specified system detected an error from which it cannot recover. The sensor location, chassis location, previous state, and current sensor value are provided.
Table 2-6. Chassis Intrusion Messages Event Description ID Severity Cause 1250 Error A chassis intrusion sensor in the specified system failed. The sensor location, chassis location, previous state, and chassis intrusion state are provided. Error A chassis intrusion sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and chassis intrusion state are provided.
Table 2-6. Chassis Intrusion Messages (continued) Event Description ID Severity Cause 1253 Warning A chassis intrusion sensor in the specified system detected that a system cover is currently being opened and the system is operating. The sensor location, chassis location, previous state, and chassis intrusion state information is provided. Critical A chassis intrusion sensor in the specified system detected that the system cover was opened while the system was operating.
Redundancy Unit Messages Redundancy means that a system chassis has more than one of certain critical components. Fans and power supplies, for example, are so important for preventing damage or disruption of a computer system that a chassis may have “extra” fans or power supplies installed. Redundancy allows a second or nth fan to keep the chassis components at a safe temperature when the primary fan has failed. Redundancy is normal when the intended number of critical components are operating.
Table 2-7. Redundancy Unit Messages (continued) Event Description ID Severity 1302 Redundancy not applicable Information A redundancy sensor in the specified system detected that a unit was not redundant. The redundancy location, chassis location, previous redundancy state, and the number of devices required for full redundancy information is provided.
Table 2-7. Redundancy Unit Messages (continued) Event Description ID Severity 1304 Redundancy regained Information A redundancy sensor in the specified system detected that a “lost” redundancy device has been reconnected or replaced; full redundancy is in effect. The redundancy unit location, chassis location, previous redundancy state, and the number of devices required for full redundancy information is provided.
Table 2-7. Redundancy Unit Messages (continued) Event Description ID Severity Cause 1306 Redundancy lost Error A redundancy sensor in the specified system detected that one of the components in the redundant unit has been disconnected, has failed, or is not present. The redundancy unit location, chassis location, previous redundancy state, and the number of devices required for full redundancy are provided.
Power Supply Messages The power supply sensors monitor how well a power supply is functioning. The power supply messages listed in Table 2-8 provide status and warning information for power supplies present in a particular chassis. Table 2-8. Power Supply Messages Event Description ID Severity Cause 1350 Error A power supply sensor in the specified system failed.
Table 2-8. Power Supply Messages (continued) Event Description ID 1351 Severity Cause Information A power supply sensor in the specified system could not Sensor location:
Table 2-8. Power Supply Messages (continued) Event Description ID Severity Cause 1353 Warning A power supply sensor reading in the specified system exceeded a user-definable warning threshold. The sensor location, chassis location, previous state, power supply type, additional power supply status, and configuration error type information are provided. Error A power supply has been disconnected or has failed.
Table 2-8. Power Supply Messages (continued)
Event ID | Description | Severity | Cause
1355 | Power supply sensor detected a non-recoverable value Sensor location: Chassis location: Previous state was: Power Supply type: | Error | A power supply sensor in the specified system detected an error from which it cannot recover.
Memory Device Messages The memory device messages listed in Table 2-9 provide status and warning information for memory modules present in a particular system. Memory devices determine health status by monitoring the ECC memory correction rate and the type of memory events that have occurred. NOTE: A critical status does not always indicate a system failure or loss of data. In some instances, the system has exceeded the ECC correction rate.
Fan Enclosure Messages Some systems are equipped with a protective enclosure for fans. The fan enclosure messages listed in Table 2-10 report whether foreign objects are present in an enclosure and how long a fan enclosure has been missing from a chassis. Table 2-10. Fan Enclosure Messages Event Description ID Severity Cause 1450 Critical/ Failure / Error The fan enclosure sensor in the specified system failed. The sensor and chassis location information is provided.
Table 2-10. Fan Enclosure Messages (continued) Event Description ID Severity Cause 1454 Error A fan enclosure has been removed from the specified system for a user-definable length of time. The sensor and chassis location information is provided. Error A fan enclosure sensor in the specified system detected an error from which it cannot recover. The sensor and chassis location are provided.
AC Power Cord Messages The AC power cord messages listed in Table 2-11 provide status and warning information for power cords that are part of an AC power switch, if your system supports AC switching. Table 2-11. AC Power Cord Messages Event Description ID Severity Cause 1500 Critical/ Failure/ Error An AC power cord sensor in the specified system failed. The AC power cord status cannot be monitored. The sensor and chassis location information is provided.
Table 2-12. Hardware Log Sensor Messages
Event ID | Description | Severity | Cause
1550 | Log monitoring has been disabled Log type: | Warning | A hardware log sensor in the specified system is disabled. The log type information is provided.
1551 | Log status is unknown Log type: | Information | A hardware log sensor in the specified system could not obtain a reading. The log type information is provided.
Processor Sensor Messages The processor sensors monitor how well a processor is functioning. Processor messages listed in Table 2-13 provide status and warning information for processors in a particular chassis. Table 2-13. Processor Sensor Messages Event Description ID Severity Cause 1600 Critical/ Failure/ Error A processor sensor in the specified system is not functioning. The sensor location, chassis location, previous state and processor sensor status information is provided.
Table 2-13. Processor Sensor Messages (continued) Event Description ID Severity 1602 Information A processor sensor in the specified system transitioned back to a normal state. The sensor location, chassis location, previous state and processor sensor status are provided.
Table 2-13. Processor Sensor Messages (continued) Event Description ID Severity Cause 1604 Error A processor sensor in the specified system is disabled, has a configuration error, or experienced a thermal trip. The sensor location, chassis location, previous state and processor sensor status are provided. Error A processor sensor in the specified system has failed. The sensor location, chassis location, previous state and processor sensor status are provided.
Pluggable Device Messages
The pluggable device messages listed in Table 2-14 provide status and error information when some devices, such as memory cards, are added or removed.
Table 2-14. Pluggable Device Messages
Event ID | Description | Severity | Cause
1650 | Device location: | Information | A pluggable device event message of unknown type was received. The device location, chassis location, and additional event
Table 2-14. Pluggable Device Messages (continued) Event Description ID Severity 1652 Information A device was removed from the specified system. The device location, chassis location, and additional event details, if available, are provided.
Battery Sensor Messages The battery sensors monitor how well a battery is functioning. The battery messages listed in Table 2-15 provide status and warning information for batteries in a particular chassis. Table 2-15.
Table 2-15. Battery Sensor Messages (continued) Event Description ID 1702 Battery sensor returned to a normal value 1703 Battery sensor detected a warning value Severity Information A battery sensor in the specified system detected that a Sensor Location: back to a normal Chassis Location:
Table 2-15. Battery Sensor Messages (continued) Event Description ID Severity Cause 1705 Error A battery sensor in the specified system could not retrieve a value. The sensor location, chassis location, previous state, and battery sensor status information is provided.
Table 2-16. SD Card Device Messages Event ID Description 1751 SD card device sensor value unknown 1752 SD card device returned to Information An SD card device normal sensor in the specified system detected that Sensor location: transitioned back to a Chassis location: sensor location, chassis location, previous state, Previous state was: and SD card device type information is SD card device type:
Table 2-16. SD Card Device Messages Event ID Description Severity Cause 1753 SD card device detected a warning Warning An SD card device sensor in the specified system detected a warning condition. The sensor location, chassis location, previous state, and SD card device type information is provided. The SD card state is provided if an SD card is present in the SD card device. Error An SD card device sensor in the specified system detected an error.
Table 2-16. SD Card Device Messages
Event ID | Description | Severity | Cause
1755 | SD card device sensor detected a non-recoverable value Sensor location: Chassis location: Previous state was: SD card device type: SD card state: | Error | An SD card device sensor in the specified system detected an error from which it cannot recover.
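In the tables above, each message group appears to occupy its own block of 50 event IDs (for example, temperature sensor messages start at 1050 and cooling device messages at 1100). The Python sketch below uses that pattern to look up the group for an event ID. The names `MESSAGE_GROUPS` and `message_group` are hypothetical, and the dictionary includes only groups whose event IDs appear in the tables above.

```python
# Starting event ID of each 50-ID block -> message group, taken from the
# tables in this chapter. Groups whose IDs are not shown are left out.
MESSAGE_GROUPS = {
    1000: "Server Administrator General",
    1050: "Temperature Sensor",
    1100: "Cooling Device",
    1150: "Voltage Sensor",
    1200: "Current Sensor",
    1250: "Chassis Intrusion",
    1300: "Redundancy Unit",
    1350: "Power Supply",
    1450: "Fan Enclosure",
    1500: "AC Power Cord",
    1550: "Hardware Log Sensor",
    1600: "Processor Sensor",
    1650: "Pluggable Device",
    1700: "Battery Sensor",
    1750: "SD Card Device",
}

def message_group(event_id):
    """Return the message group for an event ID, or None if it is not listed."""
    block_start = (event_id // 50) * 50  # round down to the start of the block
    return MESSAGE_GROUPS.get(block_start)

for event_id in (1004, 1054, 1306, 1755):
    print(event_id, message_group(event_id))
```

Returning `None` for unlisted blocks keeps the lookup from guessing a group for IDs that this chapter does not document.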
Chassis Management Controller Messages Alerts sent by the Dell M1000e Chassis Management Controller (CMC) are organized by severity. That is, the event ID of the CMC trap indicates the severity (informational, warning, critical, or non-recoverable) of the alert. Each CMC alert includes the originating system name, location, and event message text. The alert message text matches the corresponding Chassis Event Log message text that is logged by the sending CMC for that event. Table 2-17.
3 Storage Management Message Reference
The alert and event management features of Dell OpenManage Server Administrator Storage Management let you monitor the health of storage resources such as controllers, enclosures, physical disks, and virtual disks.
Alert Monitoring and Logging
The Storage Management Service performs alert monitoring and logging. By default, the Storage Management service starts when the managed system starts up.
Alert Message Format with Substitution Variables When you view an alert in the Server Administrator alert log, the alert identifies the specific components such as the controller name or the virtual disk name to which the alert applies. In an actual operating environment, a storage system can have many combinations of controllers and disks as well as user-defined names for virtual disks and other components. Each environment is unique in its storage configuration and user-defined names.
NOTE: A, B, C and X, Y, Z in the following examples are variables representing the storage object name or number. Table 3-2. Message Format with Variables for Each Storage Object Storage Object Message Variables Controller Message Format: Controller A (Name) Message Format: Controller A For example, 2326 A foreign configuration has been detected: Controller 1 (PERC 5/E Adapter) NOTE: The controller name is not always displayed.
Table 3-2. Message Format with Variables for Each Storage Object (continued) Storage Object Message Variables SAS Power Supply Message Format: Power Supply X Controller A, Connector B, Enclosure C For example, 2312 A power supply in the enclosure has an AC failure: Power Supply 1, Controller 1, Connector 0, Enclosure 2 SCSI Temperature Probe Message Format: Temperature Probe X Controller A, Connector B, Target ID C where C is the SCSI ID number of the EMM managing the temperature probe.
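Because the variable part of a storage alert follows the fixed pattern shown in Table 3-2 (an object type followed by its number, with an optional name in parentheses, separated by commas), it can be split into components programmatically. The following Python sketch assumes the alert text has the form "event ID, message, colon, object list", as in the examples above. The function name `parse_storage_alert` is hypothetical, and the regular expression is an illustration based only on those examples.

```python
import re

# Matches one storage object such as "Power Supply 1" or
# "Controller 1 (PERC 5/E Adapter)": a type, a number, and an optional name.
OBJECT_RE = re.compile(
    r"^(?P<type>[A-Za-z ]+?) (?P<number>\d+)(?: \((?P<name>.+)\))?$"
)

def parse_storage_alert(text):
    """Split an alert line into its event ID, message, and storage objects."""
    event_id, rest = text.split(" ", 1)
    # The object list follows the last colon in the alert text.
    message, _, object_list = rest.rpartition(": ")
    objects = []
    for part in object_list.split(", "):
        match = OBJECT_RE.match(part.strip())
        if match:
            objects.append((match["type"], int(match["number"]), match["name"]))
    return int(event_id), message, objects

alert = (
    "2312 A power supply in the enclosure has an AC failure: "
    "Power Supply 1, Controller 1, Connector 0, Enclosure 2"
)
event_id, message, objects = parse_storage_alert(alert)
print(event_id, objects)
```

This sketch does not handle object names that themselves contain a comma or a colon followed by a space. A production parser would need a stricter grammar.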
Alert Message Change History
The following table describes the changes made to the Storage Management alerts from the previous release of Storage Management to the current release.
Table 3-3. Alert Message Change History
Storage Management 4.0
Product Versions to which changes apply: Storage Management 4.0, Server Administrator 5.0, Dell OpenManage 7.0
New Alerts: 2425, 2426, 2429, 2430
Deleted Alerts: None
Modified Alerts: None
Storage Management 3.
Table 3-3. Alert Message Change History (continued)
Product Versions to which changes apply: Storage Management 3.3.0, Server Administrator 4.3.0, Dell OpenManage 6.3.0
New Alerts: 2394, 2395, 2396, 2397, 2398, 2399, 2400, 2401, 2402, 2403, 2404
Deleted Alerts: None
Modified Alerts: Alert severity changed for 1151 and 1351
Storage Management 3.2
Product Versions to which changes apply: Storage Management 3.2.0, Server Administrator 4.2.0, Dell OpenManage 6.2.
For more information regarding alert descriptions and the appropriate corrective actions, see the online help.
Table 3-4. Storage Management Messages
Event ID | Description | Severity | Cause and Action
2048 | Device failed | Critical / Failure / Error | Cause: A storage component such as a physical disk or an enclosure has failed. The failed component may have been identified by the controller while performing a task such as a rescan or a check consistency. Action: Replace the failed component.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2049 Physical disk removed Warning / Non-critical Cause: A physical disk has been removed from the disk group. This alert can also be caused by loose or defective cables or by problems with the enclosure.
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2050 | Physical disk offline | Warning / Non-critical | Cause: A physical disk in the disk group is offline. The user may have manually put the physical disk offline. Action: Perform a rescan. You can also select the offline disk and perform a Make Online operation. | Clear Alert Number: 2158 | 903
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2052 | Physical disk inserted | OK / Normal / Informational | Cause: This alert is for informational purposes. Action: None | Clear Alert: None; Related Alert Number: 2065, 2305, 2367; LRA Number: None | 901
2053 | Virtual disk created | OK / Normal / Informational | Cause: This alert is for informational purposes. | Clear Alert: | 1201
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action
2056 | Virtual disk failed | Critical / Failure / Error | Cause: One or more physical disks included in the virtual disk have failed. If the virtual disk is non-redundant (does not use mirrored or parity data), then the failure of a single physical disk can cause the virtual disk to fail.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2057 Virtual disk degraded Warning / Non-critical Cause 1: This alert message occurs when a physical disk included in a redundant virtual disk fails. Because the virtual disk is redundant (uses mirrored or parity information) and only one physical disk has failed, the virtual disk can be rebuilt.
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2057 contd. | | | Cause 2: A physical disk in the disk group has been removed. Action 2: If a physical disk was removed from the disk group, either replace the disk or restore the original disk. You can identify which disk has been removed by locating the disk that has a red “X” for its status. Perform a rescan after replacing the disk.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2060 Copy of data started from physical disk %2 to physical disk %1. OK / Normal Cause: This alert is for Clear Alert 1201 /Informational informational purposes.
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2063 | Virtual disk reconfiguration started | OK / Normal / Informational | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: 2090; Related Alert Number: None; LRA Number: None | 1201
2064 | Virtual disk rebuild started | OK / Normal / Informational | Cause: This alert is for informational purposes. | Clear Alert | 1201
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action
2067 | Virtual disk check consistency cancelled | OK / Normal / Informational | Cause: The check consistency operation was cancelled because a physical disk in the array has failed or because a user cancelled the check consistency operation.
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2070 | Virtual disk initialization cancelled | OK / Normal / Informational | Cause: The virtual disk initialization was cancelled because a physical disk included in the virtual disk has failed or because a user cancelled the virtual disk initialization. | Clear Alert Number: None | 1201
 | | OK / Normal / Informational | Cause: The user has cancelled the rebuild operation.
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2075 | Copy of data completed from physical disk %2 to physical disk %1. | OK / Normal / Informational | Cause: This alert is provided for informational purposes. | Clear Alert Number: | 1201
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action
2077 | Virtual disk format failed. | Critical / Failure / Error | Cause: A physical disk included in the virtual disk failed. Action: Replace the failed physical disk. You can identify which physical disk has failed by locating the disk that has a red X for its status. Rebuild the physical disk. When finished, restart the virtual disk format operation.
2079 | Virtual disk initialization failed.
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2080 | Physical disk initialization failed | Critical / Failure / Error | Cause: The physical disk has failed or is not functioning. Action: Replace the failed or non-functional disk. You can identify a disk that has failed by locating the disk that has a red “X” for its status. Restart the initialization. | Clear Alert Number: None | 904
Table 3-4. Storage Management Messages (continued)
2081 contd.
Software RAID:
• Perform a backup with the Verify option.
• If the file backup fails, try to restore the failed file from a previous backup.
• When the backup with the Verify option is complete without any errors, delete the Virtual Disk.
• Recreate a new Virtual Disk with new drives.
• Restore the data from backup.
Event ID: 2083
Description: Physical disk rebuild failed
Severity: Critical / Failure / Error
Cause: A physical disk included in the virtual disk has failed or is not functioning. A user may also have cancelled the rebuild.
Action: Replace the failed or non-functional disk.
Clear Alert Number: None
Related Alert Number: None
SNMP Trap Number: 904

Event ID: 2086
Description: Virtual disk format completed
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Status: Alert 2086 is a clear alert for alert 2059.
SNMP Trap Number: 1201

Event ID: 2088
Description: Virtual disk initialization completed
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Status: Alert 2088 is a clear alert for alerts 2061 and 2136.
SNMP Trap Number: 1201

Event ID: 2090
Description: Virtual disk reconfiguration completed
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Status: Alert 2090 is a clear alert for alert 2063.
SNMP Trap Number: 1201

Event ID: 2092
Description: Physical disk rebuild completed
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Status: Alert 2092 is a clear alert for alert 2065.
SNMP Trap Number: 901

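The clear-alert relationships listed for events 2086, 2088, 2090, and 2092 can be sketched as a lookup table. The following is a minimal illustration only, not part of Server Administrator: the function name `open_alerts` is hypothetical, and treating every non-clearing event as "outstanding" until a clear alert arrives is an assumption made for the example.

```python
# Clear-alert relationships copied from the table rows for 2086-2092.
# Key: the clear alert; value: the alerts it clears.
CLEARS = {
    2086: {2059},
    2088: {2061, 2136},
    2090: {2063},
    2092: {2065},
}


def open_alerts(event_ids):
    """Replay event IDs in order and return those not yet cleared.

    Assumption for this sketch: any event that is not itself a clear
    alert stays outstanding until a matching clear alert is seen.
    """
    outstanding = []
    for event_id in event_ids:
        cleared = CLEARS.get(event_id)
        if cleared is not None:
            # A clear alert removes the alerts it clears and is not kept.
            outstanding = [e for e in outstanding if e not in cleared]
        else:
            outstanding.append(event_id)
    return outstanding
```

For example, replaying 2061, 2065, 2088 leaves only 2065 outstanding, because 2088 clears 2061 but not 2065.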
Event ID: 2094
Description: Predictive Failure reported.
Severity: Warning / Non-critical
Cause: The physical disk is predicted to fail. Many physical disks contain Self Monitoring Analysis and Reporting Technology (SMART). When enabled, SMART monitors the health of the disk based on indications such as the number of write operations that have been performed on the disk.

Event ID: 2094 (continued)
If this disk is a hot spare, unassign the hot spare, perform the Prepare to Remove task on the disk, replace the disk, and assign the new disk as a hot spare.
CAUTION: If this disk is part of a nonredundant disk, back up your data immediately. If the disk fails, you cannot recover the data.

Event ID: 2095
Description: SCSI sense data %1.

Event ID: 2099
Description: Global hot spare unassigned
Severity: OK / Normal / Informational
Cause: A physical disk that was assigned as a hot spare has been unassigned and is no longer functioning as a hot spare. The physical disk may have been unassigned by a user or automatically unassigned by Storage Management. Storage Management unassigns hot spares that have been used to rebuild data.

Description: Temperature exceeded the maximum warning threshold
Severity: Warning / Non-critical
Cause: The physical disk enclosure is too hot. A variety of factors can cause the excessive temperature. For example, a fan may have failed, the thermostat may be set too high, or the room temperature may be too hot.
Action: Check for factors that may cause overheating. For example, verify that the enclosure fan is working.

Event ID: 2101
Description: Temperature dropped below the minimum warning threshold
Severity: Warning / Non-critical
Cause: The physical disk enclosure is too cool.
Action: Check if the thermostat setting is too low and if the room temperature is too cool.
Clear Alert Number: 2353
SNMP Trap Number: 1053

Event ID: 2102
Description: Temperature exceeded the maximum failure threshold
Severity: Critical / Failure / Error
Cause: The physical disk enclosure is too hot. A variety of factors can cause the excessive temperature. For example, a fan may have failed, the thermostat may be set too high, or the room temperature may be too hot.
Action: Check for factors that may cause overheating. For example, verify that the enclosure fan is working.

Event ID: 2103
Description: Temperature dropped below the minimum failure threshold
Severity: Critical / Failure / Error
Cause: The physical disk enclosure is too cool.
Action: Check if the thermostat setting is too low and if the room temperature is too cool.

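The four temperature alerts above differ only in which threshold is crossed and in which direction. The sketch below shows one way the comparison could be expressed. It is illustrative only: the function name, the parameter names, and the threshold values in the usage note are hypothetical, and the actual thresholds are defined by the enclosure, not by this document.

```python
WARNING = "Warning / Non-critical"
CRITICAL = "Critical / Failure / Error"


def classify_temperature(temp, min_failure, min_warning, max_warning, max_failure):
    """Return (description, severity) for a reading, or None if in range.

    Assumes min_failure <= min_warning <= max_warning <= max_failure.
    Failure thresholds are checked before warning thresholds so that the
    more severe alert wins.
    """
    if temp > max_failure:
        return ("Temperature exceeded the maximum failure threshold", CRITICAL)
    if temp < min_failure:
        return ("Temperature dropped below the minimum failure threshold", CRITICAL)
    if temp > max_warning:
        return ("Temperature exceeded the maximum warning threshold", WARNING)
    if temp < min_warning:
        return ("Temperature dropped below the minimum warning threshold", WARNING)
    return None
```

With hypothetical thresholds of 5, 10, 50, and 55 degrees, a reading of 52 yields the maximum warning alert and a reading of 3 yields the minimum failure alert.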
Event ID: 2106
Description: SMART FPT exceeded
Severity: Warning / Non-critical
Cause: A disk on the specified controller has received a SMART alert (predictive failure) indicating that the disk is likely to fail in the near future.
Action: Replace the disk that has received the SMART alert.
Clear Alert Number: None
Related Alert Number: None
SNMP Trap Number: 903

Event ID: 2107
Description: SMART configuration change
Severity: Critical / Failure / Error
Cause: A disk has received a SMART alert (predictive failure) after a configuration change. The disk is likely to fail in the near future.
Action: Replace the disk that has received the SMART alert.
Clear Alert Number: None
Related Alert Number: None
SNMP Trap Number: 904

Event ID: 2108
Description: SMART warning
Severity: Warning / Non-critical
Cause: A disk has received a SMART alert (predictive failure). The disk is likely to fail in the near future.
Action: Replace the disk that has received the SMART alert.
Clear Alert Number: None
Related Alert Number: None
SNMP Trap Number: 903

Event ID: 2109
Description: SMART warning temperature
Severity: Warning / Non-critical
Cause: A disk has reached an unacceptable temperature and received a SMART alert (predictive failure). The disk is likely to fail in the near future.
Action 1: Determine why the physical disk has reached an unacceptable temperature.
Clear Alert Number: None
Related Alert Number: None
LRA Number: 2070
SNMP Trap Number: 903

Event ID: 2109 (continued)
Make sure the enclosure has enough ventilation and that the room temperature is not too hot. See the physical disk enclosure documentation for more diagnostic information.
Action 2: If you cannot identify why the disk has reached an unacceptable temperature, then replace the disk. If the physical disk is a member of a non-redundant virtual disk, then back up the data before replacing the disk.

Event ID: 2110
Description: SMART warning degraded
Severity: Warning / Non-critical
Cause: A disk is degraded and has received a SMART alert (predictive failure). The disk is likely to fail in the near future.
Action: Replace the disk that has received the SMART alert.
Clear Alert Number: None
Related Alert Number: None
SNMP Trap Number: 903

Event ID: 2112
Description: Enclosure was shut down
Severity: Critical / Failure / Error
Cause: The physical disk enclosure is either hotter or cooler than the maximum or minimum allowable temperature range.
Action: Check for factors that may cause overheating or excessive cooling.
Clear Alert Number: None
Related Alert Number: None
SNMP Trap Number: 854

Event ID: 2114
Description: A consistency check on a virtual disk has been paused (suspended)
Severity: OK / Normal / Informational
Cause: The check consistency operation on a virtual disk was paused by a user.

Event ID: 2115
Description: A consistency check on a virtual disk has been resumed
Severity: OK / Normal / Informational

Event ID: 2116
Description: A virtual disk and its mirror have been split
Severity: OK / Normal / Informational
Cause: A user has caused a mirrored virtual disk to be split. When a virtual disk is mirrored, its data is copied to another virtual disk in order to maintain redundancy. After being split, both virtual disks retain a copy of the data although the mirror is no longer intact.

Event ID: 2118
Description: Change write policy
Severity: OK / Normal / Informational
Cause: A user has changed the write policy for a virtual disk. This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Number: 1201

Event ID: 2120
Description: Enclosure firmware mismatch
Severity: Warning / Non-critical
Cause: The firmware on the EMM is not the same version.

Event ID: 2121
Description: Device returned to normal
Severity: OK / Normal / Informational
Cause: A device that was previously in an error state has returned to a normal state. For example, if an enclosure became too hot and subsequently cooled down, you may receive this alert. This alert is for informational purposes.

Event ID: 2122
Description: Redundancy degraded
Severity: Warning / Non-critical
Cause: One or more of the enclosure components has failed. For example, a fan or power supply may have failed. Although the enclosure is currently operational, the failure of additional components could cause the enclosure to fail.
Clear Alert Status: 2124
SNMP Trap Number: 1305

Event ID: 2122 (continued)
The controller status displayed on the Health subtab indicates whether a controller has a Failed or Degraded component. See the enclosure documentation for information on replacing enclosure components and for other diagnostic information.

Event ID: 2123
Description: Redundancy lost
Severity: Warning / Non-critical
Cause: A virtual disk or an enclosure has lost data redundancy. In the case of a virtual disk, one or more physical disks included in the virtual disk have failed. Due to the failed physical disk or disks, the virtual disk is no longer maintaining redundant (mirrored or parity) data.

Event ID: 2123 (continued)
The controller status displayed on the Health subtab indicates whether a controller has a Failed or Degraded component. Click the controller that displays a Warning or Failed status. This action displays the controller Health subtab which displays the status of the individual controller components.

Event ID: 2124
Description: Redundancy normal
Severity: OK / Normal / Informational
Cause: Data redundancy has been restored to a virtual disk or an enclosure that previously suffered a loss of redundancy. This alert is for informational purposes.
Action: None
Clear Alert Number: Alert 2124 is a clear alert for alerts 2122 and 2123.
SNMP Trap Number: 1304

Event ID: 2126
Description: SCSI sense sector reassign
Severity: Warning / Non-critical
Cause: A sector of the physical disk is corrupted and data cannot be maintained on this portion of the disk. This alert is for informational purposes.

Event ID: 2127
Description: Background initialization (BGI) started
Severity: OK / Normal / Informational
Cause: BGI of a virtual disk has started. This alert is for informational purposes.
Action: None
Clear Alert Status: 2130
Related Alert Number: None
LRA Number: None
SNMP Trap Number: 1201

Event ID: 2128
Description: BGI cancelled
Severity: OK / Normal / Informational
Cause: BGI of a virtual disk has been cancelled.

Event ID: 2130
Description: BGI completed
Severity: OK / Normal / Informational
Cause: BGI of a virtual disk has completed. This alert is for informational purposes.
Action: None
Clear Alert Number: Alert 2130 is a clear alert for alert 2127.
SNMP Trap Number: 1201

Event ID: 2132
Description: Driver version mismatch
Severity: Warning / Non-critical
Cause: The controller driver is not a supported version.
Action: Install a supported version of the driver.
Clear Alert Number: None
SNMP Trap Number: 753

Event ID: 2135
Description: Array Manager is installed on the system
NOTE: This is not supported on Dell OpenManage Server Administrator version 6.0.1.
Severity: Warning / Non-critical

Event ID: 2136
Description: Virtual disk initialization
Severity: OK / Normal / Informational
Cause: Virtual disk initialization is in progress. This alert is for informational purposes.

Event ID: 2137
Description: Communication timeout
Severity: Warning / Non-critical
Cause: The controller is unable to communicate with an enclosure. There are several reasons why communication may be lost. For example, there may be a bad or loose cable. An unusual amount of I/O may also interrupt communication with the enclosure.

Event ID: 2137 (continued)
Action: Check for problems with the cables. See the online help for more information on checking the cables. You should also check to see if the enclosure has degraded or failed components. To do so, select the enclosure object in the tree view and click the Health subtab. The Health subtab displays the status of the enclosure components.

Event ID: 2138

Event ID: 2139
Description: Enclosure alarm disabled
Severity: OK / Normal / Informational
Cause: A user has disabled the enclosure alarm.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Number: 851

Event ID: 2140
Description: Dead disk segments restored
Severity: OK / Normal / Informational
Cause: Disk space that was formerly “dead” or inaccessible to a redundant virtual disk has been restored.

Event ID: 2142
Description: Controller rebuild rate has changed
Severity: OK / Normal / Informational
Cause: A user has changed the controller rebuild rate. This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Number: 751

Event ID: 2143
Description: Controller alarm enabled
Severity: OK / Normal / Informational
Cause: A user has enabled the controller alarm.

Event ID: 2145
Description: Controller battery low
Severity: Warning / Non-critical
Cause: The controller battery charge is low.
Action: Recondition the battery. See the online help for more information.
Clear Alert: None
Related Alert: None
SNMP Trap Number: 1153

Cause: A portion of a physical disk is damaged.

Event ID: 2149
Description: Bad block extended sense error
Severity: Warning / Non-critical
Cause: A portion of a physical disk is damaged.
Action: See the Dell OpenManage Server Administrator Storage Management online help for more information.
Clear Alert: None
SNMP Trap Number: 753

Event ID: 2150
Description: Bad block extended medium error
Severity: Warning / Non-critical
Cause: A portion of a physical disk is damaged.

Event ID: 2153
Description: Enclosure service tag changed
Severity: OK / Normal / Informational
Cause: An enclosure service tag was changed. In most circumstances, this service tag should only be changed by Dell support or your service provider.
Action: Ensure that the tag was changed under authorized circumstances.

Event ID: 2157
Description: Controller configuration has been reset
Severity: OK / Normal / Informational
Cause: A user has reset the controller configuration. See the online help for more information. This alert is for informational purposes.
Action: None

Event ID: 2158
Description: Physical disk online
Severity: OK / Normal / Informational
Cause: An offline physical disk has been made online. This alert is for informational purposes.

Event ID: 2159
Description: Virtual disk renamed
Severity: OK / Normal / Informational
Cause: A user has renamed a virtual disk. When renaming a virtual disk on a PERC 4/SC, 4/DC, 4e/DC, 4/Di, CERC ATA100/4ch, PERC 5/E, PERC 5/i or SAS 5/iR controller, this alert displays the new virtual disk name.

Event ID: 2161
Description: Dedicated hot spare unassigned
Severity: OK / Normal / Informational
Cause: A physical disk that was assigned as a hot spare has been unassigned and is no longer functioning as a hot spare. The physical disk may have been unassigned by a user or automatically unassigned by Storage Management. Storage Management unassigns hot spares that have been used to rebuild data.

Event ID: 2161 (continued)
Action: Although this alert is provided for informational purposes, you may need to assign a new hot spare to the virtual disk.

Event ID: 2162
Description: Communication regained
Severity: OK / Normal / Informational
Cause: Communication with an enclosure has been restored. This alert is for informational purposes.

Event ID: 2164
Description: See the Readme file for a list of validated controller driver versions
Severity: OK / Normal / Informational
Cause: Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller drivers. This alert is for informational purposes.

Event ID: 2165
Description: The RAID controller firmware and driver validation was not performed. The configuration file cannot be opened.
Severity: Warning / Non-critical
Cause: Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller firmware and drivers. This situation may occur for a variety of reasons.

Event ID: 2166
Description: The RAID controller firmware and driver validation was not performed. The configuration file is out of date, missing the required information, or not properly formatted to complete the comparison.
Severity: Warning / Non-critical

Event ID: 2167
Description: The current kernel version and the non-RAID SCSI driver version are older than the minimum required levels. See readme.txt for a list of validated kernel and driver versions.
Severity: Warning / Non-critical
Cause: The versions of the kernel and the driver do not meet the minimum requirements.

Event ID: 2168
Description: The non-RAID SCSI driver version is older than the minimum required level. See readme.txt for the validated driver version.
Severity: Warning / Non-critical
Cause: The version of the driver does not meet the minimum requirements.

Event ID: 2170
Description: The controller battery charge level is normal.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Number: 1151

Event ID: 2171
Description: The controller battery temperature is above normal.
Severity: Warning / Non-critical

Event ID: 2172
Description: The controller battery temperature is normal.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Status: Alert 2172 is a clear alert for alert 2171.
Related Alert: None
LRA Number: None
SNMP Trap Number: 1151

Event ID: 2173
Description: Unsupported configuration detected.

Event ID: 2174
Description: The controller battery has been removed.
Severity: Warning / Non-critical
Cause: The controller cannot communicate with the battery. The battery may be removed, or the contact point between the controller and the battery may be burnt or corroded.
Action: Replace the battery if it has been removed.
Clear Alert: None
SNMP Trap Number: 1153

Event ID: 2176
Description: The controller battery Learn cycle has started.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2177
Related Alert: None
LRA Number: None
SNMP Trap Number: 1151

Event ID: 2177
Description: The controller battery Learn cycle has completed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None

Event ID: 2178
Description: The controller battery Learn cycle has timed out.
Severity: Warning / Non-critical
Cause: The controller battery must be fully charged before the Learn cycle can begin. The battery may be unable to maintain a full charge, causing the Learn cycle to time out.

Event ID: 2180
Description: The controller battery Learn cycle starts in %1 days.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the alert log and can vary depending on the situation.

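The substitution variables (%1, %2, and so on) in these message templates can be illustrated with a short sketch. This is not how Server Administrator itself renders alerts; the function name `render_message` and the sample values are hypothetical, and the sketch assumes only that %N refers to the Nth supplied value.

```python
import re


def render_message(template, *values):
    """Replace %1, %2, ... in a message template with the given values.

    Placeholders with no matching value are left unchanged, so a partly
    filled template is still readable.
    """
    def substitute(match):
        index = int(match.group(1)) - 1  # %1 refers to the first value
        if 0 <= index < len(values):
            return str(values[index])
        return match.group(0)

    return re.sub(r"%(\d+)", substitute, template)


# Usage with the message text of event 2180 and a hypothetical value of 7.
print(render_message("The controller battery Learn cycle starts in %1 days.", 7))
```

The same helper handles templates with several variables, such as the copyback messages that use both %1 and %2.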
Event ID: 2183
Description: Copyback failed on physical disk %1 from physical disk %2.
Severity: Critical / Failure / Error
Cause: The physical disk participating in the copyback operation has failed.
Action: None
Clear Alert: None
Related Alert Number: 2060
LRA Number: None
SNMP Trap Number: 904

Event ID: 2184
Description: Physical disk Copyback cancelled.

Event ID: 2186
Description: The controller cache has been discarded.
Severity: Warning / Non-critical
Cause: The controller has flushed the cache and any data in the cache has been lost. This may happen if the system has memory or battery problems that cause the controller to distrust the cache.

Event ID: 2188
Description: The controller write policy has been changed to Write Through.
Severity: OK / Normal / Informational
Cause: The controller battery is unable to maintain cached data for the required period of time. For example, if the required period of time is 24 hours, the battery is unable to maintain cached data for 24 hours.

Event ID: 2190
Description: The controller has detected a hot-add of an enclosure.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Number: 751

Event ID: 2191
Description: Multiple enclosures are attached to the controller.
Severity: Critical / Failure / Error
Cause: There are too many enclosures attached to the controller.

Event ID: 2192
Description: The virtual disk Check Consistency has made corrections and completed.
Severity: OK / Normal / Informational
Cause: The virtual disk Check Consistency has identified errors and made corrections. For example, the Check Consistency may have encountered a bad disk block and remapped the disk block to restore data consistency. This alert is for informational purposes.

Event ID: 2195
Description: Dedicated hot spare assigned. Physical disk %1
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2196
Related Alert: None
LRA Number: None
SNMP Trap Number: 1201

Event ID: 2196
Description: Dedicated hot spare unassigned.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.

Event ID: 2198
Description: The physical disk is too small to be used for copyback.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Number: 903

Event ID: 2199
Description: The virtual disk cache policy has changed.

Event ID: 2201
Description: A global hot spare failed.
Severity: Warning / Non-critical
Cause: The controller is not able to communicate with a disk that is assigned as a dedicated hot spare. The disk may have been removed. There may also be a bad or loose cable.
Action: Check if the disk is healthy and that it has not been removed. Check the cables.
Clear Alert: None
SNMP Trap Number: 903

Event ID: 2203
Description: A dedicated hot spare failed.
Severity: Warning / Non-critical
Cause: The controller is unable to communicate with a disk that is assigned as a dedicated hot spare. The disk may have failed or been removed. There may also be a bad or loose cable.
Action: Check if the disk is healthy and that it has not been removed. Check the cables.
Clear Alert: None
SNMP Trap Number: 903

Event ID: 2205
Description: A dedicated hot spare has been automatically unassigned.
Severity: OK / Normal / Informational
Cause: The hot spare is no longer required because the virtual disk it was assigned to has been deleted.
Action: None
Clear Alert: None
Related Alert Number: 2098, 2161, 2196
LRA Number: None
SNMP Trap Number: 901

Event ID: 2206
Description: The only hot spare available is a SATA disk.
Severity: Warning / Non-critical

Event ID: 2207
Description: The only hot spare available is a SAS disk. SAS disks cannot replace SATA disks.
Severity: Warning / Non-critical
Cause: The only physical disk available to be assigned as a hot spare is using SAS technology. The physical disks in the virtual disk are using SATA technology.

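The rule behind alert 2207 (a SAS hot spare cannot replace SATA disks) can be sketched as a simple compatibility check. The function name and its string arguments are hypothetical. The 2207 branch follows the cause text above; the 2206 branch assumes the mirror-image condition (a SATA spare for a virtual disk built from SAS disks), which this excerpt does not spell out.

```python
def hot_spare_alert(spare_technology, member_technology):
    """Return the alert ID for a mismatched hot spare, or None if compatible.

    spare_technology: technology of the only available hot spare.
    member_technology: technology of the physical disks in the virtual disk.
    """
    spare = spare_technology.upper()
    member = member_technology.upper()
    if spare == "SAS" and member == "SATA":
        return 2207  # stated above: SAS disks cannot replace SATA disks
    if spare == "SATA" and member == "SAS":
        return 2206  # assumed mirror case; not detailed in this excerpt
    return None
```

Calling `hot_spare_alert("SAS", "SATA")` returns 2207, while matching technologies return `None`.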
Event ID: 2211
Description: The physical disk is not supported.
Severity: Warning / Non-critical
Cause: The physical disk may not have a supported version of the firmware or the disk may not be supported by Dell.
Action: If the disk is supported by Dell, update the firmware to a supported version.
Clear Alert: None
SNMP Trap Number: 903

Event ID: 2214
Description: Battery charge in progress
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Number: 1151

Event ID: 2215
Description: Battery charge process
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Number: 1151

Event ID: 2218
Description: None of the Controller Property are set.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: You should change at least one controller property and run the command again.
Clear Alert: None
SNMP Trap Number: 751

Event ID: 2219
Event ID: 2220
Event ID: 2221

Event ID: 2222
Description: Loadbalance and Auto Copyback on Predictive Failure changed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: Change at least one controller property and run the command again.
Clear Alert: None
SNMP Trap Number: 751

Event ID: 2223
Event ID: 2224
Event ID: 2225

Description: Abort Check Consistency on Error, Copyback and Loadbalance changed.
Severity: OK / Normal / Informational

Event ID: 2226
Description: Load balance changed
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: Change at least one controller property and run the command again.
Clear Alert: None
SNMP Trap Number: 751

Event ID: 2230
Description: Auto Copyback on Predictive Failure changed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: Change at least one controller property and run the command again.
Clear Alert: None
SNMP Trap Number: 751

Event ID: 2231
Event ID: 2232

Description: Copyback and Abort Check Consistency on Error changed.
Severity: OK / Normal / Informational

Event ID: 2234
Description: The Patrol Read rate has changed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Number: 751

Event ID: 2235
Description: The Check Consistency rate has changed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Number: 751

Event ID: 2238
Description: The controller debug log file has been exported.
Severity: OK / Normal / Informational
Cause: The user has attempted to export the controller debug log. This alert is for informational purposes.
Action: None
Clear Alert: None
SNMP Trap Number: 751

Event ID: 2239
Description: A foreign configuration has been cleared.

Event ID: 2242
Description: The Patrol Read operation has started.
Severity: OK / Normal / Informational
Cause: The controller has started the Patrol Read operation. This alert is for informational purposes.
Action: None

Event ID: 2243
Description: The Patrol Read operation has stopped.
Severity: OK / Normal / Informational
Cause: The controller has stopped the Patrol Read operation. This alert is for informational purposes.

Table 3-4. Storage Management Messages (continued)

Event ID: 2245
Description: A virtual disk blink has ceased.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 1201

Event ID: 2246
Description: The controller battery is degraded.
Severity: Warning / Non-critical
Cause: The temperature of the battery is high. This may be due to the battery being charged.
Table 3-4. Storage Management Messages (continued)

Event ID: 2248
Description: The controller battery is executing a Learn cycle.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 1151

Event ID: 2249
Description: The physical disk Clear operation has started.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 901
Table 3-4. Storage Management Messages (continued)

Event ID: 2252
Description: The physical disk blink has ceased.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 901

Event ID: 2253
Description: Redundant path restored
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 751
Table 3-4. Storage Management Messages (continued)

Event ID: 2255
Description: The physical disk has been started.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 901
Table 3-4. Storage Management Messages (continued)

Event ID: 2259
Description: An enclosure blink operation has initiated.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Number: 2260; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 851

Event ID: 2260
Description: An enclosure blink has ceased.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 851
Table 3-4. Storage Management Messages (continued)

Event ID: 2263
Description: SMART thermal shutdown is disabled.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 101

Event ID: 2264
Description: A device is missing.
Severity: Warning / Non-critical
Cause: The controller cannot communicate with a device. The device may be removed.
Table 3-4. Storage Management Messages (continued)

Event ID: 2265
Description: A device is in an unknown state.
Severity: Warning / Non-critical
Cause: The controller cannot communicate with a device. The state of the device cannot be determined. There may be a bad or loose cable. The system may also be experiencing problems with the application programming interface (API). There could also be a problem with the driver or firmware.
Table 3-4. Storage Management Messages (continued)

Event ID: 2266
Description: Controller log file entry: %1
Severity: OK / Normal / Informational
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text can vary depending on the situation. This alert is for informational purposes.
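The substitution variables used in these descriptions (%1, %2, and so on) can be illustrated with a short sketch. This is only an illustration of positional substitution, not Server Administrator code; the function name render_message and the sample values are hypothetical.

```python
import re


def render_message(template, *values):
    """Fill %1, %2, ... in an alert description with positional values.

    Hypothetical helper for illustration; it is not part of Server
    Administrator. Placeholders without a supplied value are left as-is.
    """
    def substitute(match):
        index = int(match.group(1)) - 1  # %1 maps to values[0]
        if 0 <= index < len(values):
            return str(values[index])
        return match.group(0)

    return re.sub(r"%(\d+)", substitute, template)


if __name__ == "__main__":
    # The sample text after the colon is made up for this example.
    print(render_message("Controller log file entry: %1", "sample controller text"))
```

Leaving unmatched placeholders untouched makes a missing value visible in the rendered text instead of silently dropping it.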
Table 3-4. Storage Management Messages (continued)

Event ID: 2268
Description: %1, Storage Management has lost communication with the controller. An immediate reboot is strongly recommended to avoid further problems.
Severity: Critical / Failure / Error
Cause: Storage Management has lost communication with a controller. This may occur if the controller driver or firmware is experiencing a problem. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert
Table 3-4. Storage Management Messages (continued)

Event ID: 2270
Description: The physical disk Clear operation failed.
Severity: Critical / Failure / Error
Cause: A Clear task was being performed on a physical disk but the task was interrupted and did not complete successfully. The controller may have lost communication with the disk. The disk may have been removed or the cables may be loose or defective.
Table 3-4. Storage Management Messages (continued)

Event ID: 2272
Description: Patrol Read found an uncorrectable media error.
Severity: Critical / Failure / Error
Cause: The Patrol Read task has encountered an error that cannot be corrected. There may be a bad disk block that cannot be remapped.
Action: Back up your data. If you are able to back up the data successfully, fully initialize the disk and then restore from backup.
Table 3-4. Storage Management Messages (continued)

Event ID: 2273
Description: A block on the physical disk has been punctured by the controller.
Severity: Critical / Failure / Error
Cause: The controller encountered an unrecoverable medium error when attempting to read a block on the physical disk and marked that block as invalid.
Table 3-4. Storage Management Messages (continued)

Event ID: 2276
Description: The dedicated hot spare is too small.
Severity: Warning / Non-critical
Cause: The dedicated hot spare is not large enough to protect all virtual disks that reside on the disk group.
Action: Assign a larger disk as the dedicated hot spare.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 903

Event ID: 2277
Description: The global hot spare is too small.
Severity: Warning / Non-critical
Table 3-4. Storage Management Messages (continued)

Event ID: 2278
Description: The controller battery charge level is below a normal threshold.
Severity: OK / Normal / Informational
Cause: The battery is discharging. A battery discharge is a normal activity during the battery Learn cycle. The battery Learn cycle recharges the battery. You should receive alert 2179 when the recharge occurs.
Table 3-4. Storage Management Messages (continued)

Event ID: 2280
Description: A disk media error has been corrected.
Severity: OK / Normal / Informational
Cause: A disk media error was detected while the controller was completing a background task. A bad disk block was identified. The disk block has been remapped.
Action: Consider replacing the disk.
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 1201
Table 3-4. Storage Management Messages (continued)

Event ID: 2282
Description: Hot spare SMART polling failed.
Severity: Critical / Failure / Error
Cause: The controller firmware attempted a SMART polling on the hot spare but was unable to complete it. The controller has lost communication with the hot spare.
Action: Check the health of the disk assigned as a hot spare. You may need to replace the disk and reassign the hot spare. Make sure the cables are attached securely.
Table 3-4. Storage Management Messages (continued)

Event ID: 2283
Description: A redundant path is broken.
Severity: Warning / Non-critical
Cause: The controller has two connectors that are connected to the same enclosure. The communication path on one connector has lost connection with the enclosure. The communication path on the other connector is reporting this loss.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2285 A disk media error was corrected during recovery. OK / Normal / Cause: This alert is for Clear Alert: 901 Informational informational purposes. None 2286 2287 Cause and Action Action: None Related Alert: None LRA Number: None A Learn cycle OK / Normal / Cause: This alert is for start is pending Informational informational purposes. while the Action: None battery charges.
Table 3-4. Storage Management Messages (continued)

Event ID: 2289
Description: Multi-bit ECC error on controller DIMM.
Severity: Critical / Failure / Error
Cause: An error involving multiple bits has been encountered during a read or write operation. The error correction algorithm recalculates parity data during read and write operations. If an error involves only a single bit, it may be possible for the error correction algorithm to correct the error and maintain parity data.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2290 Single-bit ECC error on controller DIMM. Warning / Non-critical Cause: An error involving a single bit has been encountered during a read or write operation. The error correction algorithm has corrected this error. Clear Alert: 753 None Action: None 2291 2292 An enclosure management module (EMM) has been discovered.
Table 3-4. Storage Management Messages (continued)

Event ID: 2293
Description: The EMM has failed.
Severity: Critical / Failure / Error
Cause: The failure may be caused by a loss of power to the EMM. The EMM self test may also have identified a failure. There could also be a firmware problem or a multi-bit error.
Action: Replace the EMM.
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: 2091
SNMP Trap Numbers: 854
Table 3-4. Storage Management Messages (continued)

Event ID: 2296
Description: An EMM has been inserted.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 951

Event ID: 2297
Description: An EMM has been removed.
Severity: Critical / Failure / Error
Cause: An EMM has been removed.
Action: Reinsert the EMM.

Event ID: 2298
Description: The enclosure has a bad sensor %1.
Severity: Warning / Non-critical
Table 3-4. Storage Management Messages (continued)

Event ID: 2299
Description: Bad PHY %1
Severity: Critical / Failure / Error
Cause: There is a problem with a physical connection or PHY. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the alert log and can vary depending on the situation.
Table 3-4. Storage Management Messages (continued)

Event ID: 2300
Description: The enclosure is unstable.
Severity: Critical / Failure / Error
Cause: The controller is not receiving a consistent response from the enclosure. There could be a firmware problem or an invalid cabling configuration. If the cables are too long, they degrade the signal.
Action: Power down all enclosures attached to the system and reboot the system.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2301 The enclosure Critical / Cause: The enclosure or has a hardware Failure / Error an enclosure error. component is in a Failed or Degraded state. Clear Alert: 854 None Cause: The enclosure or The enclosure Critical / is not Failure / Error an enclosure responding. component is in a Failed or Degraded state.
Table 3-4. Storage Management Messages (continued)

Event ID: 2304
Description: An attempt to hot plug an EMM has been detected. This type of hot plug is not supported.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None

Event ID: 2305
Description: The physical disk is too small to be used for a rebuild.
Severity: Warning / Non-critical
Cause: The physical disk is too small to rebuild the data.
Table 3-4. Storage Management Messages (continued)

Event ID: 2306
Description: Bad block table is 80% full.
Severity: Warning / Non-critical
Cause: The bad block table is used for remapping bad disk blocks. This table fills as bad disk blocks are remapped. When the table is full, bad disk blocks can no longer be remapped, and disk errors can no longer be corrected. At this point, data loss can occur. The bad block table is now 80% full.
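The relationship between the 80% warning (alert 2306) and the table-full condition (alert 2307) can be sketched as a simple threshold check. This is a minimal illustration under assumed inputs: how the controller actually tracks table usage is not described in this document, and the function name, the entry counts, and the returned labels are hypothetical.

```python
def bad_block_table_status(used_entries, capacity):
    """Classify bad block table usage against the 80% and 100% thresholds.

    Hypothetical sketch: the entry counts are assumed inputs, not values
    read from a real controller.
    """
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    if used_entries >= capacity:
        return "full"       # corresponds to alert 2307 (table is full)
    if used_entries * 100 >= capacity * 80:
        return "warning"    # corresponds to alert 2306 (table is 80% full)
    return "ok"


if __name__ == "__main__":
    for used in (10, 80, 100):
        print(used, bad_block_table_status(used, 100))
```

Integer arithmetic is used for the 80% comparison so that the boundary case is not affected by floating-point rounding.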
Table 3-4. Storage Management Messages (continued)

Event ID: 2307
Description: Bad block table is full. Unable to log block %1
Severity: Critical / Failure / Error
Cause: The bad block table is used for remapping bad disk blocks. This table fills as bad disk blocks are remapped. When the table is full, bad disk blocks can no longer be remapped and disk errors can no longer be corrected. At this point, data loss can occur. The %1 indicates a substitution variable.
Table 3-4. Storage Management Messages (continued)

Event ID: 2309
Description: A physical disk is incompatible.
Severity: Warning / Non-critical
Cause: You have attempted to replace a disk with another disk that is using an incompatible technology. For example, you may have replaced one side of a mirror with a SAS disk when the other side of the mirror is using SATA technology.
Table 3-4. Storage Management Messages (continued)

Event ID: 2311
Description: The firmware on the EMMs is not the same version. EMM0 %1 EMM1 %2
Severity: Warning / Non-critical
Cause: The firmware on the EMM modules is not the same version. It is required that both modules have the same version of the firmware. This alert may be caused if you attempt to insert an EMM module that has a different firmware version than an existing module.
Table 3-4. Storage Management Messages (continued)

Event ID: 2313
Description: A power supply in the enclosure has a DC failure.
Severity: Warning / Non-critical
Cause: The power supply has a DC failure.
Action: Replace the power supply.
Related Alert Information: Clear Alert Number: 2323; Related Alert Number: 2122, 2322.
SNMP Trap Numbers: 1003
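Each row of this table carries the same fields (event ID, description, severity, clear and related alert numbers, SNMP trap number), so a row can be modeled as a small record. The sketch below is an assumed representation for illustration only; the class name StorageAlert, its field names, and the helper is_cleared_by are hypothetical, and the values are the ones listed above for alert 2313.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class StorageAlert:
    """One row of the Storage Management Messages table (hypothetical model)."""
    event_id: int
    description: str
    severity: str
    snmp_trap: Optional[int] = None
    clear_alerts: Tuple[int, ...] = ()
    related_alerts: Tuple[int, ...] = ()


# Values taken from the 2313 entry above.
ALERT_2313 = StorageAlert(
    event_id=2313,
    description="A power supply in the enclosure has a DC failure.",
    severity="Warning / Non-critical",
    snmp_trap=1003,
    clear_alerts=(2323,),
    related_alerts=(2122, 2322),
)


def is_cleared_by(alert, event_id):
    """Return True when event_id is listed as a clear alert for this alert."""
    return event_id in alert.clear_alerts


if __name__ == "__main__":
    print(is_cleared_by(ALERT_2313, 2323))
```

Keeping clear and related alert numbers as tuples lets an entry with "None" in those columns be represented as an empty tuple.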
Table 3-4. Storage Management Messages (continued)

Event ID: 2315
Description: Diagnostic message %1
Severity: OK / Normal / Informational
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the utility that ran the diagnostics and is displayed with the alert in the alert log. This text can vary depending on the situation. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued)

Event ID: 2318
Description: Problems with the battery or the battery charger have been detected. The battery health is poor.
Severity: Warning / Non-critical
Cause: The battery or the battery charger is not functioning properly.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 1153

Event ID: 2319
Description: Single-bit ECC error. The DIMM is degrading.
Severity: Warning / Non-critical
Cause: The DIMM is beginning to malfunction.
Table 3-4. Storage Management Messages (continued)

Event ID: 2320
Description: Single-bit ECC error. The DIMM is critically degraded.
Severity: Critical / Failure / Error
Cause: The DIMM is malfunctioning. Data loss or data corruption may be imminent.
Action: Replace the DIMM immediately to avoid data loss or data corruption. The DIMM is a part of the controller battery pack.
Related Alert Information: Clear Alert: None; Related Alert Number: 2321; LRA Number: 2061
SNMP Trap Numbers: 754
Table 3-4. Storage Management Messages (continued)

Event ID: 2322
Description: The DC power supply is switched off.
Severity: Critical / Failure / Error
Cause: The power supply unit is switched off. Either a user switched off the power supply unit or it is defective.
Action: Check if the power switch is turned off. If it is turned off, turn it on.
Related Alert Information: Clear Alert Number: 2323; Related Alert: None; LRA Number: 2091
SNMP Trap Numbers: 1004
Table 3-4. Storage Management Messages (continued)

Event ID: 2324
Description: The AC power supply cable has been removed.
Severity: Critical / Failure / Error
Cause: The power cable may be pulled out or removed. The power cable may also have overheated and become warped and nonfunctional.
Action: Replace the power cable.

Event ID: 2325
Description: The power supply cable has been inserted.
Table 3-4. Storage Management Messages (continued)

Event ID: 2326
Description: A foreign configuration has been detected.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes. The controller has physical disks that were moved from another controller. These physical disks contain virtual disks that were created on the other controller.
Table 3-4. Storage Management Messages (continued)

Event ID: 2327
Description: The NVRAM has corrupted data. The controller is reinitializing the NVRAM.
Severity: Warning / Non-critical
Cause: The nonvolatile random access memory (NVRAM) is corrupt. This may occur after a power surge, a battery failure, or for other reasons. The controller is reinitializing the NVRAM.
Table 3-4. Storage Management Messages (continued)

Event ID: 2329
Description: SAS port report: %1
Severity: Warning / Non-critical
Cause: The text for this alert is generated by the controller and can vary depending on the situation. The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text can vary depending on the situation.
Table 3-4. Storage Management Messages (continued)

Event ID: 2330
Description: SAS port report: %1
Severity: OK / Normal / Informational
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text can vary depending on the situation. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued)

Event ID: 2332
Description: A controller hot plug has been detected.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 751

Event ID: 2334
Description: Controller event log: %1
Severity: OK / Normal / Informational
Cause: The %1 indicates a substitution variable.
Table 3-4. Storage Management Messages (continued)

Event ID: 2335
Description: Controller event log: %1
Severity: Warning / Non-critical
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text is from events in the controller event log that were generated while Storage Management was not running.
Table 3-4. Storage Management Messages (continued)

Event ID: 2336
Description: Controller event log: %1
Severity: Critical / Failure / Error
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text is from events in the controller event log that were generated while Storage Management was not running. This text can vary depending on the situation.
Table 3-4. Storage Management Messages (continued)

Event ID: 2337
Description: The controller is unable to recover cached data from the battery backup unit (BBU).
Severity: Critical / Failure / Error
Cause: The controller was unable to recover data from the cache. This may occur when the system is without power for an extended period of time when the battery is discharged.
Table 3-4. Storage Management Messages (continued)

Event ID: 2340
Description: The BGI completed with uncorrectable errors.
Severity: Critical / Failure / Error
Cause: The BGI task encountered errors that cannot be corrected. The virtual disk contains physical disks that have unusable disk space or disk errors that cannot be corrected.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2342 The Check Consistency found inconsistent parity data. Data redundancy may be lost. Warning / Non-critical Cause: The data on a source disk and the redundant data on a target disk is inconsistent. Clear Alert: 1203 None The Check Consistency logging of inconsistent parity data is disabled.
Table 3-4. Storage Management Messages (continued)

Event ID: 2345
Description: The virtual disk initialization failed.
Severity: Critical / Failure / Error
Cause: The controller cannot communicate with attached devices. A disk may be removed or contain errors. Cables may also be loose or defective.
Action: Verify the health of attached devices. Review the Alert Log for significant events. Make sure the cables are attached securely.
Table 3-4. Storage Management Messages (continued)

Event ID: 2346
Description: Error occurred: %1
Severity: Warning / Non-critical
Cause: A physical device may have an error. The %1 indicates a substitution variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the alert log. This text can vary depending on the situation.
Table 3-4. Storage Management Messages (continued)

Event ID: 2347
Description: The rebuild failed due to errors on the source physical disk.
Severity: Critical / Failure / Error
Hardware RAID:
Cause: You are attempting to rebuild data that resides on a defective disk.
Action: Replace the source disk and restore from backup.
Table 3-4. Storage Management Messages (continued)

Event ID: 2348
Description: The rebuild failed due to errors on the target physical disk.
Severity: Critical / Failure / Error
Cause: You are attempting to rebuild data on a disk that is defective.
Action: Replace the target disk. If a rebuild does not automatically start after replacing the disk, initiate the Rebuild task. You may need to assign the new disk as a hot spare to initiate the rebuild.
Table 3-4. Storage Management Messages (continued)

Event ID: 2351
Description: A physical disk is marked as missing.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None.
Related Alert Information: Clear Alert Number: 2352; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 901

Event ID: 2352
Description: A physical disk that was marked as missing has been replaced.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None.
Table 3-4. Storage Management Messages (continued)

Event ID: 2354
Description: Enclosure firmware download in progress.
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
Action: None
Related Alert Information: Clear Alert Status: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 851

Event ID: 2355
Description: Enclosure firmware download failed.
Severity: Warning / Non-critical
Cause: The system was unable to download firmware to the enclosure.
Table 3-4. Storage Management Messages (continued)

Event ID: 2355 (continued)
Action: Attempt to download the enclosure firmware again. If problems continue, verify that the controller can communicate with the enclosure. Make sure that the enclosure is powered on. Check the cables. See the Cables Attached Correctly section for more information on checking the cables. Verify the health of the enclosure and its components.
Table 3-4. Storage Management Messages (continued)

Event ID: 2356
Description: SAS SMP communications error %1
Severity: Critical / Failure / Error
Cause: The text for this alert is generated by the firmware and can vary depending on the situation. The reference to SMP in this text refers to SAS Management Protocol.
Action: There may be a SAS topology error. See the hardware documentation for information on correct SAS topology configurations.
Table 3-4. Storage Management Messages (continued)

Event ID: 2357
Description: SAS expander error: %1
Severity: Critical / Failure / Error
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the alert log. This text can vary depending on the situation.
Table 3-4. Storage Management Messages (continued)

Event ID: 2359
Description: The physical disk is not certified.
Severity: Warning / Non-critical
Cause: The physical disk does not comply with the standards set by Dell and is not supported.
Action: Replace the physical disk with a physical disk that is supported.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 903
Table 3-4. Storage Management Messages (continued) Event ID Description 2362 Physical OK / Normal / Cause: This alert is for disk(s) have Informational informational purposes. been removed Action: None. from a virtual disk. The virtual disk is in Failed state during the next system reboot. Clear Alert: 751 None All virtual disks OK / Normal / Cause: This alert is for are missing Informational informational purposes. from the Action: None. controller. This situation was discovered during system startup.
Table 3-4. Storage Management Messages (continued) Event ID Description 2367 Rebuild is not Warning / possible Non-critical because mixing of different media type (SSD/HDD) and bus protocols (SATA/SAS) is not supported on the same virtual disk. 2368 218 Severity Cause and Action Related SNMP Alert Trap Information Numbers Cause: The physical disk is using an incompatible technology.
Table 3-4. Storage Management Messages (continued)

Event ID: 2369
Description: Virtual Disk Redundancy has been degraded.
Severity: OK / Normal / Informational
Cause: A physical disk in a RAID 6 virtual disk has either failed or been removed.
Action: Replace the missing or failed physical disk.
Table 3-4. Storage Management Messages (continued)

Event ID: 2372
Description: Attempted import of Virtual Disk exceeding the limit supported on the controller.
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
Action: None.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 751
Table 3-4. Storage Management Messages (continued)

Event ID: 2376
Description: Attempted import of Virtual Disk with stale physical disk
Severity: OK / Normal / Informational
Cause: User is attempting to import a foreign virtual disk with a stale physical disk. This alert is provided for informational purposes.
Action: None.

Event ID: 2377
Description: Attempted import of an orphan drive
Severity: OK / Normal / Informational
Cause: User is attempting to import an orphan drive.
Table 3-4. Storage Management Messages (continued)

Event ID: 2380
Description: Foreign configuration has been partially imported. Some configuration failed to import.
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
Action: None.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 751
Table 3-4. Storage Management Messages (continued)

Event ID: 2381
Description: Controller preserved cache is recovered.
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 751

Event ID: 2382
Description: An unsupported configuration was detected.
Severity: Warning / Non-critical
Table 3-4. Storage Management Messages (continued) Event ID Description 2384 The Warning Warning / level set for the Non-critical hot spare protection policy is violated for the Virtual Disk. 2385 2386 224 Severity Cause and Action Related SNMP Alert Trap Information Numbers Cause: The number of physical disks you specified for the hot spare protection policy is violated.
Table 3-4. Storage Management Messages (continued)

Event ID: 2387
Description: A virtual disk bad block medium error is detected.
Severity: Critical / Failure / Error
Cause: Virtual disk bad blocks are due to presence of unrecoverable bad blocks on one or more member physical disks.
Action:
1 Perform a backup of the virtual disk with the Verify option selected.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2387 contd. 2388 Cause and Action Related SNMP Alert Trap Information Numbers 2 To clear these bad blocks, execute the Clear Virtual Disk Bad Blocks task. 3 Run Patrol Read to ensure no new bad blocks are found. The Controller OK / Normal / Encryption Informational Key is destroyed. Cause: The Controller Encryption Key is destroyed. Action: None.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2392 The drive Encryption Key is invalid. Warning / Non-critical Cause: The controller failed to verify the specified Passphrase. Clear Alert: 753 None The virtual disk is encrypted. OK / Normal / Cause: The Encrypted Informational virtual disk operation on normal virtual disk (created using Selfencrypting disks only) is successful.
Table 3-4. Storage Management Messages (continued)

Event ID: 2396
Description: The Check Consistency detected uncorrectable multiple medium errors
Severity: Critical / Failure / Error
Cause: The Check Consistency task detects uncorrectable multiple errors.
Action: Replace the failed physical disk. You can identify the failed disk by locating the disk that has a red “X” for its status. Rebuild the physical disk.
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 1204
Table 3-4. Storage Management Messages (continued)

Event ID: 2399
Description: The Physical Disk Power status changed from %1 to %2
Severity: OK / Normal / Informational
Cause: The physical disk power status is changed from one state to another. A physical disk can have the following power statuses: spun down, transition, and spun up.
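The three power statuses named above can be captured as an enumeration, with a helper that words a status change. This is an illustrative sketch only; the names PowerStatus and describe_power_change are hypothetical, and the output wording is modeled on the alert description rather than taken from Server Administrator.

```python
from enum import Enum


class PowerStatus(Enum):
    """Physical disk power statuses listed for alert 2399."""
    SPUN_DOWN = "spun down"
    TRANSITION = "transition"
    SPUN_UP = "spun up"


def describe_power_change(old, new):
    """Return a status-change sentence for two different power statuses.

    Hypothetical helper: it only formats text and does not query a disk.
    """
    if old is new:
        raise ValueError("old and new status must differ")
    return (
        "The Physical Disk Power status changed from "
        f"{old.value} to {new.value}"
    )


if __name__ == "__main__":
    print(describe_power_change(PowerStatus.SPUN_UP, PowerStatus.SPUN_DOWN))
```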
Table 3-4. Storage Management Messages (continued)

Event ID: 2403
Description: Virtual Disk is available
Severity: OK / Normal / Informational
Cause: The operating system detects the newly created virtual disk.
Action: None
NOTE: This alert also appears when a CacheCade is created but is not available for the operating system (as it is a CacheCade and not a Virtual Disk).
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2407 Controller Encryption mode is enabled in LKM Informational Cause: The Local Key Management (LKM) encryption mode is enabled. 2411 Cause and Action Action: None Controller Informational Cause: Using Manage LKM Encryption Key Encryption key operations, encryption is changed key is changed.
Table 3-4. Storage Management Messages (continued)

Event ID: 2414
Description: Controller CacheCade is deleted
Severity: Informational
Cause: This alert is provided for informational purposes.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 1201

Event ID: 2415
Description: Controller battery is discharging
Severity: Informational
Cause: The battery learn cycle has started.
Table 3-4. Storage Management Messages (continued)

Event ID: 2417
Description: There is an unrecoverable medium error detected on virtual disk
Severity: Critical / Failure / Error
Cause: Unrecoverable medium error found on one or more member physical disks of a virtual disk.
Action: Perform a backup of the virtual disk with the Verify option selected.
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 1204
Table 3-4. Storage Management Messages (continued)

Event ID: 2417 (continued)
NOTE: If the unrecoverable medium error has not been corrected, it may be reported again by the system. This error can be fixed by writing data on the affected area or deleting and recreating the Virtual Disk as demonstrated in the following procedure.
1 Back up the data.
2 Delete the Virtual Disk.
3 Recreate the Virtual Disk using the same parameters like size, RAID level, disks, etc.
Table 3-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2426 | State change on Physical disk from Non-RAID to READY. | Informational | Cause: User triggered action. Action: Configure the drive to be ready using CLI/GUI. | Clear Alert: None. Related Alert: None | 901
 | Drive Prepared for Removal. | Informational | Cause: User triggered action. | |
Storage Management Message Reference
4 System Event Log Messages for IPMI Systems
The tables in this chapter list the system event log (SEL) messages, their severity, and cause.
NOTE: For corrective actions, see the appropriate documentation.
Temperature Sensor Events
The temperature sensor event messages help protect critical components by alerting the systems management console when the temperature rises inside the chassis.
Table 4-1. Temperature Sensor Events (continued)
Event Message | Severity | Cause
temperature sensor returned to warning state. | Warning | Temperature of the backplane board, system board, or the carrier in the specified system returned from critical state to non-critical state.
temperature sensor returned to normal state. | |
Table 4-1. Temperature Sensor Events (continued)
Event Message | Severity | Cause
The temperature is within range. | Information | Temperature of the backplane, system board, system inlet, or the carrier in the specified system returned to a normal operating range.
Voltage Sensor Events
The voltage sensor event messages monitor the number of volts across critical components.
Table 4-2. Voltage Sensor Events (continued)
Event Message | Severity | Cause
voltage sensor detected a warning. | Warning | Voltage of the monitored entity exceeded the warning threshold.
voltage sensor returned to normal. | Information | The voltage of a previously reported is returned to normal state.
The voltage is less than the lower warning threshold. | |
Fan Sensor Events
The cooling device sensors monitor how well a fan is functioning. These messages provide warning and failure status for the fans in a particular chassis.
Table 4-3. Fan Sensor Events
Event Message | Severity | Cause
Fan sensor detected a failure where is the entity that this sensor is monitoring. For example "BMC Back Fan" or "BMC Front Fan." | Critical |
Table 4-3. Fan Sensor Events (continued)
Event Message | Severity | Cause
redundancy regained | Information | The fan specified by may have started functioning again and hence, the redundancy has been regained.
Fan RPM is less than the lower warning threshold. | Warning | The speed of the specified fan might not provide enough cooling to the system.
Fan RPM is less than the lower critical threshold. | |
Table 4-3. Fan Sensor Events (continued)
Event Message | Severity | Cause
Fan redundancy is lost. | Critical | One or more required fans may have failed or been removed, and hence the redundancy was lost.
Fan redundancy is degraded. | Warning | One or more fans may have failed or been removed, and hence the redundancy has been degraded.
Processor Status Events
The processor status messages monitor the functionality of the processors in a system.
Table 4-4. Processor Status Events (continued)
Event Message | Severity | Cause
status processor sensor terminator not present. | Information | This event is generated if the terminator is missing on an empty processor slot.
presence was deasserted. | Critical |
presence was asserted. | Information | This event is generated when the earlier processor detection error was corrected.
thermal tripped was deasserted. | |
Table 4-4. Processor Status Events (continued)
Event Message | Severity | Cause
CPU terminator is present. | Information | This event is generated if the terminator is present on a processor slot.
CPU terminator is absent. | Warning | This event is generated if the terminator is missing on an empty processor slot.
CPU is throttled. | Warning | This event is generated when the processor slows down to prevent overheating.
CPU is absent. | |
Table 4-5. Power Supply Events (continued)
Event Message | Severity | Cause
power supply sensor returned to normal state. | | power supply that failed or removed was replaced and the state has returned to normal.
PS Redundancy sensor redundancy degraded. | Information | Power supply redundancy is degraded if one of the power supply sources is removed or failed.
PS Redundancy sensor redundancy lost. | |
Table 4-5. Power Supply Events (continued)
Event Message | Severity | Cause
PS 1 Status: Power supply sensor for PS 1, failure was asserted | Critical | This event is generated when the power supply has failed.
PS 1 Status: Power supply sensor for PS 1, failure was deasserted | Information | This event is generated when the power supply has recovered from an earlier failure event.
Table 4-5. Power Supply Events (continued)
Event Message | Severity | Cause
A predictive failure detected on power supply. | Warning | This event is generated when the power supply is about to fail.
The power input for power supply is lost. | Critical | This event is generated when input power is removed from the power supply.
The input power for power supply has been restored. | Information | This event is generated if the power supply has been reconnected or replaced.
Table 4-5. Power Supply Events (continued)
Event Message | Severity | Cause
An over current fault detected on power supply. | Critical | The specified power supply detected an over current condition.
Fan failure detected on power supply. | Critical | The specified power supply fan has failed.
Communication has been restored to power supply. | Information | This event is generated when the power supply has recovered from an earlier communication problem.
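Some messages in Table 4-5, such as "PS 1 Status: Power supply sensor for PS 1, failure was asserted", appear to follow a "sensor name: sensor description, event was asserted/deasserted" shape. The following is a minimal sketch of splitting such a message into its parts. The pattern is inferred from the sample rows only and is not a documented grammar; the names SelEvent and parse_sel_message are hypothetical and are not part of Server Administrator.

```python
import re
from typing import NamedTuple, Optional


class SelEvent(NamedTuple):
    """Parts of an 'asserted/deasserted' style SEL message (hypothetical type)."""
    sensor_name: str         # text before the colon, e.g. "PS 1 Status"
    sensor_description: str  # text up to the first comma
    event: str               # text after the first comma, e.g. "failure"
    asserted: bool           # True for "was asserted", False for "was deasserted"


# Assumed shape: "<name>: <description>, <event> was asserted|deasserted[.]"
_PATTERN = re.compile(
    r"^(?P<name>[^:]+):\s*(?P<body>.+?)\s+was\s+(?P<state>asserted|deasserted)\.?\s*$"
)


def parse_sel_message(text: str) -> Optional[SelEvent]:
    """Return the parsed parts, or None if the message does not fit the assumed shape."""
    match = _PATTERN.match(text.strip())
    if match is None:
        return None
    # Split the body at the first comma: description on the left, event on the right.
    description, _, event = match.group("body").partition(", ")
    return SelEvent(
        sensor_name=match.group("name").strip(),
        sensor_description=description.strip(),
        event=event.strip(),
        asserted=match.group("state") == "asserted",
    )
```

Messages that do not use this shape, such as "Fan failure detected on power supply.", return None, so a caller can fall back to matching the full message text against the tables.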
Memory ECC Events
The memory ECC event messages monitor the memory modules in a system. These messages monitor the ECC memory correction rate and the type of memory events that occurred.
Table 4-6. Memory ECC Events
Event Message | Severity | Cause
ECC error correction detected on Bank # DIMM [A/B]. | Information | This event is generated when there is a memory error correction on a particular Dual Inline Memory Module (DIMM).
ECC uncorrectable error detected on Bank # [DIMM]. | |
BMC Watchdog Events
The BMC watchdog operations are performed when the system hangs or crashes. These messages monitor the status and occurrence of these events in a system.
Table 4-7. BMC Watchdog Events
Event Message | Severity | Cause
BMC OS Watchdog timer expired. | Information | This event is generated when the BMC watchdog timer expires and no action is set.
BMC OS Watchdog performed system reboot. | |
Table 4-7. BMC Watchdog Events (continued)
Event Message | Severity | Cause
The OS watchdog timer powered cycle the system. | Critical | This event is generated when the BMC watchdog detects that the system has crashed (timer expired because no response was received from Host) and the action is set to power cycle.
The OS watchdog timer powered off the system. | |
Table 4-8. Memory Events (continued)
Event Message | Severity | Cause
Memory Mirrored redundancy degraded. | Warning | This event is generated when there is a memory failure in a mirrored memory configuration.
Memory Mirrored redundancy lost. | Critical | This event is generated when redundancy is lost in a mirrored memory configuration.
Memory Mirrored redundancy regained. | Information | This event is generated when the redundancy lost or degraded earlier is regained in a mirrored memory configuration.
Table 4-8. Memory Events (continued)
Event Message | Severity | Cause
Memory mirror is redundant. | Information | This event is generated when the memory redundancy mode has changed to mirror redundant.
Memory mirror redundancy is lost. Check memory device at location(s). | Critical | This event is generated when redundancy is lost in a mirror-configured memory configuration.
Memory mirror redundancy is degraded. Check memory device at location. | Warning |
Memory spare is redundant. | |
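The mirrored-memory rows in Table 4-8 pair each redundancy state with a severity: degraded is Warning, lost is Critical, and regained is Information. The following is a minimal sketch of that mapping as a keyword lookup, assuming the message wording shown in the rows above. The function name memory_mirror_severity is hypothetical, and the lookup deliberately covers only the states whose severity appears in this excerpt.

```python
from typing import Optional

# Severity per redundancy state, taken from the Table 4-8 rows in this excerpt.
_REDUNDANCY_SEVERITY = {
    "degraded": "Warning",
    "lost": "Critical",
    "regained": "Information",
}


def memory_mirror_severity(message: str) -> Optional[str]:
    """Return the severity for a memory mirror redundancy message, or None if unknown."""
    text = message.lower()
    # Only messages that mention "redundancy" are covered by this sketch.
    if "redundancy" not in text:
        return None
    for keyword, severity in _REDUNDANCY_SEVERITY.items():
        if keyword in text:
            return severity
    return None
```

A message such as "Memory mirror is redundant." returns None here because its wording does not contain "redundancy"; a fuller lookup would match complete message strings instead of keywords.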
Table 4-9. Hardware Log Sensor Events
Event Message | Severity | Cause
Log full detected. | Critical | This event is generated when the SEL device detects that only one entry can be added to the SEL before it is full.
Log cleared. | Information | This event is generated when the SEL is cleared.
Drive Events
The drive event messages monitor the health of the drives in a system. These events are generated when there is a fault in the drives indicated.
Table 4-10.
Table 4-10. Drive Events (continued)
Event Message | Severity | Cause
Drive hot spare was deasserted | Informational | This event is generated when the drive is taken out of hot spare.
Drive consistency check in progress was asserted | Warning | This event is generated when the drive is placed in consistency check.
Drive consistency check in progress was deasserted | Informational | This event is generated when the consistency check of the drive is completed.
Table 4-10. Drive Events (continued)
Event Message | Severity | Cause
Fault detected on drive. | Critical | This event is generated when the specified drive in the array is faulty.
Intrusion Events
The chassis intrusion messages are a security measure. Chassis intrusion alerts are generated when the system's chassis is opened. Alerts are sent to prevent unauthorized removal of parts from the chassis.
Table 4-11.
Table 4-11. Intrusion Events (continued)
Event Message | Severity | Cause
The chassis is closed while the power is on. | Information | This event is generated when the earlier intrusion has been corrected while the power is on.
The chassis is open while the power is off. | Critical | This event is generated when the intrusion sensor detects an intrusion while the system is off.
The chassis is closed while the power is off. | |
Table 4-12. BIOS Generated System Events (continued) Event Message Severity Cause System Event PCIE Fatal Err. Critical This error is generated when a fatal error is detected on the PCIE bus. POST Err Critical This event is generated when an error occurs during system boot. See the system documentation for more information on the error code. POST fatal error # Critical or This event is generated when a fatal error occurs during system boot.
Table 4-12. BIOS Generated System Events (continued) Event Message Severity Cause Information This event is generated when memory is added to the system. (BANK# DIMM#) presence was asserted Memory Add (BANK# DIMM#) presence was asserted Information This event is generated when memory is removed from the system. Memory Cfg Err Critical Memory Removed configuration error (BANK# DIMM#) was asserted This event is generated when memory configuration is incorrect for the system.
Table 4-12. BIOS Generated System Events (continued) Event Message Severity Cause USB Over-current Critical This event is generated when the USB exceeds a predefined current level. transition to non-recoverable Hdwr version err hardware Critical incompatibility (BMC/iDRAC Firmware and CPU mismatch) was asserted This event is generated when there is a mismatch between the BMC and iDRAC firmware and the processor in use or vice versa.
Table 4-12. BIOS Generated System Events (continued)
Event Message | Severity | Cause
LinkT/FlexAddr: Link Tuning sensor, device option ROM failed to support link tuning or flex address (Mezz XX) was asserted | Critical | This event is generated when the PCI device option ROM for a NIC does not support link tuning or the Flex addressing feature.
LinkT/FlexAddr: Link Tuning sensor, failed to program virtual MAC address () was asserted. | |
Table 4-12. BIOS Generated System Events (continued)
Event Message | Severity | Cause
A PCI system error was detected on a component at bus device function. | Critical | This is generated when the system has crashed and recovered.
A PCI system error was detected on a component at slot. | Critical | This is generated when the system has crashed and recovered.
A bus correctable error was detected on a component at bus device function. | |
Table 4-12. BIOS Generated System Events (continued)
Event Message | Severity | Cause
A fatal IO error detected on a component at bus device function. | Critical | This error is generated when a fatal IO error is detected.
A fatal IO error detected on a component at slot. | Critical | This error is generated when a fatal IO error is detected.
A non-fatal PCIe error detected on a component at bus device function. | Warning |
Table 4-12. BIOS Generated System Events (continued)
Event Message | Severity | Cause
Memory device at location is overheating. | Critical | This event is generated when system memory reaches critical temperature.
An OEM diagnostic event occurred. | Information | This event is generated when an OEM event occurs. OEM events can be used by the Dell service team to better understand the cause of the failure.
CPU protocol error detected. | |
Table 4-12. BIOS Generated System Events (continued)
Event Message | Severity | Cause
A hardware incompatibility detected between BMC/iDRAC firmware and CPU. | Critical | This event is generated when there is a mismatch between the BMC and iDRAC firmware and the processor in use or vice versa.
A hardware incompatibility was corrected between BMC/iDRAC firmware and CPU. | Information | This event is generated when an earlier mismatch between the BMC and iDRAC firmware and the processor is corrected.
POST Code Table
Table 4-13 lists the POST Code errors that are generated when a fatal error occurs during system boot.
Table 4-13. POST Code Errors
Fatal Error Code | Description | Cause
80 | No memory detected | This error code implies that no memory is installed.
81 | Memory detected but is not configurable | This error code indicates a memory configuration error that could be a result of bad memory, mismatched memory, or a bad socket.
82 | Memory configured but not usable. |
Table 4-13. POST Code Errors (continued)
Fatal Error Code | Description | Cause
C0 | Shutdown test failure | This error code indicates a shutdown test failure.
C1 | POST Memory test failure | This error code indicates bad memory detection.
C2 | RAC configuration failure | Check screen for the actual error message
C3 | CPU configuration failure | Check screen for the actual error message
C4 | Incorrect memory configuration | Memory population order not correct.
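The POST codes in Table 4-13 lend themselves to a simple lookup. The following is a minimal sketch that maps a fatal error code to its description, using only the codes visible in this excerpt (the full table may list more). The names POST_CODES and describe_post_code are hypothetical, and accepting a "0x" prefix or lowercase input is a convenience assumption, not something the table specifies.

```python
# Fatal error codes and descriptions copied from the Table 4-13 rows shown above.
POST_CODES = {
    "80": "No memory detected",
    "81": "Memory detected but is not configurable",
    "82": "Memory configured but not usable",
    "C0": "Shutdown test failure",
    "C1": "POST Memory test failure",
    "C2": "RAC configuration failure",
    "C3": "CPU configuration failure",
    "C4": "Incorrect memory configuration",
}


def describe_post_code(code: str) -> str:
    """Return the description for a POST fatal error code from this excerpt."""
    key = code.strip().upper()
    # Tolerate an optional hexadecimal prefix such as "0x80" (assumption).
    if key.startswith("0X"):
        key = key[2:]
    return POST_CODES.get(key, "Unknown POST code (not in this excerpt)")
```

For example, describe_post_code("c4") yields "Incorrect memory configuration", and a code that is not listed here falls through to the "Unknown" text instead of raising an error.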
Table 4-14. Operating System Generated Events (continued)
Description | Severity | Cause
A runtime critical stop occurred. | Critical | The operating system encountered a critical error and was stopped abnormally.
An OS graceful stop occurred. | Information | The operating system was stopped.
An OS graceful shut-down occurred. | Information | The operating system was shut down normally.
Cable Interconnect Events
The cable interconnect messages in Table 4-15 are used for detecting errors in the hardware cabling.
Table 4-15.
Battery Events
Table 4-16. Battery Events
Description | Severity | Cause
 | Critical | This event is generated when the sensor detects a failed or missing battery.
 | Information | This event is generated when the earlier failed battery was corrected.
 | Warning | This event is generated when the sensor detects a low battery condition.
 | Information | This event is generated when the earlier low battery condition was corrected.
Power And Performance Events
The power and performance events are used to detect degradation in system performance with change in power supply.
Table 4-17. Power And Performance Events
Description | Severity | Cause
System Board Power Optimized: Performance status sensor for System Board, degraded, was deasserted | Normal | This event is generated when system performance was restored.
Table 4-17. Power And Performance Events (continued)
Description | Severity | Cause
System Board Power Optimized: Performance status sensor for System Board, degraded, user defined power capacity was asserted | Warning | This event is generated when a change in power supply degrades system performance.
System Board Power Optimized: Performance status sensor for System Board, degraded, user defined power capacity was deasserted | Normal | This event is generated when the system performance is restored.
Table 4-17. Power And Performance Events (continued)
Description | Severity | Cause
The system performance degraded because of thermal protection. | Warning | This event is generated when a change in thermal protection degrades system performance.
The system performance degraded because cooling capacity has changed. | Warning | This event is generated when a change in cooling degrades system performance.
The system performance degraded because power capacity has changed. | Warning |
Table 4-17. Power And Performance Events (continued)
Description | Severity | Cause
The system performance restored | Information | This event is generated when system performance was restored.
Entity Presence Events
The entity presence messages are used for detecting different hardware devices.
Table 4-18. Entity Presence Events
Description | Severity | Cause
 | Information | This event is generated when the device was detected.
 | Critical | This event is generated when the device was not detected.
Miscellaneous
The following table lists events related to hardware and software components, such as mezzanine cards, sensors, and firmware, and to compatibility issues.
Table 4-19. Miscellaneous Events
Description | Severity | Cause
System Board Video Riser: Module sensor for System Board, device removed was asserted | Critical | This event is generated when the required module is removed.
Table 4-19. Miscellaneous Events (continued)
Description | Severity | Cause
Hdwar version err: Version Change sensor, hardware incompatibility (BMC firmware and CPU mismatch) was asserted | Critical | This event is generated when the CPU and firmware are not compatible.
Link Tuning: Version Change sensor, successful software or F/W change was deasserted | Warning | This event is generated when the link tuning setting for proper NIC operation fails to update.
Table 4-19. Miscellaneous Events (continued)
Description | Severity | Cause
LinkT/FlexAddr: Link Tuning sensor, failed to get link tuning or flex address data from BMC/iDRAC was asserted | Critical | This event is generated when link tuning or Flex address information is not obtained from BMC/iDRAC.
The is removed. | Critical | This event is generated when the device was removed.
The is inserted. | Information | This event is generated when the device was inserted or installed.
Table 4-19. Miscellaneous Events (continued)
Description | Severity | Cause
 | Critical | This event is generated when TXT Post failed.
SINIT Authenticated Code Module detected an Intel Trusted Execution Technology (TXT) error at boot. | Critical | This event is generated when the Authenticated Code Module detected a TXT initialization failure.
Intel Trusted Execution Technology (TXT) is operating correctly. | Information | This event is generated when the TXT returned from a previous failure.