Dell EMC PowerEdge Servers Troubleshooting Guide
Notes, cautions, and warnings NOTE: A NOTE indicates important information that helps you make better use of your product. CAUTION: A CAUTION indicates either potential damage to hardware or loss of data and tells you how to avoid the problem. WARNING: A WARNING indicates a potential for property damage, personal injury, or death. © 2017 - 2018 Dell Inc. or its subsidiaries. All rights reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or its subsidiaries.
Contents 1 Introduction....................................................................................................................................................8 Audience..............................................................................................................................................................................8 Recommended tools..........................................................................................................................................
Troubleshooting a micro SD card................................................................................................................................... 49 Troubleshooting expansion cards................................................................................................................................... 49 Troubleshooting processors............................................................................................................................................
Downloading and installing the RAID controller log export by using PERCCLI tool on ESXi hosts on Dell’s 13th generation of PowerEdge servers.........................................................................................................80 Configuring RAID by using Lifecycle Controller......................................................................................................84 Starting and target RAID levels for virtual disk reconfiguration and capacity expansion..........................
Processor TEMP error.....................................................................................................................................................111 PowerEdge T130, R230, R330, and T330 servers may report a critical error during scheduled warm reboots.. 111 SSD is not detected.........................................................................................................................................................111 TRIM/UNMAP and Dell Enterprise SSD Drives Support.....
Contacting Dell EMC...................................................................................................................................................... 131 Downloading the drivers and firmware......................................................................................................................... 131 Locating Service Tag of your system...........................................................................................................................
1 Introduction Use this guide to learn how to identify and troubleshoot the Dell PowerEdge server issues. In particular, this guide: • Provides troubleshooting procedures for issues related to Server Operating System, Server Hardware, and Server Management Software. • Provides an overview of diagnostic indicators and describes how to use the indicator codes to facilitate troubleshooting.
Documentation resources This section provides information about the documentation resources for your system. To view the document that is listed in the documentation resources table: • From the Dell EMC support site: a Click the documentation link that is provided in the Location column in the table. b Click the required product or product version. NOTE: To locate the product name and model, see the front of your system. c • On the Product Support page, click Manuals & documents.
Task Document Location Managing your system For information about systems management Dell.com/poweredgemanuals software offered by Dell, see the Dell OpenManage Systems Management Overview Guide. For information about setting up, using, and troubleshooting OpenManage, see the Dell OpenManage Server Administrator User’s Guide. Dell.
2 Diagnostic indicators The diagnostic indicators on the system indicates operation and error status. Status LED indicators The status LED indicators on the system front panel display error status during system startup. NOTE: No status LED indicators are illuminated when the system is turned off. To start the system, plug it into a working power source and press the power button. NOTE: Status LED indicators are always off and only turn solid amber if any error occurs. Table 2.
System health and system ID indicator codes Figure 1. System health and system ID indicators Table 3. System health and system ID indicator codes System health and system ID indicator code Condition Blue solid Indicates that the system is turned on, system is healthy and system ID mode is not active. Press the system health and system ID button to switch to system ID mode. Blinking blue Indicates that the system ID mode is active.
Wireless indicator code Condition Corrective action Blinks white slowly Indicates that firmware update is in progress. If the indicator continues to blink indefinitely, see the Getting help section. Blinks white five times rapidly and then turns off Indicates that the iDRAC Quick Sync 2 feature is disabled. Check if iDRAC Quick Sync 2 feature is configured to be disabled by iDRAC. If the problem persists, see the Getting help section.
Table 6. NIC indicators Status Condition Link and activity indicators are off The NIC is not connected to the network. Link indicator is green and activity indicator is blinking green The NIC is connected to a valid network at its maximum port speed and data is being sent or received. Link indicator is amber and activity indicator is blinking green The NIC is connected to a valid network at less than its maximum port speed and data is being sent or received.
Power indicator codes Condition CAUTION: When correcting a PSU mismatch, replace only the PSU with the blinking indicator. Swapping the PSU to make a matched pair can result in an error condition and unexpected system shutdown. To change from a high output configuration to a low output configuration or vice versa, you must turn off the system. CAUTION: AC PSUs support both 240 V and 120 V input voltages with the exception of Titanium PSUs, which support only 240 V.
Non-redundant power supply unit indicator codes Press the self-diagnostic button to perform a quick health check on the non-redundant power supply unit (PSU) of the system. Figure 6. Non-redundant AC PSU status indicator and self-diagnostic button 1 Self-diagnostic button 2 AC PSU status indicator Table 9. Non-redundant AC PSU status indicator Power Indicator Pattern Condition Not lit Power is not connected or PSU is faulty.
Hard drive indicator codes Each hard drive carrier has an activity LED indicator and a status LED indicator. The indicators provide information about the current status of the hard drive. The activity LED indicator indicates whether the hard drive is currently in use or not. The status LED indicator indicates the power condition of the hard drive. Figure 7.
uSATA SSD indicator codes Figure 8. uSATA SSD indicators 1 uSATA SSD activity indicator 3 uSATA SSD 2 uSATA SSD status indicator NOTE: If the SSD is in the Advanced Host Controller Interface (AHCI) mode, the status indicator (on the right side) does not function and remains off. Table 11. Drive status indicator codes Drive-status indicator pattern Condition Flashes green twice per second Identifying drive or preparing for removal. Off Drive ready for insertion or removal.
Table 12. IDSDM indicator codes Convention IDSDM indicator code Description A Green Indicates that the card is online. B Flashing green Indicates rebuild or activity. C Flashing amber Indicates card mismatch or that the card has failed. D Amber Indicates that the card is offline, has failed, or is write-protected. E Not lit Indicates that the card is missing or is booting.
3 Running diagnostics Running diagnostics help you to identify the cause for a system issue. The diagnostics test your system hardware without requiring additional equipment or risking data loss.
Table 13. PSA/ePSA error codes Error number (PSA and ePSA) Error message Description Steps PSA NA CPU - exception occurred An error occurred during the tests that may involve the system board. 1 ePSA 2000-0111 2 3 PSA NA ePSA 2000-0112 CPU - machine check exception detected An error occurred during the tests that may involve the system board. 1 2 3 PSA NA ePSA 2000-0114 CPU - Cache integrity test discrepancy An error occurred during the tests that may involve the system board.
Error number (PSA and ePSA) Error message Description Steps PSA 1000-0123 Memory - integrity test failed An error occurred during the tests that may involve the system board or memory of the system. However, the system has self repaired. 1 System Log - , This is information in the system log to show time and messages related to system events. 1 2 Clear the system log. Repeat the PSA diagnostics.
Error number (PSA and ePSA) PSA 1000-0142 ePSA 2000-0142 Error message PSA Hard Drive - drive self test failed Description Steps good drive can be detected in the system or try the suspect drive in a working system. 5 If failure continues, contact Technical Support 1 2 Update to the latest BIOS. Turn off your computer and reconnect your hard disk drive (HDD) to the system board for instructions.) Repeat the PSA diagnostics. Your hard disk drive has indicated a failure.
Error number (PSA and ePSA) PSA 1000-0148 ePSA 2000-0148 replaced by 2000-0151, 2000-0152 Error message PSA Optical Drive - BIST --(s) ePSA Optical Drive - incorrect status Description Your CD or DVD drive has indicated a failure. Steps 3 Repeat the PSA diagnostics. 1 Update to the latest BIOS version. Turn off your computer and reconnect your optical drive to the system board. Repeat the PSA diagnostics.
Error number (PSA and ePSA) Error message Description Steps 3 4 PSA NA ePSA 2000-0154 Tape Drive - Tape Drive [s] - S/N Check installation of the tape [d], incorrect status = [d] [d] drive, cables and connections. If the error persists, ensure that the drive firmware is current. 1 2 3 4 PSA NA Hard Drive - Not Installed ePSA 2000-0155 This is an error displayed when HDD is not inserted in the notebooks.
Error number (PSA and ePSA) Error message Description Steps no 2000-0123 memory error & If 3 diagnostics fail again after the BIOS is current, contact Technical Support to resolve the problem. If failure continues, contact Dell Technical Support System board - Timer - Interval timer initial clock output level ePSA 2000-0223 (Not used with incorrect UEFI BIOS) An error occurred during the 1 tests that may involve the main system board of the system.
Error number (PSA and ePSA) Error message Description Steps 2000-0123 memory error & If diagnostics fail again after the BIOS is current, contact Technical Support to resolve the problem. 3 If failure continues, contact Dell Technical Support PSA System board - timeout waiting for RTC update flag to ePSA 2000-0234 (Not used with set UEFI BIOS) ePSA System board - HPET incorrect time period. An error occurred during the 1 tests that may involve the main system board of the system.
Error number (PSA and ePSA) PSA NA Error message Description Steps USB devices in a different port. Try a known good USB device. 3 If failure continues, contact Dell Technical Support System board - USB device An error occurred during the 1 tests that may involve the USB controller or ports of the main 2 system board of the system. Disconnect any USB devices and 3 run the diagnostic again. Test USB devices in a different port. Try a known good USB device.
Error number (PSA and ePSA) Error message Description Steps PSA 1000-0321 PSA LCD EDID - unable to access EDID EEPROM LCD Extended Display Identification Data (EDID) unable to access the EDID Electrically Erasable Programmable Read-Only Memory (EEPROM) in the LCD display is indicating a data failure. If there is video on the LCD, then the Display does not need replacement.
Error number (PSA and ePSA) Error message Description Steps PSA NA LCD panel - unable to use BIOS interface The Liquid Crystal Display (LCD) panel does not display the BIOS screen correctly and the BIOS screen is not visible on the LCD. 1 2 ePSA 2000-0327 3 4 PSA NA ePSA 2000-0328 LCD panel - unable to detect variance in ambient light sensor The Ambient light sensor that automatically dims the LCD in low light did not respond during the diagnostics.
Error number (PSA and ePSA) PSA NA Error message Cables - not detected ePSA 2000-0411 Description Steps and horizontal color bars without 3 distortion, lines, or color problems, re-run the diagnostic 4 and if the bar appears normal, click Yes. 5 Turn off your computer and reconnect your LCD cable. Repeat the PSA diagnostics.. If failure continues, contact Dell Technical Support Normally, the cable involved in 1 the error (LCD LVDS CABLE for 2 example) is indicated in the error message.
Error number (PSA and ePSA) PSA NA ePSA 2000-0415 Error message Cables - Check the following cables, jumper, connection, or sensors: [s] Description Steps 5 If failure continues, contact Dell Technical Support Normally, the cable involved in 1 the error (LCD LVDS CABLE for example) is indicated in the error 2 message. Reseat the cable connection and inspect the cable and connections for damage. Update to the latest BIOS version.
Error number (PSA and ePSA) Error message PSA NA BIOS - No BIOS support for The motherboard BIOS revision 1 software interrupt [x] function[x] may not be current. Update the [x] BIOS to the most current version 2 and the issue should resolve. 3 Update to the latest BIOS version. Repeat the PSA diagnostics. If failure continues, contact Dell Technical Support BIOS - No BIOS support for SMI interface function(x) or Sensor [x] exceeded thermal zone [d]. Peak zone was [d].
Error number (PSA and ePSA) Error message Description Steps C. Unable to start application processors D. Unable to stop all APs PSA NA Diagnostics - Software Error ePSA 2000-8009 The System may be unstable and 1 should be rebooted. 2 3 PSA NA ePSA 2000-800B BIOS - Retrieve vendor ID function error The system may be unstable. 1 2 3 PSA NA ePSA 2000-800C PSA NA ePSA 2000-800D PSA NA ePSA 2000-800E ePSA 2000-800F The motherboard BIOS revision 1 may not be current.
Error number (PSA and ePSA) Error message PSA NA USB Device Diagnostics - Invalid The attached USB device is status returned from the device returning an invalid status to the BIOS. ePSA 2000-8011 Description Steps 1 2 3 PSA NA ePSA 2000-8012 Diagnostics - Invalid parameter passed to the device. Unknown test [d] selected An unexpected parameter was passed to a device under test.
Error number (PSA and ePSA) Error message Description Steps PSA NA Diagnostics - Low memory. [d]k bytes free! The system may be unstable. 1 ePSA 2000-8020 2 3 PSA NA ePSA 2000-8115 Diagnostics - Unable to stop all APs The system may be unstable. 1 2 3 PSA NA ePSA 2000-8154 Tape Drive - Tape Drive [d] - S/N Install the correct tape drive [s], ULTRIUM [d] media found media. but drive.
Error number (PSA and ePSA) Error message Description • PSA NA ePSA 2000-8165 PSA NA OS - MBR code is unknown, possibly infected by a virus ePSA 2000-8170 PSA NA ePSA 2000-8415 Remaining capacity low The Master Boot Record on the Hard Disk Drive has unrecognized code present which is an indication of a virus infecting the system. It is recommended that you scan your system with an up-to-date antivirus application. 1 OS - Detected virus (s) A virus appears to be infecting the system.
Figure 9. Opening the Systems page 2 In the System page, click Advanced system settings in the left pane. Figure 10. Advanced system settings page 3 In the System Properties window, click Settings under the Startup and Recovery section.
Figure 11. System Properties window 4 In the Startup and Recovery window, System failure section, do the following: a Select Write an event to the system log to ensure that the minidump file is created in the event of a system failure. b Select Automatically restart to restart the system after a blue screen of death (BSOD) occurs. NOTE: For servers, it is recommended that you select the Automatically restart option so that the server can function if the error is not critical.
Figure 12. Startup and Recovery window 5 When a BSOD occurs, collect the minidump files by navigating to C:\windows\minidump. 6 Download and install the Windows Debugger (WinDbg) tool from https://msdn.microsoft.com/en-us/windows/hardware/ hh852365.aspx. Steps 1 Open WinDbg. 2 Click File → Symbol File Path or press Ctrl + S on the keyboard. 3 In the Symbol Path field, enter SRV**http://msdl.microsoft.
c 11 IMAGE_NAME Call Dell Technical Support for further assistance.
4 Troubleshooting hardware issues This section helps you troubleshoot hardware issues in your system. NOTE: If the issue still persists, contact Dell Technical Support for assistance.
Resolution: To resolve the issue, follow these steps: 1 Switch off the system. 2 Disconnect all power cables to the server’s power supply units. 3 Remove the system cover. 4 Reseat all the cables of hard drive backplane at both ends. 5 Reseat all the drives. 6 Replace the system cover. 7 Connect the power cables to the server’s power supply units. 8 Power on the system 9 To enter UEFI, Press F2.
3 If the problem is resolved, restart the system, enter System Setup, and check if the non-functioning USB ports are enabled. 4 In iDRAC Settings Utility, ensure that USB Management Port Mode is configured as Automatic or Standard OS Use. 5 If the problem is not resolved, replace the keyboard and/or mouse with a known working keyboard or mouse. If the problem is not resolved, proceed to step 6 to troubleshoot other USB devices attached to the system.
Next step If all troubleshooting fails, see the Getting help section. Troubleshooting a serial Input Output device Steps 1 Turn off the system and any peripheral devices connected to the serial port. 2 Swap the serial interface cable with a known working cable, and turn on the system and the serial device. If the problem is resolved, replace the interface cable with a known working cable. 3 Turn off the system and the serial device, and swap the serial device with a compatible device.
Troubleshooting a wet system Prerequisite CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple repairs as authorized in your product documentation, or as directed by the online or telephone service and support team. Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions that are shipped with your product.
• cooling shroud • expansion card risers (if installed) • expansion cards • power supply unit(s) • cooling fan assembly (if installed) • cooling fan(s) • processor(s) and heat sink(s) • memory modules 4 • drive carriers or cage Ensure that all cables are properly connected. 5 Install the system cover. 6 Run the appropriate diagnostic test. For more information, see the Using system diagnostics section. Next step If the problem persists, see the Getting help section.
• The expansion card installation guidelines have been followed. Additional cooling can be added by one of the following methods: From the iDRAC web GUI: 1 Click Hardware > Fans > Setup. 2 From the Fan Speed Offset drop-down list, select the cooling level required or set the minimum fan speed to a custom value. From F2 System Setup: 1 Select iDRAC Settings > Thermal, and set a higher fan speed from the fan speed offset or minimum fan speed.
7 If the problem is not resolved, repeat step 2 and step 3. 8 Insert a known working USB key. 9 Install the system cover. Next step If the problem persists, see the Getting help section. Troubleshooting a micro SD card Prerequisite NOTE: Certain micro SD cards have a physical write-protect switch on the card. If the write-protect switch is turned on, the micro SD card is not writable. Steps 1 Enter System Setup, and ensure that the Internal SD Card Port is enabled.
9 Remove all expansion cards installed in the system. 10 Install the system cover. 11 Run the appropriate diagnostic test. See the Using system diagnostics section. If the tests fail, see the Getting help section. 12 For each expansion card you removed in step 8, perform the following steps: a b c d e Turn off the system and attached peripherals, and disconnect the system from the electrical outlet. Remove the system cover. Reinstall one of the expansion cards. Install the system cover.
Troubleshooting a storage controller CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple repairs as authorized in your product documentation, or as directed by the online or telephone service and support team. Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions that are shipped with your product.
screen. You can use the VD Mgmt screen to view the foreign configuration before importing. After you view the foreign configuration, you can either clear or import to the RAID controller. NOTE: Before you import the foreign configuration, review the configuration on the screen to ensure that it is the end result that you require. You can use the Foreign Config screen to manage foreign configurations in the following cases: • All the physical disks in a configuration are removed and re-inserted.
a b c Select Foreign Configuration View to display the complete foreign configuration information. Press to display the options Import and Clear. Select Import to import the foreign configurations to the virtual disk or Clear to delete the foreign configuration(s) from the reinserted disk(s). No rebuilds occur after the import operation because there is no redundant data to rebuild the disks with.
Solution: The controller is working as deigned. • For more information about installation and removal of NVMe PCIe SSD in the system, see Replacing and configuring hardware. • For configuring and managing the PCIe SSD adapter, see Configuring And Managing Your NVMe PCIe SSD Adapter. 12 Gbps hard drive does not support in SAS 6ir RAID controllers Issue: 12 Gbps drive does not support in SAS 6ir RAID controllers. Resolution: • The controller is working as designed.
1 Download the BIOS update package at: Dell.com/support. 2 When the File Download window appears, click Save to save the file to your hard drive. 3 Read the release information by running the command "./[model]_BIOS_LX[revision].BIN--version" command from the shell. 4 Download and perform the prerequisites identified in the above steps before proceeding. 5 Install any necessary Embedded Systems Management firmware prior to this BIOS update. 6 Run the update "./[model]_BIOS_LX[revision].
1 Download the BIOS update package at: Dell.com/support. 2 When the File Download window appears, click Save to save the file to your hard drive. 3 Browse to the location where you downloaded the file and double-click the new file. 4 Read over the release information presented in the dialog window. 5 Download and install any prerequisites identified in the dialog box before proceeding. 6 Click the Install button. 7 Follow the remaining prompts to perform the update.
6 Select OK. Firmware or Physical disks out-of-date Issue:The Firmware or Drives out of date adn results in unknown disk errors. Solution: • Update the Firmware/ PERC Controller. Cannot boot to Windows due to foreign configuration About this task When a foreign configuration exists in the controller, and you cannot boot to Windows, do the following: Steps 1 Import the foreign configuration to the RAID controller. See, Importing foreign configuration using PERC. 2 Rebuild the failed hard drives.
To manage the preserved cache: Steps 1 On the VD Mgmt screen, click on a controller icon. 2 Press to display the available actions. 3 Select Manage Preserved Cache. A message is displayed advising you to import the foreign configuration before you discard the preserved cache to avoid losing data belonging to the foreign configuration. Confirm whether you want to continue. The Manage Preserved Cache screen displays the affected virtual disks.
Troubleshooting hard drives Prerequisites CAUTION: This troubleshooting procedure can erase data stored on the hard drive. Before you proceed, back up all files on the hard drive. CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Checking hard drive status in the PERC BIOS 1 Press Ctrl + R during the POST process to open PERC BIOS Configuration Utility. Figure 13. PERC Configuration Utility VD Mgmt screen 2 To check if the hard drive is participating in the RAID array, perform the following steps: a Press Ctrl + N to go to the PD Mgmt (Physical Disk Management) screen and check if any hard drives are offline or missing. Figure 14.
Table 14. Hard drive status Hard drive status Description Offline Hard drive is not part of the RAID array. Online Hard drive is part of the RAID array. Ready Hard dive is ready to be a part of the RAID array. FAQs How to identify a hard drive failure? Hard drive failures may occur because of logical, head, or mechanical failures. The following table describes the symptoms of failing hard drives: Table 15.
If you have any of the above error conditions, contact Dell Technical Support. When you call, keep the SupportAssist report and the troubleshooting logs ready. How do I extract RAID-Controller Logs using MegaCLI for Windows and Linux? NOTE: To export information about the status of the RAID controller and the connected hard drives, use the MegaCLI tool by LSI (Avagotech).
Corrective action—Update the hard drive firmware/PERC controller. For information about driver installation, see the driver installation section. For information about firmware installation, see the Firmware section. Drives not accessible Multiple physical disk errors in a single array typically indicate a failure in cabling or connection and could involve the loss of data. You can recover the virtual disk after multiple physical disks become simultaneously inaccessible.
Troubleshooting a tape backup unit Prerequisite CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple repairs as authorized in your product documentation, or as directed by the online or telephone service and support team. Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions that are shipped with your product. Steps 1 Use a different tape cartridge.
10 Enter System Setup and check the system memory setting. If the problem is not resolved, proceed with step 11. 11 Remove the system cover. 12 If a diagnostic test or error message indicates a specific memory module as faulty, swap or replace the module with a known working memory module. 13 To troubleshoot an unspecified faulty memory module, replace the memory module in the first DIMM socket with a module of the same type and capacity.
2 Update the iDRAC firmware to the latest version Troubleshooting memory module issues Following are guidelines to troubleshoot memory module issues: • Check for the LCD status messages. The server’s LCD screen displays text messages that indicate multi-bit and single-bit errors were detected in the System Event Log (SEL). • Run the Dell SupportAssist to identify a faulty DIMM in the memory modules or in the System Event Logs (SEL) logs.
Memory modes There are four types of memory modes: • Advanced ECC (Lockstep) • Memory Optimized (Independent Channel) • Memory Sparing • Memory Mirroring You can configure the memory modes in the System BIOS. For more information, see the Configuring memory modes section. Configuring memory modes About this task To configure memory modes, follow the procedure listed below: Steps 1 Turn on or restart your system. 2 Press F2 immediately after you see the following message: = System Setup.
• The system is not using latest BIOS • The memory modules are not seated properly on the DIMM sockets in the system. • The system has a faulty DIMM socket. Correctable and uncorrectable errors Correctable errors can be detected and corrected if the chipset and DIMM support this functionality. Correctable errors are generally single bit errors (SBE). Most of the Dell servers are capable of detecting and correcting single bit errors.
Correctable memory error rate exceeded for DIMM_A1 Issue: System encountered a Single bit failure error rate exceeded error at DIMM_1. NOTE: If the system is new, or has been moved recently, some components, may exhibit symptoms of intermittent connections. Components including the memory modules may be dislodged due to vibrations. All components should be re-seated before continuing with the troubleshooting. Resolution: 1 Update the system with latest BIOS and firmware. 2 Re-seat the memory modules.
Troubleshooting power supply units CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple repairs as authorized in your product documentation, or as directed by the online or telephone service and support team. Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions that are shipped with your product.
Troubleshooting RAID RAID configuration using PERC How to create RAID volumes If your system has one or more supported PERC RAID controllers with PERC 8 firmware or later versions, or software RAID controllers, use the RAID Configuration wizard to configure a virtual disk as the boot device. RAID volumes can be created using several interfaces. Here we are creating a RAID volume using Lifecycle controller. To configure RAID using LC, complete the following steps: 1 Start Lifecycle Controller.
4 Select Create New VD, and set the Secure VD option as Yes. Next step NOTE: To select or configure a non-SED, you must create an unsecured virtual disk. You can create an unsecured virtual disk even if there is a security key. From the Create New VD menu, select the Secure VD option as No. For more information about creating a virtual disk, see the Dell PowerEdge RAID Controller (PERC) User’s Guide at Dell.com/ storagecontrollermanuals.
Importing a foreign configuration using PERC About this task Some controllers enable you to import a foreign configuration so that virtual disks are not lost after moving the physical disks. You can import a foreign configuration only if it contains a virtual disk that is either in a Ready or Degraded state. In other words, all of the virtual disk data must be present, but if the virtual disk is using a redundant RAID level, the additional redundant data is not required.
NOTE: If there are Pending jobs in the queue, you can delete the pending jobs and assign the physical disk as hot spare. This allows the replaced physical disk to start the rebuilding process • Persistent Hot Spare — allows you to enable or disable the ability to have same system backplane or storage enclosure disk slots dedicated as hot spare slots. • Allow Replace Member with Revertible Hot Spare — allows you to enable or disable the option to copy the data form a hot spare disk to physical disk.
For PERC H700 and PERC H800 controllers, if any of the drives you selected is in the spun down state, the following message is displayed: The below listed physical drive(s) are in the Spun Down state. Executing this task on these drive(s) takes additional time, because the drive(s) need to spun up. The message displays the ID(s) of the spun down drive(s). NOTE: When the hard drive is in the spun down state, the drive is not in active use.
The Connector section of the page displays the connectors on the controller and the disks attached to each connector. Select the disks you want to include in the virtual disk. In this example, the controller has a single connector with five disks. • Physical disk 0:0 • Physical disk 0:1 • Physical disk 0:2 • Physical disk 0:3 • Physical disk 0:4 Physical Disks Selected The Physical Disks Selected section of the page displays the disks you have chosen. In this example, two disks are selected.
NOTE: • If a physical disk is receiving a SMART alert, it cannot be used in a virtual disk. • For a controller that has more than one channel, it may be possible to configure a virtual disk that is channelredundant. Depending on the RAID level you selected and the virtual disk size, this page displays the disks and connectors (channels or ports) available for configuring the virtual disk.
specification includes the Trusted Computing Group (TCG) protocol that allows for Trusted Platform Module (TPM) measurements and secure updates which USC leverages to its fullest extent. 1 Select the RAID configuration tab to create the virtual disk as the boot device. 2 On the View Current Configuration screen, you can view a list of current controllers and virtual disks installed in the system.
Figure 15. Flowchart of Unified Server Configurator’s RAID configuration process 5 The Express option selects the appropriate disks depending upon the RAID type selected for virtual disk creation. The Summary screen is displayed. You can review the choices selected during the Express wizard. 6 Click Finish to create the virtual disk to be used for operating system installation. 7 The Advanced option takes you to a series of more screens. Select the RAID type on the Basic Settings screen.
Downloading and installing the RAID controller log export by using PERCCLI tool on ESXi hosts on Dell’s 13th generation of PowerEdge servers To export information about the status of the RAID controller and its attached hard drives, you can use the PERCCLI tool. To download and install the RAID controller log export by using PERCCLI tool on ESXi hosts on Dell’s 13th generation of PowerEdge servers, perform the following steps: 1 Download the latest verison of PERCCLI for ESX tool from www.dell.com/support.
Figure 18. Configuration tab a In the Services properties window, select SSH (1), and then click Options...
Figure 19. Open SSH Options b 82 In the SSH Options window, click Start (1), and then click OK (2) to activate the service.
Figure 20. Start SSH Service 4 To unzip vmware-esx-perccli, open an SSH connection via PUTTY and run the command: unzip /vmfs/volumes/datastore1/ vmware-esx-perccli-1.05.08.zip PUTTY is a free and open-source terminal emulator, serial console and network file transfer application. It supports several network protocols, including SCP, SSH, Telnet, rlogin, and raw socket connection. You can download it from Google. The files vmware-esxperccli-1.05.08.vib and Readme.
The log is created with the name: #### MegaSAS.log Figure 22. Log creation 8 Copy MegaSAS.log to datastore by using command:cp /opt/lsi/perccli/MegaSAS.log /vmfs/volumes/datastore1/ 9 Copy the file to desktop with Datastore Browser. Figure 23. Log file in Datastore Browser Now the logs are exported on ESXi hosts on the Dell 13th generation PowerEdge servers.
Steps 1 Launch the Lifecycle Controller home page, click Hardware Configuration, and then click Configuration Wizards. The Configuration Wizards window is displayed. 2 In Storage Configuration Wizards, click RAID Configuration. 3 Select the RAID controller to view its current virtual disk configuration and disk attributes. Click Next. 4 Select the RAID level for the virtual disk that you want to create and click Next.
Controller Starting RAID Level PERC H330 Adapter, PERC RAID 1 H330 Mini Monolithic, PERC H330 Mini Blades, PERC H330 Embedded, PERC H730 Adapter, PERC H730 Mini Monolithic, RAID 5 PERC H730 Mini Blades, PERC H730P Adapter, PERC H730P RAID 5 Mini Monolithic, PERC H730P Mini Blades, PERC H730P Slim, PERC H830 Adapter, PERC FD33xD/FD33xS RAID 6 Target RAID Level Comments RAID 5, RAID 6 Add at least one additional disk. RAID 6 requires a minimum of 4 disks.
NOTE: Each physical disk in the RAID1 configuration should be replaced one by one. Thumb rules for RAID configuration The following are the thumb rules for RAID configuration: • Both drives should be of same size. • Both drives should be of same type. • Both drives should run at same speed. Reconfiguring or migrating virtual disks About this task Reconfiguring or migrating a virtual disk (VDs) enables you to increase the capacity or change the RAID level of the virtual disk.
Controller Starting RAID Level Target RAID Level Comments Monolithic, PERC H810 Adapter, PERC H330 Adapter, PERC H330 Mini Monolithic, PERC H330 Mini Blades, PERC H330 Embedded, PERC H730 Adapter, PERC H730 Mini Monolithic, PERC H730 Mini Blades, PERC H730P Adapter, PERC H730P Mini Monolithic, PERC H730P Mini Blades, PERC H730P Slim, PERC H830 Adapter, PERC FD33xD/FD33xS RAID 1 RAID 0 With or without adding additional disks RAID 1 RAID 5, RAID 6 Add at least one additional disk.
The Foreign Configuration Preview page provides a preview of the foreign disks and enables you to perform operations such as, importing, recovering, or clearing the foreign disks. You can also import or clear a locked foreign configuration. If any foreign configurations that are locked by using Local Key manager (LKM) are detected, the associated Encryption Key Identifier is displayed prompting you to provide the corresponding passphrase to unlock the drives.
Property Definition • • Incompatible physical disks — Configuration on the physical disks is not recognized by the RAID firmware. Orphan drive — A physical disk in the foreign configuration has configuration information that matches another physical disk that is already a part of an array (either a foreign or a native array). Layout Displays the RAID level of the foreign configuration. Remarks Provides information about the foreign virtual disk.
– Background initialization – Check consistency In addition, the Patrol Read is suspended during heavy I/O activity and resumes when the I/O is finished. To set Patrol Read mode, select the desired Patrol Read Mode option. The options available are: • • • Auto — Initiates the Patrol Read task. After the task is complete, it automatically runs again within a specified period.
Pausing a Check Consistency The Pause Check Consistency task pauses a check consistency while it is in progress. NOTE: The Pause Check Consistency task updates the virtual disk State property to Resynching Paused immediately. The Progress property may continue to increment for up to three seconds. This time delay occurs because the polling task may take up to three seconds to query the task information and update the display.
When a rebuild is able to restore all data except data from damaged portions of the disk, it indicates successful completion while also generating alert 2163. For information about the event and error messages generated by the system firmware and agents that monitor system components, see the Error Code Lookup page, at qrl.dell.com The rebuild may also report sense key errors.
Problems associated with using the same physical disks for both redundant and nonredundant virtual disks When creating virtual disks, you should avoid using the same physical disks for both redundant and nonredundant virtual disks. This applies to all controllers. Using the same physical disks for both redundant and nonredundant virtual disks can result in unexpected behavior including data loss.
Chassis infrastructure firmware: http://www.dell.com/support/home/drivers/DriversDetails?productCode=poweredge-vrtx&driverId=CPMVM SPERC firmware: http://www.dell.com/support/home/drivers/DriversDetails?productCode=poweredge-vrtx&driverId=THVJ9 SPERC driver: http://www.dell.com/support/home/drivers/DriversDetails?productCode=poweredge-vrtx&driverId=120MK Reconfiguring the RAID level and virtual disks An online virtual disk can be reconfigured in ways that expand its capacity and/or changes its RAID level.
The Event Message Reference contains the error and event information generated by firmware and other agents that monitor system components. These events might be logged and displayed on one of the system management consoles, or both logged and displayed. NOTE: To run the Embedded System Diagnostics (also known as Enhanced Pre-Boot System Assessment): 1 As the system boots, press F11. 2 Use the up-and-down arrows keys to select System Utilities > Launch Dell Diagnostics.
Additional information for troubleshooting memory or battery errors on the PERC controller A RAID Controller error message is displayed during POST to indicate that the controller's cache does not contain all of the expected information, or it contains data destined for a hard drive that cannot be or has not been written to the drive.
Slicing Configuring multiple RAID arrays across the same set of disks is called Slicing. RAID puncture A RAID puncture is a feature of Dell PowerEdge RAID Controller (PERC) designed to allow the controller to restore the redundancy of the array despite the loss of data caused by a double fault condition. Another name for a RAID puncture is rebuild with errors.
A Check Consistency performed after a RAID puncture is induced will not resolve the issue. This is why it is very important to perform a Check Consistency on a regular basis. It becomes especially important prior to replacing drives, when possible. The array must be in an optimal state to perform the Check Consistency.
Preventing problems before they happen and solving punctures after they occur Dell's RAID controllers contain a number of features to prevent many types of problems and to handle a variety of errors that do occur. The primary job of a RAID controller is to preserve the integrity of the data contained on its array(s). Even in the more extreme cases of damage (such as punctures), the array's data is often available and the server can remain in production.
5 Server management software issues This section helps to manage software issues related to the server management.
• • Express — iDRAC Express is the standard, default offering that is part of the base configuration for 600 series and above. It does not require license installation, back up, or license management. Express offers embedded tools, console integration, and simplified remote access. – Express for Blades — For the 12th and 13th generation blade servers, there is a new Express for Blades iDRAC version.
shipping, but it also allows for an upgrade if available. If Enterprise is ordered from Dell at point of sale on 200-500 series servers, the necessary hardware is installed in the factory. This piece of hardware is the Dell Ports card, which sits in a PCIe slot and has the RJ-45 connector for the Dedicated NIC and the slot for the vFlash SD card. For more information on the iDRAC licensing feature, see En.community.dell.
System time zone is not synchronized You can configure the time zone on iDRAC and synchronize the iDRAC time using Network Time Protocol (NTP) instead of BIOS or host system times. You must have Configure privilege to configure time zone or NTP settings. About this task To configure time zone and NTP using iDRAC web interface: Steps 1 Go to Overview > iDRAC Settings > Properties > Settings. The Time zone and NTP page is displayed.
Steps 1 Launch the Lifecycle Controller. 2 On the left pane, click Settings. 3 On the Settings pane, click Network Settings. 4 From the NIC Card drop-down menu, select the NIC port that you want to configure. 5 From the IPV4 Network Settings→ IP Address Source drop-down menu, select one of the following options: NOTE: You can use only one NIC at a time to communicate with the network. • No Configuration—indicates that the NIC must not be configured.
A global hot spare is an unused backup disk that is part of the disk group. Hot spares remain in standby mode. When a physical disk that is used in a virtual disk fails, the assigned hot spare is activated to replace the failed physical disk without interrupting the system or requiring your intervention. When a hot spare is activated, it rebuilds the data for all redundant virtual disks that were using the failed physical disk.
The RAID configuration is applied to the virtual disks. To create virtual disks using RAID configuration at the preferred RAID level, watch the Dell Lifecycle Controller RAID configuration video on YouTube https://www.youtube.com/watch?v=JFPmeHNENV4 Foreign drivers on physical disk Importing Foreign Configurations NOTE: To make the foreign configuration effective, ensure that you have installed the latest version of Server Administrator.
To include a Foreign disk in the volume, you can clear the foreign configuration and reset the controller configuration. Clearing the foreign configuration 1 On the iDRAC9 web interface, go to Configuration > Storage Configuration > Controller Configuration. The Controller Configuration page is displayed. 2 In the Foreign Configuration section, from the Controller drop-down menu, select the controller for which you want to clear the foreign configuration.
Which are the operating systems supported on Dell EMC PowerEdge servers Dell EMC PowerEdge servers are designed to work with many operating systems and virtualization which may include but not limited to: 1 Canonical Ubuntu LTS 2 Citrix XenServer 3 Microsoft Windows Server with Hyper-V 4 Red Hat Enterprise Linux 5 SUSE Linux Enterprise Server 6 VMware ESXi For information about supported operating systems on specific PowerEdge servers, see Dell.
plugins such as Virtual Console with this level of encryption. For information about installing the policy files, see the documentation for Java. • While launching the Virtual Console using Java plug-in, you may occasionally see a Java compilation error. To resolve this, go to Java control panel > General > Network Settings and select Direct Connection. How to specify language and keyboard type To specify the language and keyboard type using Lifecycle Controller: • Launch the Lifecycle Controller.
example, the Explorer shell is not installed as part of a Windows Server or Hyper-V Server installation. Instead, the default user interface for a Windows Server or Hyper-V Server installation is the command prompt. NOTE: On Windows client operating systems, to install the systems management software successfully, log in using an account which belongs to the Administrators group and must run the setup.exe using the option Run as administrator from the right-click menu.
The TRIM/UNMAP command must be supported by the operating system, the drive, and the controller in order to work. The TRIM/ UNMAP command could result in higher SSD performance from both the reduced data needed to be rewritten during garbage collection and the higher free space resulting on the drive. Dell enterprise drives which are currently shipped have high performance and endurance so they do not yet support these commands even if the OS supports them.
increase and decrease Remote Desktop Web Access, Gateway, Connection Broker and Session Host servers at will. You can use Remote Desktop Connection Broker to distribute workloads. Active Directory based authentication provides a highly secure environment. For more information, see Build and Deploy Remote Desktop Services NOTE: Provide the correct Microsoft Windows version while purchasing the license.
6 Troubleshooting operating system issues This section helps you troubleshoot operating system issues in your system. NOTE: If the problem persists, contact Dell Technical Support for further assistance.
Figure 25. Blue screen of death 2 Run the PSA/ePSA diagnostics. For more information, see PSA/ePSA Diagnostics. 3 If the diagnostics pass and the issue persists, identify the stage in which the blue screen error occurs. 4 If the BSOD occurs during the boot process, check for minimum to POST components. For more information, see Troubleshooting a No POST situation. If the issue persists, call Dell Technical Support.
• Use the bcdedit utility to view or modify the boot configuration database (BCD). For more information, see https:// technet.microsoft.com/en-us/library/cc731662.aspx. NOTE: For additional recovery console commands, see https://support.microsoft.com/en-us/kb/326215. NOTE: For more troubleshooting steps, see https://support.microsoft.com/en-us/kb/325375. 4 From within the safe mode or DSRM, use the msconfig utility to disable startup applications and non-Microsoft services on the server.
No POST issues in iDRAC This section provides details on troubleshooting iDRAC issues. “First Boot Device cannot be set” error message is displayed when configuring a boot device during POST. Description The error message “First Boot Device cannot be set. Either the system BIOS is out-of-date, or the server needs a reboot for the settings to take effect” displays in the POST mode.
NOTE: When removing or replacing parts, always turn off the server, remove the power cord, and wait for ten seconds for the static flea power to drain. Reconnect the power cord, wait for a minute and turn on the server. This allows time for the Baseboard Management Controller (BMC) to power up. Error messages may not be reported correctly if the static flea power is not fully drained. For more information on removing and installing hardware components, see your system’s Owner’s Manual at Dell.
Steps 1 Login to a SharePoint Online site in the browser. 2 Click on the OneDrive link in the header at the top of an Office 365 page. 3 Сopy the site URL from the browser's address line. 4 Open the application and specify the OneDrive URL in the Connect to SharePoint wizard. NOTE: Before connecting the OneDrive site of another user, make sure the OneDrive has been provisioned (i.e.
12 Click Next to continue. 13 On the screen, click Install Now. The Operating System Install screen is displayed. 14 On the Operating System Install screen, select the operating system, and then click Next. The License terms window is displayed. 15 Read the license agreement information. If you agree with all the information, select I accept the license terms and click Next.
NOTE: In Windows Server Essentials edition, on clicking Install Now, the License terms window is displayed. 7 On the Select the operating system you want to install screen, select the operating system from the available list, and click Next. The license terms window is displayed. 8 Read through the license agreement information. If you agree with all the information, select I accept the license terms, and then click Next.
Why are the USB keyboard and mouse not detected during the Windows Server 2008 R2 SP1 installation Cause This issue occurs because Windows Server 2008 R2 SP1 does not have native USB 3.0 driver support. Resolution 1 In System Setup, ensure that the USB 3.0 option on the Integrated Device Settings screen is set to Disable. NOTE: By default, the USB 3.0 option is disabled. If enabled, the operating system fails to detect the USB devices such as keyboard, mouse, and USB DVD.
Why does Windows Server 2008 R2 SP1 display a blank screen in UEFI mode after installation After installing Windows Server 2008 R2 SP1 in the UEFI mode, either using Lifecycle Controller (LC) or manually, may display a blank screen while booting to the OS. Cause This issue occurs because the Windows Server 2008 R2 does not support Graphics Output Protocol (GOP). Resolution In the BIOS configuration utility, ensure that Load Legacy Option Rom on the Miscellaneous Settings screen is set to Enable.
2 Ensure that all configurations and data is backed up prior to updating BIOS or Firmware. 3 Update latest BIOS, firmware and related drivers. 4 Perform general hardware diagnostic to ensure system health. 5 If the issue persists, call Dell Technical Support for assistance. .
Install Microsoft Hyper-V Server 2012 R2 with the Internal Dual SD module Prerequisites • Dell EMC 13th generation of PowerEdge servers with Dual 8GB or 16GB SD cards in Mirror mode installed in the Internal Dual SD module (IDSDM). • Set the Boot Mode to Bios and not UEFI. • Download the Hyper-V Server 2012 R2 ISO from Microsoft. About this task To install Microsoft Hyper-V server 2012 R2 using IDSM. Steps 1 Boot to the F2 System Setup to verify USB Mode is set to USB 3.
Rebooting an ESXi host About this task Before shutting down or rebooting the ESXi hosts, ensure that the hosts are in the Maintenance mode. Powering off a managed host, disconnects it from vCenter Server, but does not remove it from the inventory. To Reboot, perform the following steps. Steps 1 Shut down all virtual machines running on the ESXi host. 2 Select the ESXi host you want to shut down. 3 From the main menu or right-click the main menu, select Reboot or Shut Down.
Steps 1 Start the vSphere CLI. 2 Run the vicfg-cfgbackup command with the -s flag to save the host configuration to the specified backup filename: vicfg-cfgbackup --server< ESXi-host-ip> -- portnumber --protocol --username root --password root_password [-s The -portnumber and -protocol options are optional. If you exclude them, the defaults are port 443 and protocol HTTPS.
Symptoms Dell PowerEdge Express Flash NVMe PCIe SSD device is not detected during hot-plug in ESXi 6.0 Description When the Express Flash NVMe PCIe SSD namespace is set as Offline and hot-plug operation is performed in the same slot, the SSD does not initialize and is not detected. This is a known issue. Perform one of the following steps: 1 Reconnect the drive to a different drive slot, if available. 2 Restart the sfcbd-watchdog service and reinsert the drive.
Table 22. Installation of OS through LC and various methods Slno Video Description Location 1. Launching Lifecycle Controller Launching LC 2. Dell Lifecycle Controller - Firmware Rollback LC Firmware Rollback 3. Dell Lifecycle Controller - Firmware Update Using FTP Server LC Firmware Update Using FTP Server 4. Dell Lifecycle Controller - Firmware Update Using Network Share: CIFS (Common Internet File System) LC - Firmware Update Using CIFS 5.
25. Deploying an Operating System using Lifecycle Controller – Manual Install method Deploying an OS using LC – Manual Install method 26.
7 Getting help Topics: • Contacting Dell EMC • Downloading the drivers and firmware • Locating Service Tag of your system Contacting Dell EMC Dell EMC provides several online and telephone based support and service options. If you do not have an active internet connection, you can find contact information about your purchase invoice, packing slip, bill, or Dell EMC product catalog. Availability varies by country and product, and some services may not be available in your area.
Locating Service Tag of your system Your system is identified by a unique Express Service Code and Service Tag number. The Express Service Code and Service Tag are found on the front of the system by pulling out the information tag. Alternatively, the information may be on a sticker on the chassis of the system. The mini Enterprise Service Tag (EST) is found on the back of the system. This information is used by Dell to route support calls to the appropriate personnel. Figure 26.