Dell EMC PowerEdge Servers Troubleshooting Guide August 2020 Rev.
Notes, cautions, and warnings NOTE: A NOTE indicates important information that helps you make better use of your product. CAUTION: A CAUTION indicates either potential damage to hardware or loss of data and tells you how to avoid the problem. WARNING: A WARNING indicates a potential for property damage, personal injury, or death. © 2017 - 2020 Dell Inc. or its subsidiaries. All rights reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or its subsidiaries.
Contents
Chapter 1: Introduction
  Audience
  Recommended tools
Troubleshooting processors
  Troubleshooting a CPU Machine Check error
Troubleshooting a storage controller
  OMSA flagging PERC driver
  RAID puncture
Troubleshooting thermal issue
Input/Output errors while reseating SAS IOM storage sled on hardware configurations
Migrating to OneDrive for Business using Dell Migration Suite for SharePoint
Windows
  Installing and reinstalling Microsoft Windows Server 2016
  FAQs
1 Introduction
Use this guide to learn how to identify and troubleshoot Dell PowerEdge server issues. In particular, this guide:
• Provides troubleshooting procedures for issues related to Server Operating System, Server Hardware, and Server Management Software.
• Provides an overview of diagnostic indicators and describes how to use the indicator codes to facilitate troubleshooting.
1. Click the documentation link that is provided in the Location column in the table.
2. Click the required product or product version.
NOTE: To locate the product name and model, see the front of your system.
3. On the Product Support page, click Manuals & documents.
• Using search engines:
○ Type the name and version of the document in the search box.
Table 1.
Table 1. Additional documentation resources for your system (continued) Task Document Location For information about partner programs enterprise systems management, see the OpenManage Connections Enterprise Systems Management documents. www.dell.com/openmanagemanuals
2 Diagnostic indicators
The diagnostic indicators on the system indicate operation and error status.
Table 2. Status LED indicators (continued) Icon Description Condition Corrective action PCIe indicator The indicator turns solid amber if a PCIe card experiences an error. Restart the system. Update any required drivers for the PCIe card. Reinstall the card. If the problem persists, see the Getting help section. NOTE: For more information about the supported PCIe cards, see the Expansion card installation guidelines section. System health and system ID indicator codes Figure 1.
Table 4. iDRAC Quick Sync 2 indicators (continued) Wireless indicator code Condition Corrective action Blinks white rapidly Indicates data transfer activity. If the indicator continues to blink indefinitely, see the Getting help section. Blinks white slowly Indicates that firmware update is in progress. If the indicator continues to blink indefinitely, see the Getting help section. Blinks white five times rapidly and then turns off Indicates that the iDRAC Quick Sync 2 feature is disabled.
Table 6. NIC indicators (continued) Status Condition Link indicator is green and activity indicator is blinking green The NIC is connected to a valid network at its maximum port speed and data is being sent or received. Link indicator is amber and activity indicator is blinking green The NIC is connected to a valid network at less than its maximum port speed and data is being sent or received.
Table 7. AC PSU status indicator (continued) Power indicator codes Condition CAUTION: If two PSUs are used, they must be of the same type and have the same maximum output power. CAUTION: Combining AC and DC PSUs is not supported and triggers a mismatch. Figure 5. DC PSU status indicator 1. DC PSU status indicator Table 8. DC PSU status indicator codes Power indicator codes Condition Green A valid power source is connected to the PSU and the PSU is operational.
Figure 6. Non-redundant AC PSU status indicator and self-diagnostic button 1. Self-diagnostic button 2. AC PSU status indicator Table 9. Non-redundant AC PSU status indicator Power Indicator Pattern Condition Not lit Power is not connected or PSU is faulty. Green A valid power source is connected to the PSU and the PSU is operational. Hard drive indicator codes Each hard drive carrier has an activity LED indicator and a status LED indicator.
Table 10. Hard drive indicator codes Drive-status indicator pattern Condition Flashes green twice per second Identifying drive or preparing for removal. Off Drive ready for insertion or removal. NOTE: The drive status indicator remains off until all hard drives are initialized after the system is turned on. Drives are not ready for removal during this time. Flashes green, amber, and then turns off Predicted drive failure. Flashes amber four times per second Drive failed.
Internal dual SD module indicator codes The Internal Dual SD module (IDSDM) provides you with a redundant SD card solution. You can configure the IDSDM for storage or as the OS boot partition. The IDSDM card offers the following features: • • Dual card operation — maintains a mirrored configuration by using SD cards in both the slots and provides redundancy.
3 Running diagnostics
Running diagnostics helps you identify the cause of a system issue. The diagnostics test your system hardware without requiring additional equipment or risking data loss.
Table 13. PSA/ePSA error codes Error number (PSA and ePSA) Error message Description Steps PSA NA CPU - exception occurred An error occurred during the tests that may involve the system board. 1. Update to the latest BIOS version. 2. Repeat the PSA diagnostics. 3. If failure continues, contact Dell Technical Support CPU - machine check exception detected An error occurred during the tests that may involve the system board. 1. Update to the latest BIOS version. 2. Repeat the PSA diagnostics. 3.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps PSA NA Event Log The IPMI system event log is full for various reasons or logging has stopped because too many ECC errors have occurred. 1. Clear the IPMI system event log. 2. Repeat the PSA diagnostics. Event Log The event log(s) must be cleared before testing can continue. 1. Clear the system event log. 2. Repeat the PSA diagnostics.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps PSA Hard Drive - timeout waiting for Drive Self Test to complete The hard drive test did not complete the last test attempted. 1. Check www.dell.com/support for a firmware update for your hard drive. Update the firmware if one is available. 2. Reseat the drive, reseat the data cable and power connection at both ends if it is a desktop. 3.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps cable and reseat the power cable to the drive. Repeat the PSA diagnostics. If a replacement working hard drive is available, see if the working hard drive is detected by the system or try the suspect drive in a working system. 2. If you have an HDD, reconnect your hard disk drive (HDD) to the system board. 3. Update to the latest BIOS version. 4. Repeat the PSA diagnostics. 5.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps PSA 1000-0213 System board - CMOS battery failure detected An error occurred during the tests involving the CMOS battery, which maintains all the settings in the BIOS when there is no power to the system. On desktop systems this is an easily replaceable watch-size battery; some portable systems may have a replaceable battery too. 1. Update to the latest BIOS version. 2. Repeat the PSA diagnostics. 3.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps Technical Support to resolve the problem. PSA 2000-0232 ePSA 2000-0232 (Not used with UEFI BIOS) PSA 2000-0233 ePSA 2000-0233 (Not used with UEFI BIOS) PSA 1000-0234 ePSA 2000-0234 (Not used with UEFI BIOS) PSA 1000-0235 ePSA NA PSA 1000-0241 System board - the RTC did not generate periodic ticks An error occurred during the 1. Update to the latest BIOS tests that may involve the main version.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description ePSA 2000-0242 (Not used with UEFI BIOS) ePSA- System board - Interrupt controller - IRQ (d) - %s not detected memory error is detected, try memory modules individually. If there is no 2000-0123 memory error and diagnostics fail again after the BIOS is current, contact Technical Support to resolve the problem. 2. Repeat the PSA diagnostics. 3. If failure continues, contact Dell Technical Support.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps PSA NA Thermal - the (s) reading (dc) exceeds the thermal limit. The system board, heat sink, fan, or processor are failing the diagnostic tools. 1. Update to the latest BIOS version. 2. Check the logs, the fan and for any other signs of overheating. 3.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps and if the screens appear normal, click Yes PSA 1000-0326 ePSA 2000-0326 PSA NA ePSA 2000-0327 PSA NA ePSA 2000-0328 PSA NA ePSA 2000-0331 PSA NA ePSA 2000-0332 PSA 1000-0333 ePSA 2000-0333 LCD panel - unable to turn lamp on or off The backlight lamp was not able 1. Update to the latest BIOS. to be turned on or off during the 2.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps PSA 1000-0334 ePSA 2000-0334 Video - user reported the patterns were not displayed correctly You may get this error if you answered No to the color test instead of Yes. If you were able to clearly see both the vertical and horizontal color bars without distortion, lines, or color problems, re-run the diagnostic and if the bar appears normal, click Yes. 1.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps 5. If failure continues, contact Dell Technical Support PSA NA ePSA 2000-0415 PSA NA ePSA 2000-0511 PSA NA ePSA 2000-0512 PSA NA ePSA 2000-0620 PSA NA ePSA 2000-0621 PSA NA ePSA 2000-8001 Cables - Check the following cables, jumper, connection, or sensors: [s] Normally, the cable involved in 1. Update to the latest BIOS the error (LCD LVDS CABLE for version. example) is indicated in the error 2.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps PSA NA ePSA 2000-8002 BIOS - No BIOS support for SMI interface function(x) or Sensor [x] exceeded thermal zone [d]. Peak zone was [d]. The motherboard BIOS revision may not be current. Update the BIOS to the most current version and the issue should resolve. 1. Update to the latest BIOS version. 2. Repeat the PSA diagnostics. 3.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps PSA NA BIOS - Get/Set inverter mode function error. Vendor: [s] Revision: [d] The motherboard BIOS revision may not be current. Update the BIOS to the most current version and the issue should resolve. 1. Update to the latest BIOS version. 2. Repeat the PSA diagnostics. 3. If failure continues, contact Dell Technical Support BIOS - Set lamp off function error.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps 4. If failure continues, contact Dell Technical Support PSA NA ePSA 2000-8017 PSA NA ePSA 2000-8018 PSA NA ePSA 2000-8019 PSA NA ePSA 2000-8020 PSA NA ePSA 2000-8115 PSA NA ePSA 2000-8154 PSA NA ePSA 2000-8155 PSA NA ePSA 2000-8156 PSA NA ePSA 2000-8157 BIOS - Battery - BIOS has no support for battery health This optional feature may not be 1.
Table 13. PSA/ePSA error codes (continued) Error number (PSA and ePSA) Error message Description Steps PSA NA Backplane - [DRIVE] Drive [d] incorrect status = [x], [s] The string indicates the backplane, expander, or removable hard drive is reporting an incorrect status. 1. Reseat the drives/cables/ connections. 2. Repeat the PSA diagnostics. 3.
Debugging mini crash dump files using WinDbg in Windows operating system
Prerequisites
1. Click Start > Control Panel > System.
Figure 9. Opening the Systems page
2. In the System page, click Advanced system settings in the left pane.
Figure 10. Advanced system settings page 3. In the System Properties window, click Settings under the Startup and Recovery section. Figure 11. System Properties window 4. In the Startup and Recovery window, System failure section, do the following: a. Select Write an event to the system log to ensure that the minidump file is created in the event of a system failure.
b. Select Automatically restart to restart the system after a blue screen of death (BSOD) occurs. NOTE: For servers, it is recommended that you select the Automatically restart option so that the server can function if the error is not critical. c. Verify that the Overwrite any existing file option is not selected. This ensures that a record of failures is maintained if there are repeated occurrences of system failures. Figure 12. Startup and Recovery window 5.
b. MODULE_NAME c. IMAGE_NAME 11. Call Dell Technical Support for further assistance.
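Once WinDbg has analyzed the minidump, the fields listed above can be pulled out of the saved analysis text before you call support. The following is a minimal sketch in Python, assuming the analysis output has been saved as plain text with one `FIELD: value` pair per line; the sample text and the driver name `examplenic` are made up for illustration, and the function name `extract_fields` is hypothetical.

```python
import re

# Field names taken from the procedure above; everything else here is illustrative.
FIELDS = ("MODULE_NAME", "IMAGE_NAME")


def extract_fields(analysis_text, fields=FIELDS):
    """Return {field: value} for every 'FIELD: value' line found in the text."""
    found = {}
    for field in fields:
        # Match the field at the start of a line and capture the trimmed value.
        pattern = rf"^{re.escape(field)}:[ \t]*(\S.*?)[ \t]*$"
        match = re.search(pattern, analysis_text, re.MULTILINE)
        if match:
            found[field] = match.group(1)
    return found


# Made-up sample of saved analysis output (not taken from a real crash dump).
sample = """\
BUGCHECK_STR:  0xD1
MODULE_NAME: examplenic
IMAGE_NAME:  examplenic.sys
"""

if __name__ == "__main__":
    for name, value in extract_fields(sample).items():
        print(f"{name} = {value}")
```

Fields that are missing from the text are simply left out of the result, so an empty dictionary means that none of the expected lines were present.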
4 Troubleshooting hardware issues This section helps you troubleshoot hardware issues in your system. NOTE: If the issue still persists, contact Dell Technical Support for assistance.
9. To enter UEFI, press F2.
10. Verify that all installed drives are detected in the controller BIOS. If they are not detected, refer to the Troubleshooting Hard drive issues section.
11. Ensure that in BIOS the RAID setting is set to RAID mode for SATA drives.
12. Save the setting, and reboot the server.
13. If the issue persists, contact Dell Technical Support for assistance.
8. If your keyboard is functioning, enter System Setup and verify that all USB ports are enabled on the Integrated Devices screen. If your keyboard is not functioning, use remote access to enable or disable the USB options.
9. If the system is not accessible, reset the NVRAM_CLR jumper inside your system and restore the BIOS to the default settings. See the System board jumper setting section.
10. In the iDRAC Settings Utility, ensure that USB Management Port Mode is configured as Automatic or Standard OS Use.
Steps 1. Turn off the system and any peripheral devices that are connected to the serial port. 2. Swap the serial interface cable with a known working cable, and turn on the system and the serial device. If the problem is resolved, replace the interface cable with a known working cable. 3. Turn off the system and the serial device, and swap the serial device with a compatible device. 4. Turn on the system and the serial device. Next steps If the problem persists, see the Getting help section.
support team. Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions that are shipped with your product. Steps 1. Turn off the system and attached peripherals, and disconnect the system from the electrical outlet. 2. Remove the system cover. 3.
• processor(s) and heat sink(s)
• memory modules
• drive carriers or cage
4. Ensure that all cables are properly connected.
5. Install the system cover.
6. Run the appropriate diagnostic test. For more information, see the Using system diagnostics section.
Next steps
If the problem persists, see the Getting help section.
Troubleshooting the system battery
Prerequisites
CAUTION: Many repairs may only be done by a certified service technician.
From F2 System Setup:
1. Select iDRAC Settings > Thermal, and set a higher fan speed from the fan speed offset or minimum fan speed.
From RACADM commands:
1. Run the command: racadm help system.thermalsettings
For more information, see the Integrated Dell Remote Access Controller User’s Guide at www.dell.com/poweredgemanuals.
Troubleshooting cooling fans
Prerequisites
CAUTION: Many repairs may only be done by a certified service technician.
Troubleshooting a micro SD card
Prerequisites
NOTE: Certain micro SD cards have a physical write-protect switch on the card. If the write-protect switch is turned on, the micro SD card is not writable.
NOTE: IDSDM and vFlash slots are not hot-pluggable.
Steps
1. Enter System Setup, and ensure that the Internal SD Card Port is enabled.
2. Turn off the system, including any attached peripherals, and disconnect the system from the electrical outlet.
3. Remove the system cover.
9. Remove all expansion cards installed in the system.
10. Install the system cover.
11. Run the appropriate diagnostic test. See the Using system diagnostics section. If the tests fail, see the Getting help section.
12. For each expansion card you removed in step 9, perform the following steps:
a. Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.
b. Remove the system cover.
c. Reinstall one of the expansion cards.
d. Install the system cover.
Troubleshooting a storage controller CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple repairs as authorized in your product documentation, or as directed by the online or telephone service and support team. Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions that are shipped with your product.
NOTE: Before you import the foreign configuration, review the configuration on the screen to ensure that it is the end result that you require.
You can use the Foreign Config screen to manage foreign configurations in the following cases:
• All the physical disks in a configuration are removed and re-inserted.
• Some of the physical disks in a configuration are removed and re-inserted.
• All the physical disks in a virtual disk are removed, but at different times, and then re-inserted.
Importing or clearing foreign configurations using the VD Mgmt menu
When a foreign configuration exists, the BIOS screen displays the message Foreign configuration(s) found on adapter. In addition, a foreign configuration is displayed on the right side of the Ctrl Mgmt screen.
About this task
You can use the VD Mgmt menu to import the existing configuration to the RAID controller or clear the existing configuration.
• SAS 6ir controllers support speeds of up to 3 Gbps. For more information, refer to the SAS 6ir product documentation.
Hard drives cannot be added to the existing RAID 10 array
Create a new RAID 1 array or RAID 50 array, and ensure that the virtual disk has the maximum partition space. For information about how to configure a RAID array, see RAID configuration using OpenManage Server Administrator, RAID configuration using Unified Server Configurator, or Configuring RAID by using Lifecycle Controller.
4. Verify the Dell Update Package by using its signature file, [model]_BIOS_LX_[version].BIN.sign.
There are two methods to update the PERC firmware. The steps for each method follow.
Method 1: Windows update package.
1. Download the BIOS update package at: Dell.com/support.
2. When the File Download window appears, click Save to save the file to your hard drive.
3. Browse to the location where you downloaded the file and double-click the new file.
Creating non-RAID disks for storage purposes
About this task
By default, all the disks are in the RAID-capable unconfigured state. You can convert the RAID-capable disks to non-RAID disks by using either the BIOS configuration utility or the UEFI/HII RAID configuration utility. To create a non-RAID disk, perform the following steps in the BIOS Configuration Utility ( ):
Steps
1. On the Virtual Disk Mgmnt screen, use the arrow keys to highlight the PERC 9 adapter or Disk Group #.
2. Press .
Managing preserved cache About this task If a virtual disk goes offline or is deleted because of missing physical disks, the controller preserves the dirty cache from the virtual disk. The preserved dirty cache, known as pinned cache, is preserved until you import the virtual disk or discard the cache. NOTE: Certain operations, such as creating a new virtual disk, cannot be performed if preserved cache exists.
Steps
1. Shut down the system.
2. Disconnect all the power cables.
3. Press and hold the power button for 15 seconds.
4. Reconnect all the cables and turn the system on.
Results
Check the hardware details to ensure that the controller is working correctly.
Troubleshooting hard drives
Prerequisites
CAUTION: This troubleshooting procedure can erase data stored on the hard drive. Before you proceed, back up all files on the hard drive.
CAUTION: Many repairs may only be done by a certified service technician.
2. Reseat the cable at both ends.
3. Reseat the controller card.
4. Reseat the drives and ensure that all the drives are present in the system.
5. Turn on the system, enter the CTRL+R utility, and either import or clear the foreign configuration.
6. Press <F> at the prompt to import the foreign configuration.
7. Press <C> to enter the BIOS configuration utility.
Figure 14. PERC Configuration Utility PD Mgmt screen
The table shows the hard drive status.
Table 14. Hard drive status
Hard drive status   Description
Offline   Hard drive is not part of the RAID array.
Online    Hard drive is part of the RAID array.
Ready     Hard drive is ready to be a part of the RAID array.
FAQs
How to identify a hard drive failure?
Hard drive failures may occur because of logical, head, or mechanical failures. The following table describes the symptoms of failing hard drives:
Table 15.
4. For systems with LCD panel, check for the following error codes: Table 16. Hard drive error codes Error Code Error Message Description E1810 Hard drive fault. Hard drive has had a fault as determined by the SAS subsystem. E1811 Hard drive rebuild aborted. Drive has had its rebuild aborted. E1812 Hard drive removed. Drive has been removed from the system. 5. Check the hard drive status in SupportAssist. 6.
Drive timeout error Issue—Drive times out and the RAID controller displays the drive as failed. Corrective action—Update the hard drive firmware/PERC controller. For information about driver installation, see the driver installation section. For information about firmware installation, see the Firmware section. Drives not accessible Multiple physical disk errors in a single array typically indicate a failure in cabling or connection and could involve the loss of data.
Troubleshooting a tape backup unit Prerequisites CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple repairs as authorized in your product documentation, or as directed by the online or telephone service and support team. Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions that are shipped with your product. Steps 1. Use a different tape cartridge. 2.
Troubleshooting power source problems Steps 1. Press the power button to ensure that your system is turned on. If the power indicator does not glow when the power button is pressed, press the power button firmly. 2. Plug in another working power supply unit to ensure that the system board is not faulty. 3. Ensure that no loose connections exist. For example, loose power cables. 4. Ensure that the power source meets applicable standards. 5. Ensure that there are no short circuits. 6.
NOTE: BOSS-S1 controller is supported at RAID 1 level only. 5. Select the controller which you want to use and click Next. The Select RAID Level page is displayed. 6. Select the RAID level and click Next. The Select Physical Disks page is displayed. 7. Select the physical disk and click Next. The Virtual Disk Attributes page is displayed. 8. Select the virtual disk parameters and click Next. The Summary page is displayed. 9. To apply the RAID configuration, click Finish.
Table 17. Estimated rebuild rates RAID level Number of Hard Drives 7.
2. On the VD Mgmt screen, highlight the Controller #.
3. Press F2 to display the available actions.
4. Navigate to the Foreign Config option and press the right arrow key to display the available actions:
• Import
• Clear
NOTE: Ensure that your virtual disk has all the hard drives by verifying that there are no hard drives marked as Missing in the foreign view page and that all the disks are displayed as expected before importing them.
5.
Steps 1. On the upper-left corner of the Server Administrator page, expand Storage . 2. Click PERC Controller. 3. Click Virtual Disks. The Virtual Disk(s) on Controller page is displayed. 4. Click Go to the Create Virtual Disk Wizard. The Create Virtual Disk Wizard page is displayed. 5. Select the Express Wizard option and the RAID level from the drop-down menu. 6. Click Continue.
5. Select the Advanced Wizard option. 6. To make sure that only encrypted physical disks are used to create the virtual disk, select Yes from the Create Encrypted Virtual Disk drop-down list. The RAID levels are available for selection based on the number of encrypted physical disks. If you select No, the RAID levels are available based on the total number of physical disks present on the system. 7. Select the required RAID level from the drop-down menu. 8. Select Bus Protocol.
• Select the number of disks to create a single spanned virtual disk list box — Enables you to create a single span virtual disk with 22 or 26 physical drives for PERC controllers. This list box option is displayed only if you have selected RAID 10 in step 1 and the system has 22 or more physical drives. NOTE: Only physical disks that comply with the virtual disk parameters, selected in the Create Virtual Disk Wizard page are displayed. 11.
For PERC H700 and PERC H800 controllers, if any of the drives you selected is in the spun down state, the following message is displayed: The below listed physical drive(s) are in the spun down state. Executing this task on these drive(s) takes additional time, because the drive(s) need to spun up. The message displays the ID(s) of the spun down drive(s). The Create Virtual Disk Advanced Wizard - page displays a checkbox next to each physical disk that is suitable as a dedicated hot spare.
Figure 15. Flowchart of Unified Server Configurator’s RAID configuration process 5. The Express option selects the appropriate disks depending upon the RAID type selected for virtual disk creation. The Summary screen is displayed. You can review the choices selected during the Express wizard. 6. Click Finish to create the virtual disk to be used for operating system installation. 7. The Advanced option takes you to a series of more screens. Select the RAID type on the Basic Settings screen.
Downloading and installing the RAID controller log export by using PERCCLI tool on ESXi hosts on Dell’s 13th generation of PowerEdge servers
To export information about the status of the RAID controller and its attached hard drives, you can use the PERCCLI tool. To download and install the RAID controller log export by using PERCCLI tool on ESXi hosts on Dell’s 13th generation of PowerEdge servers, perform the following steps:
Steps
1. Download the latest version of PERCCLI for ESX tool from www.dell.
Figure 18. Configuration tab a. In the Services properties window, select SSH (1), and then click Options... (2) Figure 19. Open SSH Options b. In the SSH Options window, click Start (1), and then click OK (2) to activate the service.
Figure 20. Start SSH Service
4. To unzip vmware-esx-perccli, open an SSH connection via PuTTY and run the command: unzip /vmfs/volumes/datastore1/vmware-esx-perccli-1.05.08.zip
PuTTY is a free and open-source terminal emulator, serial console and network file transfer application. It supports several network protocols, including SCP, SSH, Telnet, rlogin, and raw socket connection. You can download it from Google. The files vmware-esx-perccli-1.05.08.vib and Readme.
Figure 22. Log creation
8. Copy MegaSAS.log to the datastore by using the command: cp /opt/lsi/perccli/MegaSAS.log /vmfs/volumes/datastore1/
9. Copy the file to the desktop with Datastore Browser.
Figure 23. Log file in Datastore Browser
Now the logs are exported on ESXi hosts on the Dell 13th generation PowerEdge servers.
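The unzip and copy commands in this procedure hard-code the datastore name (datastore1) and the PERCCLI package version (1.05.08). If your host uses a different datastore or a newer package, the paths change accordingly. The following minimal Python sketch only assembles the two command strings shown in the steps so that you can review them before typing them into the SSH session; the function name `perccli_commands` and its parameters are hypothetical, and the sketch does not run anything on the ESXi host.

```python
import shlex
from pathlib import PurePosixPath


def perccli_commands(datastore="datastore1", version="1.05.08"):
    """Build the unzip and log-copy command strings used in the steps above.

    The default values mirror the example in this guide; other values are
    assumptions that you must confirm on your own ESXi host.
    """
    volume = PurePosixPath("/vmfs/volumes") / datastore
    archive = volume / f"vmware-esx-perccli-{version}.zip"
    log_file = PurePosixPath("/opt/lsi/perccli/MegaSAS.log")
    return [
        shlex.join(["unzip", str(archive)]),
        # Trailing slash keeps the datastore directory as the copy target.
        shlex.join(["cp", str(log_file), f"{volume}/"]),
    ]


if __name__ == "__main__":
    for command in perccli_commands():
        print(command)
```

With the defaults, the sketch prints the same two commands that appear in steps 4 and 8. `shlex.join` adds shell quoting only when a datastore name contains characters such as spaces.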
3. Select the RAID controller to view its current virtual disk configuration and disk attributes. Click Next. 4. Select the RAID level for the virtual disk that you want to create and click Next. 5. On the Select Physical Disks screen, the default values for Protocol, Media Type, and Encryption capability are displayed. 6. Select the required physical disks that you want to include in the virtual disk, and then click Next. 7. On the Virtual Disk Attributes screen, type the virtual disk name.
Table 18. Possible scenarios for reconfiguring a virtual disk (continued) Controller Starting RAID Level Target RAID Level Adapter, PERC FD33xD/ FD33xS Comments RAID 6 requires a minimum of 4 disks.
• Both drives should be of the same type.
• Both drives should run at the same speed.
Reconfiguring or migrating virtual disks
About this task
Reconfiguring or migrating a virtual disk (VD) enables you to increase the capacity or change the RAID level of the virtual disk.
Table 19.
Foreign Configuration properties The following table describes the properties that are displayed on the PERC BIOS Configuration Utility Foreign Configuration screen for the Foreign Disks and Global Hot Spares. Table 20. Memory channels Property Definition Status These icons represent the severity or health of the storage component. • —Normal/OK • —Warning/Non-critical • —Critical/Failure/Error Name Displays the name of the foreign configuration and is available as a link.
Table 20. Memory channels (continued) Property Definition Dedicated Hot Spare Displays whether the foreign disk is a dedicated hot spare.
Based on the properties information, you can decide whether you want to import, recover, or clear the foreign configuration.
Viewing Patrol Read report
The patrol read report provides information on all the patrol reads performed on the controller in chronological order. It provides information such as last run time and result.
• Disabled — Prevents the Patrol Read task from running on the system. Check Consistency report The check consistency report provides information on all the consistency checks performed on the controller in a chronological order. It provides information such as last run time and result. If the consistency check fails, it provides the reason for the failure. Performing a Check Consistency The Check Consistency task verifies the accuracy of the redundant (parity) information.
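To illustrate what "verifying the redundant (parity) information" means, the following Python sketch checks a single stripe of a single-parity layout, where the parity block is the byte-wise XOR of the data blocks. This is a simplified teaching example under that assumption, not the controller's actual implementation; the function names and the sample bytes are made up.

```python
from functools import reduce


def xor_parity(blocks):
    """XOR equal-length data blocks byte by byte to produce the parity block."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def stripe_is_consistent(data_blocks, parity_block):
    """Return True when the stored parity matches the parity recomputed from data."""
    return xor_parity(data_blocks) == parity_block


# Made-up stripe: three two-byte data blocks and their computed parity.
data = [b"\x0f\xf0", b"\x33\x33", b"\x55\xaa"]
parity = xor_parity(data)

if __name__ == "__main__":
    print(stripe_is_consistent(data, parity))
    # A parity block with one flipped bit no longer matches the data.
    print(stripe_is_consistent(data, b"\x69\x68"))
```

A consistency check of this kind can detect that data and parity disagree, but by itself it cannot tell which block holds the wrong value.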
• The hot spare is unassigned from the virtual disk — This occurs on some controllers if the hot spare is assigned to more than one virtual disk and is being used to rebuild a failed physical disk for another virtual disk.
• The virtual disk includes failed or corrupt physical disks — This situation may generate alert 2083. For information on alert messages, see the Server Administrator Messages Reference Guide at Dell.com/support/home.
Steps
1. Back up your data.
2. Delete the virtual disk.
3. Create one or more virtual disks that are smaller than 1 TB.
4. Restore your data from backup.
Whether your Linux operating system limits the virtual disk size to 1 TB depends on the version of the operating system and any updates or modifications that you have implemented. For more information about your operating system, see your operating system documentation.
CMC firmware: http://www.dell.com/support/home/drivers/DriversDetails?productCode=poweredge-vrtx&driverId=6W6P1
Chassis infrastructure firmware: http://www.dell.com/support/home/drivers/DriversDetails?productCode=poweredge-vrtx&driverId=CPMVM
SPERC firmware: http://www.dell.com/support/home/drivers/DriversDetails?productCode=poweredge-vrtx&driverId=THVJ9
SPERC driver: http://www.dell.
Troubleshooting conditions that lead to error message
NOTE: Troubleshooting the associated events may also prevent the error message from occurring.
The error message can occur normally when one of the following conditions occurs:
• The OS indicates an abnormal shutdown.
• The OS indicates that an error occurred (a blue screen occurred in Windows).
• Spontaneous power loss condition.
A PERC battery that is suspected to have failed or has a warning symbol displayed in OpenManage Server Administrator should have a manual Learn Cycle performed. A Learn Cycle causes the battery to discharge and recharge, and restores the battery to a fully functional condition. In some cases, multiple Learn Cycle procedures may be required to restore the battery to an effectively charged state.
The advantage of puncturing an array is that the system stays available in production until the redundancy of the array is restored. The data in the affected stripe is lost whether the RAID puncture occurs or not. The primary disadvantage of this method is that while the array has a RAID puncture in it, uncorrectable errors will continue to be encountered whenever the impacted data (if any) is accessed. A RAID puncture can occur in the following three locations:
• In blank space that contains no data.
1. Discard Preserved Cache, if it exists.
2. Clear foreign configurations, if any.
3. Delete the array.
4. Shift the position of the drives by one. Move Disk 0 to slot 1, Disk 1 to slot 2, and Disk 2 to slot 0.
5. Recreate the array as desired.
6. Perform a Full Initialization of the array (not a Fast Initialization).
7. Perform a Check Consistency on the array. If the Check Consistency completes without errors, you can safely assume that the array is now healthy and the puncture is removed.
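The drive shift in step 4 follows a simple wrap-around pattern. The sketch below is illustrative only and generalizes the three-disk example to any disk count; the function name is hypothetical and is not part of any Dell tool.

```python
from typing import Dict


def rotate_slots(disk_count: int, shift: int = 1) -> Dict[int, int]:
    """Map each disk's current slot to its new slot, wrapping at the last slot."""
    if disk_count < 2:
        raise ValueError("need at least two disks to change positions")
    return {disk: (disk + shift) % disk_count for disk in range(disk_count)}


# Three-disk array from the step above: Disk 0 -> slot 1, Disk 1 -> slot 2, Disk 2 -> slot 0.
plan = rotate_slots(3)
```

Writing the mapping down before pulling drives makes it easier to confirm that every disk ends up in a different slot.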
shared storage 14G servers as cluster nodes with external Storage enclosures attached for shared storage
Prerequisites
CAUTION:
1. Back up the existing registry. For instructions, see the Microsoft article on backing up the registry: https://support.microsoft.com/en-in/help/322756/how-to-back-up-and-restore-the-registry-in-windows
2. Failing to enable these settings on Dell servers that are configured for Failover Cluster with Shared Storage may cause the Cluster Shared Volume to go into a failed state.
5 Server management software issues
This section helps you troubleshoot software issues related to server management.
○ Perpetual - This license is valid for the life of the product. It does not expire and never needs to be renewed. It must be bound to only one service tag at a time.
For more information on the iDRAC licensing feature, see En.community.dell.com/techcenter/extras/m/white_papers/20067892
How to activate license on iDRAC
You can manage your licenses by creating an account and accessing the License Management portal.
• To download a license, navigate to the license and click Get Key. The Deliver My License Key window is displayed.
• To download the license directly to your computer, select Download and then click Submit. Select Email if you want to receive the license key by email.
For more information on the iDRAC licensing feature, see En.community.dell.com/techcenter/extras/m/white_papers/20067892.
How to set up Auto Dedicated NIC feature
The Auto Dedicated NIC feature provides the option to automatically reroute the iDRAC management traffic in scenarios such as connecting a crash cart or reconfiguring network cables. When this feature is enabled, iDRAC automatically and dynamically detects a system's network mode. It senses the system's network cable configuration and checks if a cable is connected to the system's dedicated NIC port.
NOTE: While configuring a DHCP server with IPv6, the configuration fails if you disable forwarding or advertising options.
• Static IP—indicates that the NIC must be configured using a static IP. Type the IP Address Properties—IP Address, Subnet Mask, Default Gateway, and DNS Address. If you do not have this information, contact your network administrator.
7. Click Enabled, and type the VLAN ID and Priority under Lifecycle Controller VLAN Settings.
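For the Static IP option described above, it can help to sanity-check the IPv4 values before typing them in. The following is a minimal sketch using Python's standard ipaddress module; the function name and the checks it performs are assumptions for illustration and are not part of Lifecycle Controller.

```python
import ipaddress
from typing import List


def validate_static_ipv4(ip: str, netmask: str, gateway: str) -> List[str]:
    """Return a list of problems found; an empty list means the values agree."""
    try:
        interface = ipaddress.IPv4Interface(f"{ip}/{netmask}")
        gateway_address = ipaddress.IPv4Address(gateway)
    except ValueError as exc:  # malformed address or subnet mask
        return [f"invalid value: {exc}"]

    problems = []
    if gateway_address not in interface.network:
        problems.append("default gateway is outside the NIC's subnet")
    if gateway_address == interface.ip:
        problems.append("default gateway is the same as the NIC's own address")
    return problems
```

For example, `validate_static_ipv4("192.168.1.10", "255.255.255.0", "192.168.1.1")` returns an empty list, while a gateway on a different subnet produces one problem message.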
Storage Health The Storage Dashboard displays the combined status for each controller and lower-level storage components. For example, if the health of the storage system has been compromised due to a degraded enclosure, both the enclosure Health and the controller severity on the Storage Dashboard display a yellow exclamation mark to indicate a Warning severity.
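The combined status described above can be pictured as a roll-up in which a component reports the worst severity found among itself and its lower-level components. The sketch below assumes that worst-of rule and three severity levels; the names are hypothetical and do not represent the Server Administrator implementation.

```python
from enum import IntEnum
from typing import Iterable


class Severity(IntEnum):
    OK = 0
    WARNING = 1
    CRITICAL = 2


def combined_status(own: Severity, children: Iterable[Severity]) -> Severity:
    """Return the worst severity among a component and its lower-level components."""
    return max([own, *children])


# A healthy controller with one degraded enclosure rolls up to WARNING.
controller_status = combined_status(Severity.OK, [Severity.OK, Severity.WARNING])
```

Under this assumption, a single degraded enclosure is enough to raise the controller's displayed severity, which matches the example in the paragraph above.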
The Import Foreign Configuration task is only displayed when the controller has detected a foreign configuration. You can also identify whether a physical disk contains a foreign configuration (virtual disk or hot spare) by checking the physical disk state. If the physical disk state is Foreign, then the physical disk contains all or some portion of a virtual disk or has a hot-spare assignment.
• Using the Lifecycle Controller Platform Update option—F10.
• Using the Update and Rollback feature in the iDRAC web GUI.
• Using the WS-MAN based one to many Remote Update method—Remote Enablement
NOTE: Legacy DOS-based BIOS update utility is no longer supported.
For detailed information about different methods of updating BIOS see En.community.dell.
JAVA support in iDRAC
About this task
For accessing iDRAC and some of its features, you need to install and configure the supported version of Java. The following are some of the key considerations:
• Oracle version of Java is supported.
• Java version 8 or later is required.
• If you are using Firefox or Internet Explorer, and want to use the Java viewer to access iDRAC, configure the browser to use Java plug-in.
NOTE: On a 64-bit operating system, both 32-bit and 64-bit JRE versions are supported.
Installing Managed System Software On Microsoft Windows Operating Systems
On Microsoft Windows, an autorun utility is displayed when you insert the Dell EMC OpenManage Systems Management Tools and Documentation software. This utility allows you to choose the systems management software you want to install on the system. If the autorun program does not start automatically, use the autorun program from the DVD root or the setup program in the SYSMGMT\srvadmin\windows directory on the Dell EMC OpenManage Syst
PowerEdge T130, R230, R330, and T330 servers may report a critical error during scheduled warm reboots
PowerEdge T130, R230, R330, and T330 servers may report a critical error during scheduled warm reboots and display error messages in the hardware System Event Log and the Lifecycle Controller logs. Dell EMC recommends that you download and install the latest BIOS, drivers, and systems management firmware on your system. For more information, see the Downloading the drivers and firmware topic.
Using the WMI protocol for discovery and inventory is recommended. The difference between the inventory information fetched using the WMI protocol and the SNMP protocol is specified. Hardware logs can be fetched only by using the WMI protocol.
• For discovery and inventory through the SNMP protocol, set the community strings on the SNMP Configuration page.
• To disable SNMP discovery, clear the Enable SNMP discovery option.
• For discovery and inventory through the WMI protocol, click Next; otherwise, click Finish.
6 Troubleshooting operating system issues This section helps you troubleshoot operating system issues in your system. NOTE: If the problem persists, contact Dell Technical Support for further assistance.
Figure 25. Blue screen of death
2. Run the PSA/ePSA diagnostics. For more information, see PSA/ePSA Diagnostics on page 18.
3. If the diagnostics pass and the issue persists, identify the stage in which the blue screen error occurs.
4. If the BSOD occurs during the boot process, check the minimum-to-POST components. For more information, see Troubleshooting a No POST situation on page 103. If the issue persists, call Dell Technical Support.
5.
• Use DiskPart to verify the status of disk partitions. For more information, see https://technet.microsoft.com/en-in/library/bb490893.aspx.
• Use the bcdedit utility to view or modify the boot configuration database (BCD). For more information, see https://technet.microsoft.com/en-us/library/cc731662.aspx.
NOTE: For additional recovery console commands, see https://support.microsoft.com/en-us/kb/326215.
NOTE: For more troubleshooting steps, see https://support.microsoft.com/en-us/kb/325375.
4.
No POST issues in iDRAC
This section provides details on troubleshooting iDRAC issues.
“First Boot Device cannot be set” error message is displayed when configuring a boot device during POST.
Description
The error message “First Boot Device cannot be set. Either the system BIOS is out-of-date, or the server needs a reboot for the settings to take effect” is displayed in POST mode.
Steps 1. Check the LCD screen or LED indicators for any error messages. For information about the event and error messages generated by the system firmware and agents that monitor system components, see the Error Code Lookup page at qrl.dell.com. 2. Ensure that the server is turned on by verifying that the power supply LED glows green. If the power LED is lit amber, see Power supply unit indicator codes on page 13. 3. Remove all the Electrostatic Discharge (ESD) from the server. a. b. c. d. e.
NOTE: Before connecting the OneDrive site of another user, make sure the OneDrive has been provisioned (i.e. the OneDrive site owner has visited it at least once) and you have administrator permissions granted either by the OneDrive site owner or using the Set-SPOUser commandlet (http://technet.microsoft.com/en-us/library/fp161375(v=office.15).aspx).
16. On the Which Type of Installation Do You Want screen, select Custom: Install Windows only (advanced), if it is not selected already.
17. On the Where do you want to install Windows screen, specify the partition on which you want to install the operating system. To create a partition and begin installation:
a. Click New.
b. Specify the size of the partition in MB, and click Apply. The following message is displayed: Windows might create additional partition for system files
c. Click OK.
7. On the Select the operating system you want to install screen, select the operating system from the available list, and click Next. The license terms window is displayed.
8. Read through the license agreement information. If you agree with all the information, select I accept the license terms, and then click Next.
9. On the Which type of installation do you want screen, select Custom: Install Windows only (advanced) if it is not selected already.
10.
NOTE: By default, the USB 3.0 option is disabled. If enabled, the operating system fails to detect USB devices such as the keyboard, mouse, and USB DVD. Out-of-box USB 3.0 drivers for Windows Server 2008 R2 SP1 are available at www.dell.com/support.
2. Install the drivers after installing the OS.
3. Restart the system.
4. In System Setup, ensure that the USB 3.0 option on the Integrated Device Settings screen is set to Enable.
Resolution This is a known issue. This issue has been fixed in operating systems pre-installed by Dell and in the recovery media shipped with your system. For more information, see the knowledge base article KB2894179 at support.microsoft.com. Troubleshooting system crash at cng.sys with watchdog Error violation Issue: System encountered Blue Screen of Death at cng.sys with "Watchdog Error violation" error. Cng.
Partitions on disk selected for installation of Hyper-V server 2012
Error occurred during installation of Hyper-V server 2012 because the partitions on the disk selected for installation are not in the recommended order. The recommended configuration order includes a Windows RE Tools partition, a system partition, a Microsoft® Reserved partition (MSR), a Windows partition, and a recovery image partition. Add the Windows RE Tools partition and system partition before you add the Windows partition.
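The recommended partition order above can be checked mechanically. The sketch below is illustrative only: the partition labels and the function name are hypothetical, and it compares a list of labels that you supply instead of reading an actual disk.

```python
from typing import List

# Recommended order taken from the paragraph above; the labels are illustrative.
RECOMMENDED_ORDER = ["Windows RE Tools", "System", "MSR", "Windows", "Recovery image"]


def is_recommended_order(partitions: List[str]) -> bool:
    """True when the listed partitions appear in the recommended relative order.

    Raises ValueError for a label that is not in RECOMMENDED_ORDER.
    """
    ranks = [RECOMMENDED_ORDER.index(name) for name in partitions]
    return ranks == sorted(ranks)
```

The check looks only at relative order, so a layout that omits a partition still passes as long as the remaining partitions are in sequence.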
VMware FAQs
Why are VMs configured with Fault Tolerance not in a protected state in ESXi 6.0?
For some PowerEdge systems with AMD 6300 series processors, VMs configured with Fault Tolerance (FT) might not be in a protected state. Sometimes, the secondary VM takes more time to attain the protected state. This is a known issue. Affected systems include PowerEdge R815, R715, and M915.
Backing up the configuration of your ESXi host
About this task
To back up configuration data of a host:
Steps
1. Start the vSphere CLI.
2. Run the vicfg-cfgbackup command with the -s flag to save the host configuration to the specified backup filename:
vicfg-cfgbackup --server <ESXi-host-ip> --portnumber --protocol --username root --password root_password [-s
The -portnumber and -protocol options are optional.
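When the backup is scripted, the command line from step 2 can be assembled programmatically. The following is a minimal sketch that only builds the argument list and does not run it; the function name, the example host address, and the backup filename are placeholders, and only the options named in the step above are used.

```python
from typing import List, Optional


def build_backup_command(
    host: str,
    backup_file: str,
    username: str = "root",
    password: Optional[str] = None,
    port: Optional[int] = None,
    protocol: Optional[str] = None,
) -> List[str]:
    """Assemble a vicfg-cfgbackup argument list that saves the host configuration."""
    command = ["vicfg-cfgbackup", "--server", host, "--username", username]
    if password is not None:
        command += ["--password", password]
    # The port number and protocol are optional, so add them only when given.
    if port is not None:
        command += ["--portnumber", str(port)]
    if protocol is not None:
        command += ["--protocol", protocol]
    command += ["-s", backup_file]
    return command


# Placeholder host and filename, for illustration only.
example = build_backup_command("192.0.2.10", "esxi-backup.cfg")
```

Passing the resulting list to a process launcher keeps each value as a separate argument, which avoids shell quoting problems with passwords or file names that contain spaces.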
To fix the issue, reinstall ioSphere. For information about installing, updating, and managing Fusion-IO drives, including how to install ioSphere, see:
http://www.dell.com/support/article/au/en/audhs1/sln156793/how-to-install-update-and-manage-fusion-io-drives-in-windows-os-on-dell-poweredge-servers?lang=en
Symptoms
Dell PowerEdge Express Flash NVMe PCIe SSD device is not detected during hot-plug in ESXi 6.
Table 22. Installation of OS through LC and various methods (continued)
Slno | Video | Description
3. | Dell Lifecycle Controller - Firmware Update Using FTP Server | LC Firmware Update Using FTP Server
4. | Dell Lifecycle Controller - Firmware Update Using Network Share: CIFS (Common Internet File System) | LC - Firmware Update Using CIFS
5. | Dell Lifecycle Controller - Firmware Update Using Network Share: NFS (Network File System) | LC - Firmware Update Using NFS
6.
7 Getting help
Topics:
• Contacting Dell EMC
• Download the drivers and firmware
• Locating Service Tag of your system
Contacting Dell EMC
Dell EMC provides several online and telephone-based support and service options. If you do not have an active internet connection, you can find contact information on your purchase invoice, packing slip, bill, or Dell EMC product catalog. Availability varies by country and product, and some services may not be available in your area.
Locating Service Tag of your system Your system is identified by a unique Express Service Code and Service Tag number. The Express Service Code and Service Tag are found on the front of the system by pulling out the information tag. Alternatively, the information may be on a sticker on the chassis of the system. The mini Enterprise Service Tag (EST) is found on the back of the system. This information is used by Dell to route support calls to the appropriate personnel. Figure 26.