Technical White Paper Dell EMC Ready Solutions for HPC Digital Manufacturing with AMD EPYC™ Processors— ANSYS® Performance Abstract This Dell EMC technical white paper discusses performance benchmarking results and analysis for ANSYS® CFX® and Fluent® on the Dell EMC Ready Solutions for HPC Digital Manufacturing with AMD EPYC™ processors.
Revisions Revisions Date Description November 2019 Initial release with AMD EPYC™ 7002 series processors Acknowledgements This paper was produced by the following: Authors: Joshua Weage Martin Feyereisen The information in this publication is provided “as is.” Dell Inc. makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose.
Table of contents Table of contents Revisions.............................................................................................................................................................................2 Acknowledgements .............................................................................................................................................................2 Table of contents .................................................................................................
Introduction 1 Introduction This technical white paper discusses the performance of ANSYS® CFX® and Fluent® on the Dell EMC Ready Solutions for HPC Digital Manufacturing with AMD EPYC™ processors. This Ready Solution was designed and configured specifically for Digital Manufacturing workloads, where computer aided engineering (CAE) applications are critical for virtual product development.
System Building Blocks 2 System Building Blocks The Dell EMC Ready Solutions for HPC Digital Manufacturing is designed using preconfigured building blocks. The building block architecture allows an HPC system to be optimally designed for specific end-user requirements, while still making use of standardized, domain-specific system recommendations. The available building blocks are infrastructure servers, storage, networking, and compute building blocks.
System Building Blocks A recommended base configuration for infrastructure servers is: • • • • • • • • Dell EMC PowerEdge R6515 server AMD EPYC 7302P processor 128GB of RAM (8 x 16GB 3200 MTps DIMMs) PERC H345 RAID controller 2 x 480GB Mixed Use SATA SSD RAID 1 Dell EMC iDRAC9 Enterprise 2 x 550W Power Supplies Mellanox ConnectX-6 InfiniBandTM HCA (optional) The recommended base configuration for the infrastructure server is described as follows. The PowerEdge R6515 server is suited for this role.
System Building Blocks Table 1 Recommended Configurations for the Compute Building Block Platforms Dell EMC PowerEdge R6525 Dell EMC PowerEdge C6525 Processor Options Dual AMD EPYC 7402 (24 cores per socket) Dual AMD EPYC 7452 (32 cores per socket) Dual AMD EPYC 7502 (32 cores per socket) Dual AMD EPYC 7552 (48 cores per socket) Dual AMD EPYC 7702 (64 cores per socket) Memory Options Local Storage iDRAC Power Supplies Networking 2.
System Building Blocks Additionally, two BBB’s can be directly coupled together via a high-speed network cable, such as InfiniBand or Ethernet, without need of an additional high-speed switch if additional compute capability is required for each simulation run (BBB Couplet).
System Building Blocks Operational storage is typically sized based on the number of expected users. For fewer than 30 users, a single storage server, such as the Dell PowerEdge R7515 is often an appropriate choice. A suitably equipped storage server may be: • • • • • • • • • • Dell EMC PowerEdge R7515 server AMD EPYC 7302P processor 128GB of RAM (8 x 16GB 3200 MTps DIMMs) PERC H745 RAID controller 2 x 240GB Mixed Use SATA SSD in RAID-1 (For OS) 12 x 12TB 3.
System Building Blocks For customers desiring a shared high-performance parallel filesystem, the Dell MC Ready Solutions for HPC Lustre Storage solution shown in Figure 3 is appropriate. This solution can scale up to multiple petabytes of storage. Figure 3 Dell EMC Ready Solutions for Lustre Storage Reference Architecture 2.5 System Networks Most HPC systems are configured with two networks—an administration network and a high-speed/lowlatency switched fabric.
System Building Blocks 2.7 Services and Support The Dell EMC Ready Solutions for HPC Digital Manufacturing is available with full hardware support and deployment services, including additional HPC system support options.
Reference System 3 Reference System Performance benchmarking was performed in the Dell EMC HPC and AI Innovation Lab using system configurations as listed in Table 2.
Reference System The software versions used for the benchmarks are listed in Table 4. Table 4 Software Versions 13 Component Version Operating System RedHat Enterprise Linux 7.6 Kernel 3.10.0-957.27.2.el7.x86_64 OFED Mellanox 4.6-1.0.1.1 Bright Cluster Manager 8.
ANSYS CFX Performance 4 ANSYS CFX Performance ANSYS CFX software is a computational fluid dynamics (CFD) application recognized for its accuracy, robustness and speed with rotating machinery applications. CFD applications typically scale well across multiple processor cores and servers, have modest memory capacity requirements, and typically perform minimal disk I/O while in the solver section. However, some simulations may have greater I/O demands, such as transient analysis.
ANSYS CFX Performance Performance Relative to One Node ANSYS CFX Parallel Scaling—AMD EPYC 7452 9.0 8.0 7.0 6.0 5.0 4.0 3.0 2.0 1.0 1 2 4 6 8 Number of Nodes Airfoil_10M Airfoil_50M Airfoil_100M LeMans Pump Figure 5 ANSYS CFX Parallel Scaling—AMD EPYC 7452 Figure 5 presents the parallel scalability when running CFX with up to eight CBB nodes configured with AMD EPYC 7452 processors and ConnectX-6 HDR100 InfiniBand. All processor cores in each server were used when running these benchmarks.
ANSYS Fluent Performance 5 ANSYS Fluent Performance ANSYS Fluent is a computational fluid dynamics (CFD) application commonly used across a very wide range of CFD and multi-physics applications. CFD applications typically scale well across multiple processor cores and servers, have modest memory capacity requirements and typically perform minimal disk I/O while in the solver section. However, some simulations may have greater I/O demands, such as transient analysis.
ANSYS Fluent Performance ANSYS Fluent Parallel Scaling—AMD EPYC 7452 Performance Relative to One Node 10.0 9.0 8.0 7.0 6.0 5.0 4.0 3.0 2.0 1.
Conclusion 6 Conclusion This technical white paper presents the Dell EMC Ready Solutions for HPC Digital Manufacturing with AMD EPYC 7002 Series processors. The detailed analysis of the building block configurations demonstrate that the system is architected for a specific purpose—to provide a comprehensive HPC solution for the manufacturing domain. Use of this building block approach allows customers to easily deploy an HPC system optimized for their specific workload requirements.