Product and Performance Information

111X higher batch AI inference performance with Intel-optimized Tensor Flow vs. stock Cascade Lake FP32 configuration New: 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X261, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, ResNet-50 v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=128 FP32,INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, Unoptimized model : TensorFlow- 2.4.1, Modelzoo:https://github.com/IntelAI/models -b master, test by Intel on 3/12/2021. Baseline: 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-48-generic, 1x Samsung_​SSD_​860, Intel SSDPE2KX040T8, ResNet-50 v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=128 FP32,INT8, Optimized model : TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, Unoptimized model : TensorFlow- 2.4.1, Modelzoo:https://github.com/IntelAI/models -b master, test by Intel on 2/17/2021.
21.53x higher HPC performance (geomean HPL, HPCG, STREAM Triad, WRF, Binomial Options, Black Scholes, Monte Carlo, OpenFOAM, GROMACS, LAMMPS, NAMD, RELION).
31.70x higher Monte Carlo performance App Version: v1.1; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4; threads/core: 1; Turbo: used; Build knobs: -O3 -xCORE-AVX512 -qopt-zmm-usage=high -fimf-precision=low -fimf-domain-exclusion=31 -no-prec-div -no-prec-sqrt.
4As of July 21, 2021, Intel offers up to 12 DDR4 memory channels for Intel® Xeon® processors, as compared to 8 for AMD EPYC.
51.60x higher Life and Material Science performance (geomean GROMACS, LAMMPS, NAMD, RELION).
61.61x higher RELION Plasmodium Ribosome performance App Version: 3_​1_​1; Build notes: Tools: Intel C Compiler 2020u4, Intel MPI 2019u9; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -g -debug inline-debug-info -xCOMMON-AVX512 -qopt-report=5 –restrict.
71.64x higher GROMACS performance (geomean ion_​channel_​pme, lignocellulose_​rf, water_​pme, water_​rf) App Version: v2020.5_​SP; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512.
81.60x higher LAMMPS performance (geomean Polyethylene, Stillinger-Weber, Tersoff, Water) App Version: v2020-10-29; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4, Intel MPI 2019u8; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512 -qopt-zmm-usage=high.
91.57x higher NAMD performance (geomean Apoa1, f1atpase, STMV) App Version: 2.15-Alpha1 (includes AVX tiles algorithm); Build notes: Tools: Intel MKL, Intel C Compiler 2020u4, Intel MPI 2019u8, Intel Threading Building Blocks 2020u4; threads/core: 2; Turbo: used; Build knobs: -ip -fp-model fast=2 -no-prec-div -qoverride-limits -qopenmp-simd -O3 -xCORE-AVX512 -qopt-zmm-usage=high.
101.42x higher performance on OpenFOAM Motorbike 42M. New: 6354: 1-node, 2x Intel Xeon Gold 6354 (18C/3.0GHz, 205W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3, 4.18.0-240.10.1.el8_​3.x86_​64, 1x Intel_​SSDSC2KG96 .Tested by Intel between March 12, 2021 and March 29, 2021. Baseline: 6154: 1-node, 2x Intel Xeon Gold 6154 (18C/3.0GHz, 200W TDP) processor on Intel Software Development Platform with 192GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x2006a0a, HT on, Turbo on, CentOS Linux 8.3, 4.18.0-240.10.1.el8_​3.x86_​64, 1x Intel_​SSDSC2KG96 . Tested by Intel between February 1, 2021 to February 20, 2021.