CPU 2019 node[edit | edit source]

This was ran on a cpu2019 node on ARC at the University of Calgary.


Results Overview[edit source]

Hardware Dell PowerEdge C6420
CPU Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz
Memory 192GB, 12x 16GB DDR4 2933 MT/s
Disk Dell RAID TBD
Operating System CentOS 8.2
Score 1412.3 / 12440.3

Raw Output

========================================================================
   BYTE UNIX Benchmarks (Version 5.1.3)

   System: fc106: GNU/Linux
   OS: GNU/Linux -- 4.18.0-193.28.1.el8_2.x86_64 -- #1 SMP Thu Oct 22 00:20:22 UTC 2020
   Machine: x86_64 (x86_64)
   Language: en_US.utf8 (charmap="UTF-8", collate="UTF-8")
   CPU 0: Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz (5000.0 bogomips)
          Hyper-Threading, x86-64, MMX, Physical Address Ext, SYSENTER/SYSEXIT, SYSCALL/SYSRET, Intel virtualization
...
   CPU 39: Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz (5005.0 bogomips)
          Hyper-Threading, x86-64, MMX, Physical Address Ext, SYSENTER/SYSEXIT, SYSCALL/SYSRET, Intel virtualization
   10:19:36 up 69 days, 23:54,  1 user,  load average: 0.29, 0.10, 0.02; runlevel 2020-12-10

------------------------------------------------------------------------
Benchmark Run: Thu Feb 18 2021 10:19:36 - 10:47:28
40 CPUs in system; running 1 parallel copy of tests

Dhrystone 2 using register variables       41051800.4 lps   (10.0 s, 7 samples)
Double-Precision Whetstone                     7034.8 MWIPS (8.9 s, 7 samples)
Execl Throughput                               6244.5 lps   (30.0 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks       1070822.2 KBps  (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks          286157.2 KBps  (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks       3108472.4 KBps  (30.0 s, 2 samples)
Pipe Throughput                             1312508.3 lps   (10.0 s, 7 samples)
Pipe-based Context Switching                 174850.3 lps   (10.0 s, 7 samples)
Process Creation                              10661.3 lps   (30.0 s, 2 samples)
Shell Scripts (1 concurrent)                   2591.9 lpm   (60.0 s, 2 samples)
Shell Scripts (8 concurrent)                   1661.9 lpm   (60.0 s, 2 samples)
System Call Overhead                         872824.5 lps   (10.0 s, 7 samples)

System Benchmarks Index Values               BASELINE       RESULT    INDEX
Dhrystone 2 using register variables         116700.0   41051800.4   3517.7
Double-Precision Whetstone                       55.0       7034.8   1279.1
Execl Throughput                                 43.0       6244.5   1452.2
File Copy 1024 bufsize 2000 maxblocks          3960.0    1070822.2   2704.1
File Copy 256 bufsize 500 maxblocks            1655.0     286157.2   1729.0
File Copy 4096 bufsize 8000 maxblocks          5800.0    3108472.4   5359.4
Pipe Throughput                               12440.0    1312508.3   1055.1
Pipe-based Context Switching                   4000.0     174850.3    437.1
Process Creation                                126.0      10661.3    846.1
Shell Scripts (1 concurrent)                     42.4       2591.9    611.3
Shell Scripts (8 concurrent)                      6.0       1661.9   2769.8
System Call Overhead                          15000.0     872824.5    581.9
                                                                   ========
System Benchmarks Index Score                                        1412.3

------------------------------------------------------------------------
Benchmark Run: Thu Feb 18 2021 10:47:28 - 11:15:27
40 CPUs in system; running 40 parallel copies of tests

Dhrystone 2 using register variables     1560927811.9 lps   (10.0 s, 7 samples)
Double-Precision Whetstone                   276304.5 MWIPS (9.1 s, 7 samples)
Execl Throughput                              49798.6 lps   (30.0 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks        890032.4 KBps  (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks          236135.0 KBps  (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks       2926246.7 KBps  (30.0 s, 2 samples)
Pipe Throughput                            47538358.0 lps   (10.0 s, 7 samples)
Pipe-based Context Switching                6376890.8 lps   (10.0 s, 7 samples)
Process Creation                             100214.8 lps   (30.0 s, 2 samples)
Shell Scripts (1 concurrent)                 111953.1 lpm   (60.0 s, 2 samples)
Shell Scripts (8 concurrent)                  18881.2 lpm   (60.1 s, 2 samples)
System Call Overhead                        4065511.7 lps   (10.0 s, 7 samples)

System Benchmarks Index Values               BASELINE       RESULT    INDEX
Dhrystone 2 using register variables         116700.0 1560927811.9 133755.6
Double-Precision Whetstone                       55.0     276304.5  50237.2
Execl Throughput                                 43.0      49798.6  11581.1
File Copy 1024 bufsize 2000 maxblocks          3960.0     890032.4   2247.6
File Copy 256 bufsize 500 maxblocks            1655.0     236135.0   1426.8
File Copy 4096 bufsize 8000 maxblocks          5800.0    2926246.7   5045.3
Pipe Throughput                               12440.0   47538358.0  38214.1
Pipe-based Context Switching                   4000.0    6376890.8  15942.2
Process Creation                                126.0     100214.8   7953.6
Shell Scripts (1 concurrent)                     42.4     111953.1  26404.0
Shell Scripts (8 concurrent)                      6.0      18881.2  31468.7
System Call Overhead                          15000.0    4065511.7   2710.3
                                                                   ========
System Benchmarks Index Score                                       12440.3

CPU information as reported by lscpu:

# lscpu
Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              40
On-line CPU(s) list: 0-39
Thread(s) per core:  1
Core(s) per socket:  20
Socket(s):           2
NUMA node(s):        2
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz
Stepping:            7
CPU MHz:             999.991
BogoMIPS:            5000.00
Virtualization:      VT-x
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            28160K
NUMA node0 CPU(s):   0,2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38
NUMA node1 CPU(s):   1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,33,35,37,39
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single intel_ppin ssbd mba ibrs ibpb stibp ibrs_enhanced tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm mpx rdt_a avx512f avx512dq rdseed adx smap clflushopt clwb intel_pt avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts pku ospke avx512_vnni md_clear flush_l1d arch_capabilities

CPU 2021 node[edit | edit source]

This was run on a cpu2021 node. Identical setup as the node above but with a newer CPU model.

Results Overview[edit source]

Hardware Dell PowerEdge C6420
CPU Intel(R) Xeon(R) Gold 6240R CPU @ 2.40GHz
Memory 192GB, 12x 16GB DDR4 2933 MT/s
Disk Dell RAID TBD
Operating System CentOS 8.4
Score 1231.2 / 8006.4

Raw Output

========================================================================
   BYTE UNIX Benchmarks (Version 5.1.3)

   System: mc1: GNU/Linux
   OS: GNU/Linux -- 4.18.0-193.28.1.el8_2.x86_64 -- #1 SMP Thu Oct 22 00:20:22 UTC 2020
   Machine: x86_64 (x86_64)
   Language: en_US.utf8 (charmap="UTF-8", collate="UTF-8")
   CPU 0: Intel(R) Xeon(R) Gold 6240R CPU @ 2.40GHz (4800.0 bogomips)
          Hyper-Threading, x86-64, MMX, Physical Address Ext, SYSENTER/SYSEXIT, SYSCALL/SYSRET, Intel virtualization
...
   CPU 47: Intel(R) Xeon(R) Gold 6240R CPU @ 2.40GHz (4804.9 bogomips)
          Hyper-Threading, x86-64, MMX, Physical Address Ext, SYSENTER/SYSEXIT, SYSCALL/SYSRET, Intel virtualization
   13:11:13 up 9 days,  3:00,  2 users,  load average: 0.38, 0.10, 0.76; runlevel 2021-07-14

------------------------------------------------------------------------

Benchmark Run: Fri Jul 23 2021 13:11:13 - 13:39:06
48 CPUs in system; running 1 parallel copy of tests

Dhrystone 2 using register variables       40849804.1 lps   (10.0 s, 7 samples)
Double-Precision Whetstone                     7046.7 MWIPS (8.9 s, 7 samples)
Execl Throughput                               2586.5 lps   (30.0 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks        861513.2 KBps  (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks          223753.9 KBps  (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks       2479266.0 KBps  (30.0 s, 2 samples)
Pipe Throughput                             1187647.7 lps   (10.0 s, 7 samples)
Pipe-based Context Switching                 183282.0 lps   (10.0 s, 7 samples)
Process Creation                              12716.3 lps   (30.0 s, 2 samples)
Shell Scripts (1 concurrent)                   3546.3 lpm   (60.0 s, 2 samples)
Shell Scripts (8 concurrent)                   1182.3 lpm   (60.0 s, 2 samples)
System Call Overhead                         737645.5 lps   (10.0 s, 7 samples)

System Benchmarks Index Values               BASELINE       RESULT    INDEX
Dhrystone 2 using register variables         116700.0   40849804.1   3500.4
Double-Precision Whetstone                       55.0       7046.7   1281.2
Execl Throughput                                 43.0       2586.5    601.5
File Copy 1024 bufsize 2000 maxblocks          3960.0     861513.2   2175.5
File Copy 256 bufsize 500 maxblocks            1655.0     223753.9   1352.0
File Copy 4096 bufsize 8000 maxblocks          5800.0    2479266.0   4274.6
Pipe Throughput                               12440.0    1187647.7    954.7
Pipe-based Context Switching                   4000.0     183282.0    458.2
Process Creation                                126.0      12716.3   1009.2
Shell Scripts (1 concurrent)                     42.4       3546.3    836.4
Shell Scripts (8 concurrent)                      6.0       1182.3   1970.6
System Call Overhead                          15000.0     737645.5    491.8
                                                                   ========
System Benchmarks Index Score                                        1231.2

------------------------------------------------------------------------
Benchmark Run: Fri Jul 23 2021 13:39:06 - 14:07:27
48 CPUs in system; running 48 parallel copies of tests

Dhrystone 2 using register variables     1871186273.4 lps   (10.0 s, 7 samples)
Double-Precision Whetstone                   335219.1 MWIPS (9.0 s, 7 samples)
Execl Throughput                              38147.8 lps   (29.7 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks        680107.0 KBps  (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks          191201.6 KBps  (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks       2370667.8 KBps  (30.0 s, 2 samples)
Pipe Throughput                            51608289.5 lps   (10.0 s, 7 samples)
Pipe-based Context Switching                9085227.9 lps   (10.0 s, 7 samples)
Process Creation                              95370.3 lps   (30.0 s, 2 samples)
Shell Scripts (1 concurrent)                  10333.9 lpm   (60.1 s, 2 samples)
Shell Scripts (8 concurrent)                   1248.3 lpm   (61.1 s, 2 samples)
System Call Overhead                        4093131.1 lps   (10.0 s, 7 samples)

System Benchmarks Index Values               BASELINE       RESULT    INDEX
Dhrystone 2 using register variables         116700.0 1871186273.4 160341.6
Double-Precision Whetstone                       55.0     335219.1  60948.9
Execl Throughput                                 43.0      38147.8   8871.6
File Copy 1024 bufsize 2000 maxblocks          3960.0     680107.0   1717.4
File Copy 256 bufsize 500 maxblocks            1655.0     191201.6   1155.3
File Copy 4096 bufsize 8000 maxblocks          5800.0    2370667.8   4087.4
Pipe Throughput                               12440.0   51608289.5  41485.8
Pipe-based Context Switching                   4000.0    9085227.9  22713.1
Process Creation                                126.0      95370.3   7569.1
Shell Scripts (1 concurrent)                     42.4      10333.9   2437.2
Shell Scripts (8 concurrent)                      6.0       1248.3   2080.5
System Call Overhead                          15000.0    4093131.1   2728.8
                                                                   ========
System Benchmarks Index Score                                        8006.4