| Vi kiến trúc |
ISA |
FP64 |
FP32 |
FP16 |
| Intel CPU |
| Intel 80486 |
x87 (32-bit) |
? |
0.128[1] |
? |
Intel P5 Pentium Intel P6 Pentium Pro |
x87 (32-bit) |
? |
0.5[1] |
? |
Intel P5 Pentium MMX Intel P6 Pentium II |
MMX (64-bit) |
? |
1[2] |
? |
| Intel P6 Pentium III |
SSE (64-bit) |
? |
2[2] |
? |
| Intel Netburst Pentium 4 (Willamette, Northwood) |
SSE2 (64-bit) |
2 |
4 |
? |
| Intel P6 Pentium M |
SSE2 (64-bit) |
1 |
2 |
? |
Intel Netburst Pentium 4 (Prescott, Cedar Mill) Intel Netburst Pentium D (Smithfield, Presler) Intel P6 Core (Yonah) |
SSE3 (64-bit) |
2 |
4 |
? |
Intel Core (Merom, Penryn) Intel Nehalem[3] (Nehalem, Westmere) | SSSE3 (128-bit) SSE4 (128-bit) | 4 | 8 | ? |
| Intel Atom (Bonnell, Saltwell, Silvermont and Goldmont) | SSE3 (128-bit) | 2 | 4 | ? |
| Intel Sandy Bridge (Sandy Bridge, Ivy Bridge) | AVX (256-bit) | 8 | 16 | 0 |
Intel Haswell[3] (Haswell, Devil's Canyon, Broadwell) Intel Skylake (Skylake, Kaby Lake, Coffee Lake, Comet Lake, Whiskey Lake, Amber Lake) | AVX2 & FMA (256-bit) | 16 | 32 | 0 |
| Intel Xeon Phi (Knights Corner) | SSE & FMA (256-bit) | 16 | 32 | 0 |
Intel Skylake-X (Skylake-X, Cascade Lake) Intel Xeon Phi (Knights Landing, Knights Mill)
Intel Ice Lake, Tiger Lake and Rocket Lake |
AVX-512 & FMA (512-bit) | 32 | 64 | 0 |
| AMD CPU |
| AMD Bobcat | AMD64 (64-bit) | 2 | 4 | 0 |
AMD Jaguar AMD Puma |
AVX (128-bit) |
4 |
8 |
0 |
| AMD K10 |
SSE4/4a (128-bit) |
4 |
8 |
0 |
| AMD Bulldozer[3] (Piledriver, Steamroller, Excavator) | AVX (128-bit) Bulldozer-Steamroller
AVX2 (128-bit) Excavator
FMA3 (Bulldozer)[4]
FMA3/4 (Piledriver-Excavator) |
4 | 8 | 0 |
AMD Zen (Ryzen 1000 series, Threadripper 1000 series, Epyc Naples) AMD Zen+[3][5][6][7] (Ryzen 2000 series, Threadripper 2000 series) | AVX2 & FMA (128-bit, 256-bit decoding)[8] | 8 | 16 | 0 |
AMD Zen 2[9] (Ryzen 3000 series, Threadripper 3000 series, Epyc Rome)) AMD Zen 3 (Ryzen 5000 series) | AVX2 & FMA (256-bit) | 16 | 32 | 0 |
| ARM CPU |
| ARM Cortex-A7, A9, A15 | ARMv7 | 1 | 8 | 0 |
| ARM Cortex-A32, A35, A53, A55, A72, A73, A75 | ARMv8 | 2 | 8 | 0 |
| ARM Cortex-A57[3] | ARMv8 | 4 | 8 | 0 |
| ARM Cortex-A76, A77, A78 | ARMv8 | 8 | 16 | 0 |
| ARM Cortex-X1 |
ARMv8 |
16 |
32 |
? |
| Qualcomm Krait | ARMv8 | 1 | 8 | 0 |
| Qualcomm Kryo (1xx - 3xx) | ARMv8 | 2 | 8 | 0 |
| Qualcomm Kryo (4xx - 5xx) | ARMv8 | 8 | 16 | 0 |
| Samsung Exynos M1 and M2 | ARMv8 | 2 | 8 | 0 |
| Samsung Exynos M3 and M4 | ARMv8 | 3 | 12 | 0 |
| IBM PowerPC A2 (Blue Gene/Q) | ? | 8 | 8 (as FP64) | 0 |
| Hitachi SH-4[10][11] | SH-4 | 1 | 7 | 0 |
| Nvidia GPU |
| Nvidia Curie (GeForce 6 series and GeForce 7 series) |
PTX |
? |
8 |
? |
| Nvidia Tesla 2.0 (GeForce GTX 260-295) |
PTX |
? |
2 |
? |
| Nvidia Fermi (chỉ tính GeForce GTX 465–480, 560 Ti, 570-590) | PTX | 1/4 (locked by driver, 1 in hardware) | 2 | 0 |
| Nvidia Fermi (chỉ tính Quadro 600-2000) | PTX | 1/8 | 2 | 0 |
| Nvidia Fermi (chỉ tính Quadro 4000–7000, Tesla) | PTX | 1 | 2 | 0 |
| Nvidia Kepler (GeForce (trừTitan and Titan Black), Quadro (except K6000), Tesla K10) | PTX | 1/12 (for GK110 | 2 | 0 |
| Nvidia Kepler (GeForce GTX Titan and Titan Black, Quadro K6000, Tesla (trừ K10)) | PTX | 2/3 | 2 | 0 |
Nvidia Maxwell Nvidia Pascal (không tính Quadro GP100 and Tesla P100) | PTX | 1/16 | 2 | 1/32 |
| Nvidia Pascal (chỉ tính Quadro GP100 and Tesla P100) | PTX | 1 | 2 | 4 |
| Nvidia Volta[12] | PTX | 1 | 2 (FP32) + 2 (INT32) | 16 |
| Nvidia Turing (chỉ tính GeForce 16XX) | PTX | 1/16 | 2 (FP32) + 2 (INT32) | 4 |
| Nvidia Turing (không tính GeForce 16XX) | PTX | 1/16 | 2 (FP32) + 2 (INT32) | 16 |
| Nvidia Ampere[13][14] (chỉ tính Tesla A100/A30) | PTX | 2 | 2 (FP32) + 2 (INT32) | 32 |
| Nvidia Ampere (gồm GeForce và Quadro, Tesla A40/A10) | PTX | 1/32 | 2 (FP32) + 0 (INT32) or 1 (FP32) + 1 (INT32) | 8 |
| AMD GPU |
| AMD TeraScale 1 (Radeon HD 4000 series) |
TeraScale 1 |
0.4 |
2 |
? |
| AMD TeraScale 2 (Radeon HD 5000 series) |
TeraScale 2 |
1 |
2 |
? |
| AMD TeraScale 3 (Radeon HD 6000 series) |
TeraScale 3 |
1 |
4 |
? |
| AMD GCN (chỉ tính Radeon Pro W 8100-9100) | GCN | 1 | 2 | ? |
| AMD GCN (không tính Radeon Pro W 8100-9100, Vega 10-20) | GCN | 1/8 | 2 | 4 |
| AMD GCN Vega 10 | GCN | 1/8 | 2 | 4 |
| AMD GCN Vega 20 (chỉ tính Radeon VII) | GCN | | 2 | 4 |
| AMD GCN Vega 20 (chỉ tính Radeon Instinct MI50 / MI60 and Radeon Pro VII) | GCN | 1 | 2 | 4 |
AMD RDNA[15][16] AMD RDNA 2 | RDNA | 1/8 | 2 | 4 |
| AMD CDNA | CDNA | 1 | 4 (Tenxơ)[17] | 16 |
| AMD CDNA 2 | CDNA 2 | 4 (Tenxơ) | 4 (Tenxơ) | 16 |
| Qualcomm GPU |
| Qualcomm Adreno 5x0 |
Adreno 5xx |
1 |
2 |
4 |
| Qualcomm Adreno 6x0 |
Adreno 6xx |
1 |
2 |
4 |
| Graphcore |
| Graphcore Colossus GC2[18][19][20] (values estimated) | ? | 0 | 18 | 72 |
| Graphcore Colossus GC200 Mk2[21] (values estimated) | ? | 0 | 36 | 144 |
| Supercomputer |
| ENIAC @ 100 Khz with 385 Flops[22] |
|
|
|
|
| 48-bit processor @ 208 kHz in CDC 1604 in 1960 |
|
|
|
|
| 60-bit processor @ 10 Mhz in CDC6600 in 1964 |
|
0.3 (FP60) |
|
|
| 60-bit processor @ 10 Mhz in CDC7600 in 1967 |
|
1.0 (FP60) [23] |
|
|
| Cray-1 @ 80 Mhz in 1976 |
|
2 |
|
|
| CDC Cyber 205 @ 50 Mhz in 1981 |
FORTRAN compiler
(ANSI 77 with vector extensions) |
8 |
16 |
|
| Vi kiến trúc |
ISA |
FP64 |
FP32 |
FP16 |