Apple M4 (6P + 4E), 16GB unified memory. All benchmarks are single-precision and single-threaded.
Each plot shows:
coral-safe(portable-simd, safe Rust)coral-neon(AArch64 / NEON)- a reference implementation:
- OpenBLAS armv8, or
- Apple Accelerate, or
- BLIS …