Unary Benchmark
[Chart: Unary f32 Performance (size = 1024 * 2048 * 8)]
[Chart: Unary f32 Performance (size = 4096 * 2048 * 8)]
Error precision (lower is better)
Hpt: 1 ulp
Torch: 1 ulp
Candle (mkl): 1 ulp
Ndarray (par): 1 ulp
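The page does not say how the ULP figures were measured. As an illustration only, the sketch below shows one common way to compute the maximum ULP (units in the last place) distance between f32 results and reference values. The names `ordered_bits`, `ulp_distance`, and `max_ulp_error` are hypothetical and not part of Hpt, Torch, Candle, or Ndarray; the reference values are assumed to come from a higher-precision computation rounded to f32.

```rust
/// Map an f32 onto a signed integer line so that adjacent floats differ by 1.
/// (Hypothetical helper for illustration; not an API of any benchmarked library.)
fn ordered_bits(x: f32) -> i64 {
    let bits = x.to_bits();
    // Strip the sign bit to get the magnitude as an integer.
    let magnitude = (bits & 0x7FFF_FFFF) as i64;
    // Negative floats are mirrored below zero; +0.0 and -0.0 both map to 0.
    if bits >> 31 == 1 { -magnitude } else { magnitude }
}

/// ULP distance between two f32 values, or None if either is NaN.
pub fn ulp_distance(a: f32, b: f32) -> Option<u64> {
    if a.is_nan() || b.is_nan() {
        return None;
    }
    Some((ordered_bits(a) - ordered_bits(b)).unsigned_abs())
}

/// Largest ULP distance over two equally sized slices.
/// Returns None if any compared pair contains a NaN.
pub fn max_ulp_error(actual: &[f32], expected: &[f32]) -> Option<u64> {
    assert_eq!(actual.len(), expected.len(), "slices must have equal length");
    let mut worst = 0u64;
    for (&a, &e) in actual.iter().zip(expected) {
        worst = worst.max(ulp_distance(a, e)?);
    }
    Some(worst)
}
```

Under this definition, a reported error of 1 ulp would mean that every output is either the reference f32 value or one of its two immediate neighbours.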
Compilation config
[profile.release]
opt-level = 3
incremental = true
debug = true
lto = "fat"
codegen-units = 1
Running Threads
10
Device specification
CPU: 12th Gen Intel(R) Core(TM) i5-12600K 3.69 GHz
RAM: G.SKILL Trident Z Royal Series (Intel XMP) DDR4 64GB
System: Windows 11 Pro 23H2