HPT
High Performance Tensor Library
Ease To Use
Simple and intuitive API design with comprehensive documentation and examples.
Customizable
Support user custom data type, custom memory allocator
Memory Layout
Optimized memory layout with support for both contiguous and strided tensors.
SIMD Support
Leverages CPU SIMD instructions (SSE/AVX/NEON) for vectorized operations.
Iterator API
Flexible iterator API for efficient element-wise/broadcast operations and custom implementations.
Multi-Threading
Auto efficient parallel processing for CPU-intensive operations.
Broadcasting
Automatic shape broadcasting for element-wise operations, similar to NumPy.
Type Safe
Strong type system ensures correctness at compile time, preventing runtime errors.
Zero-Copy
Minimizes memory overhead with zero-copy operations and efficient data sharing.