Skip to content

Level 1 — Vector operations

Level 1 BLAS: O(n) vector operations. These run on PyTorch (CPU/GPU/Neuron via XLA); the Tensor Engine is wasted on vector work so no NKI kernels.

axpy(alpha, x, y)

y = α·x + y. Returns the updated y.

dot(x, y)

Scalar inner product x^T y.

nrm2(x)

Euclidean norm ‖x‖₂.

scal(alpha, x)

In-place scaling x = α·x.

asum(x)

Sum of absolute values Σ|xᵢ|.

iamax(x)

Index of element with largest absolute value.