Last updated: 2026-06-12
Inference Runtime
Matvec
T-CPU matrix-vector projection implementation.
Dequantizes one GGUF tensor row at a time and computes a scalar matvec.
struct
Params
pub const Params = struct
Inputs and outputs for one matrix-vector projection.
function
run
pub fn run(params: Params) !void
Computes `output = W * input` (or `output += W * input` when `accumulate` is set) one row at a time.
Each row of the GGUF matrix is dequantized into `row_scratch` and dotted against `input`. Returns an error if `row_scratch` is shorter than `input`; otherwise returns void.
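The row-at-a-time loop described above can be sketched as follows. This is a minimal illustration, not the module's actual code: `SketchParams`, `runSketch`, the `weights`, `rows`, and `cols` fields, and the error names are all hypothetical, and the dequantization step is replaced by a plain copy of `f32` values because the real GGUF decoding is not shown in this page. Only `input`, `output`, `row_scratch`, and `accumulate` are names taken from the documentation.

```zig
const std = @import("std");

// Hypothetical stand-in for `Params`. Only `input`, `output`, `row_scratch`,
// and `accumulate` come from the docs; the rest are assumptions.
const SketchParams = struct {
    weights: []const f32, // row-major, rows * cols; stands in for a GGUF tensor
    rows: usize,
    cols: usize,
    input: []const f32,
    output: []f32,
    row_scratch: []f32,
    accumulate: bool = false,
};

// Hypothetical error names; the real error set is not documented here.
const SketchError = error{ ScratchTooSmall, ShapeMismatch };

fn runSketch(p: SketchParams) SketchError!void {
    // The scratch buffer must hold one full dequantized row.
    if (p.row_scratch.len < p.input.len) return error.ScratchTooSmall;
    if (p.input.len != p.cols or p.output.len != p.rows or
        p.weights.len != p.rows * p.cols)
    {
        return error.ShapeMismatch;
    }

    for (0..p.rows) |r| {
        const row = p.row_scratch[0..p.cols];
        // Placeholder for dequantization: copy one row into the scratch buffer.
        @memcpy(row, p.weights[r * p.cols ..][0..p.cols]);

        // Scalar dot product of the row with the input vector.
        var acc: f32 = 0;
        for (row, p.input) |w, x| acc += w * x;

        if (p.accumulate) {
            p.output[r] += acc;
        } else {
            p.output[r] = acc;
        }
    }
}

test "runSketch computes W * input" {
    const w = [_]f32{ 1, 2, 3, 4 }; // [[1, 2], [3, 4]]
    const x = [_]f32{ 1, 1 };
    var out = [_]f32{ 0, 0 };
    var scratch: [2]f32 = undefined;
    try runSketch(.{
        .weights = &w,
        .rows = 2,
        .cols = 2,
        .input = &x,
        .output = &out,
        .row_scratch = &scratch,
    });
    try std.testing.expectEqual(@as(f32, 3), out[0]);
    try std.testing.expectEqual(@as(f32, 7), out[1]);
}
```

Because only one row is ever materialized, the scratch buffer needs room for a single row of weights rather than the whole dequantized matrix, which is presumably why the size check is made against `input`.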