Last updated: 2026-06-12
Inference Runtime
Lm Head
T-CPU LM_HEAD implementation.
Projects hidden state through a GGUF output matrix and writes logits.
2 exports shown
struct
Params
pub const Params = struct Inputs and outputs for one LM_HEAD call.
function
run
pub fn run(params: Params) !void Project the hidden state through every row of the GGUF output matrix to produce vocab logits.
Rows are dequantized one at a time into `row_scratch` and dot-multiplied with `hidden`. `row_scratch` is not exactly `hidden.len`, otherwise void.