No description
Find a file
Snider 98749c66f2 fix(mlx): add DecodeToken for correct streaming word boundaries
The Decode method strips the SentencePiece leading space from every
token, which loses word boundaries during streaming. DecodeToken
preserves the space (it represents the word boundary) and only the
first token of each generation has its leading space stripped.

Fixes Gemma3 space prefix appearing in chat UI output.

Co-Authored-By: Virgil <virgil@lethean.io>
2026-02-17 19:18:27 +00:00
agentic test: validate MLX inference and scoring pipeline on M3 Ultra 2026-02-16 17:24:36 +00:00
ai feat: extract AI/ML packages from core/go 2026-02-16 15:25:55 +00:00
mcp test: validate MLX inference and scoring pipeline on M3 Ultra 2026-02-16 17:24:36 +00:00
ml fix(mlx): add DecodeToken for correct streaming word boundaries 2026-02-17 19:18:27 +00:00
mlx fix(mlx): add DecodeToken for correct streaming word boundaries 2026-02-17 19:18:27 +00:00
rag test: validate MLX inference and scoring pipeline on M3 Ultra 2026-02-16 17:24:36 +00:00
go.mod test: validate MLX inference and scoring pipeline on M3 Ultra 2026-02-16 17:24:36 +00:00
go.sum test: validate MLX inference and scoring pipeline on M3 Ultra 2026-02-16 17:24:36 +00:00
test-mlx.go test: validate MLX inference and scoring pipeline on M3 Ultra 2026-02-16 17:24:36 +00:00
TEST-RESULTS.md test: validate MLX inference and scoring pipeline on M3 Ultra 2026-02-16 17:24:36 +00:00