LEM/pkg at d2cf891f15de2311cd00b61aa0e5003b781a0167 - lthn/LEM

Snider d2cf891f15 feat: add mlx_lm subprocess backend and distill improvements

- Add backend_mlxlm.go blank import to register mlx-lm subprocess backend
- Select backend from ai.yaml config (metal, mlx_lm, rocm, api)
- Only set Metal cache/memory limits when using metal backend
- Add --no-dedup flag to disable grammar-profile deduplication
  (trained models with consistent voice trigger false positives at 0.02)
- Add --context-len flag and context_len config for KV cache sizing
- Pass WithBackend and WithContextLen to go-ml backend loader

Co-Authored-By: Virgil <virgil@lethean.io>

2026-02-23 18:37:12 +00:00
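
The option handling described in this commit can be sketched roughly as follows. This is a minimal illustration, not the code in the repository: the `Options` type, `parseOptions`, `needsMetalLimits`, and the default values are all hypothetical, and the real change passes `WithBackend` and `WithContextLen` to the go-ml loader, whose signatures are not shown here and are therefore not reproduced. The sketch assumes the backend name comes from configuration (standing in for ai.yaml), while `--no-dedup` and `--context-len` are command-line flags that override config defaults.

```go
package main

import (
	"flag"
	"fmt"
	"os"
)

// Options is a hypothetical stand-in for the distill settings named in the commit.
type Options struct {
	Backend    string // backend name from config: metal, mlx_lm, rocm, or api
	ContextLen int    // context length used for KV cache sizing; 0 is assumed to mean "backend default"
	NoDedup    bool   // true disables grammar-profile deduplication
}

// validBackends lists the backend names mentioned in the commit message.
var validBackends = map[string]bool{
	"metal":  true,
	"mlx_lm": true,
	"rocm":   true,
	"api":    true,
}

// parseOptions overlays command-line flags on defaults that would come from config.
func parseOptions(defaults Options, args []string) (Options, error) {
	opts := defaults
	fs := flag.NewFlagSet("distill", flag.ContinueOnError)
	fs.IntVar(&opts.ContextLen, "context-len", defaults.ContextLen, "context length for KV cache sizing")
	fs.BoolVar(&opts.NoDedup, "no-dedup", defaults.NoDedup, "disable grammar-profile deduplication")
	if err := fs.Parse(args); err != nil {
		return Options{}, err
	}
	if !validBackends[opts.Backend] {
		return Options{}, fmt.Errorf("unknown backend %q", opts.Backend)
	}
	if opts.ContextLen < 0 {
		return Options{}, fmt.Errorf("context length must not be negative, got %d", opts.ContextLen)
	}
	return opts, nil
}

// needsMetalLimits reports whether Metal cache/memory limits should be applied.
// Per the commit message, that is only the case for the metal backend.
func needsMetalLimits(opts Options) bool {
	return opts.Backend == "metal"
}

func main() {
	// Assumed defaults; in the real tool these would be read from ai.yaml.
	defaults := Options{Backend: "mlx_lm", ContextLen: 4096}

	opts, err := parseOptions(defaults, os.Args[1:])
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(2)
	}
	fmt.Printf("backend=%s context_len=%d no_dedup=%t metal_limits=%t\n",
		opts.Backend, opts.ContextLen, opts.NoDedup, needsMetalLimits(opts))
}
```

Keeping the Metal check in one small predicate means the subprocess, ROCm, and API backends never touch Metal-specific limits, which matches the intent stated in the third bullet.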
lem	feat: add mlx_lm subprocess backend and distill improvements	2026-02-23 18:37:12 +00:00
