go-mlx/TODO.md at f2ca7fe188131fc7840bbddfd330e66d61d02540

context.Context on TextModel.Generate() — Generate(ctx context.Context, prompt string, opts ...GenerateOption) iter.Seq[Token]. Checks ctx.Done() in the decode loop.

Err() error on TextModel — Distinguishes normal stop (EOS, max tokens) from errors (OOM, ctx cancelled).

Chat() on TextModel — Model owns its chat template. Gemma3 and Qwen3 templates implemented.

Memory control functions at root — SetCacheLimit, SetMemoryLimit, GetActiveMemory, GetPeakMemory, ClearCache delegate to internal/metal.

Backend registration — register_metal.go auto-registers via build-tagged init().

All CGO moved to internal/metal/ — 19 source files, 10 test files, 148 tests passing.

Public API: TextModel, Backend, functional options — Clean root package, compiles on all platforms.

Integration tests — 7 tests for public API (backend registration, options, LoadModel paths).

Error handling audit — ✅ checkError() replaced with lastError() error (reads + clears C-level error string). Added Eval(...*Array) error and EvalAsync(...*Array) error as error-returning variants of Materialize. Generate loop propagates errors via m.lastErr. LoadAllSafetensors returns (map, error). Model loaders (gemma3, qwen3) check lastError() after safetensors load. grad.go/lora.go now surface real MLX error messages. 4 new tests in error_test.go.

Memory management — deterministic cleanup — Close() stub in place. CLion Claude confirmed mlx_array_free() is safe on graph-referenced arrays (refcounted via shared_ptr). Double-free is UB. Can now implement per-step cleanup.

Documentation — Public API has godoc but needs examples for common workflows.

7.3 KiB

Raw Blame History

TODO.md — go-mlx Task Queue

Phase 1: Standalone Package Hardening

Phase 2: Model Support

Phase 3: Training Pipeline

Phase 4: Backend Abstraction — ✅ COMPLETE (19 Feb 2026)

Phase 5: Ecosystem Integration (Virgil wishlist)

Phase 6: Go 1.26 Modernisation

go-inference Integration — ✅ COMPLETE (19 Feb 2026)

Upstream Dependencies

Functional Options Convention

core/go/pkg/process (for mlxlm backend, Phase 5)

Workflow

7.3 KiB Raw Blame History Unescape Escape