go-rocm

Author	SHA1	Message	Date
Snider	4669cc503d	refactor: replace fmt.Errorf/errors.New with coreerr.E() Some checks failed Security Scan / security (push) Successful in 8s Details Test / Vet & Build (push) Failing after 23s Details Co-Authored-By: Virgil <virgil@lethean.io>	2026-03-16 21:08:52 +00:00
Claude	b03f357f5d	feat: implement Classify, BatchGenerate, Info, Metrics on rocmModel Some checks failed Security Scan / security (push) Successful in 10s Details Test / Vet & Build (push) Failing after 34s Details Brings rocmModel into compliance with the updated inference.TextModel interface from go-inference. - Classify: simulates prefill-only via max_tokens=1, temperature=0 - BatchGenerate: sequential autoregressive per prompt via /v1/completions - Info: populates ModelInfo from GGUF metadata (architecture, layers, quant) - Metrics: captures timing + VRAM usage via sysfs after each operation - Refactors duplicate server-exit error handling into setServerExitErr() - Adds timing instrumentation to existing Generate and Chat methods Co-Authored-By: Virgil <virgil@lethean.io>	2026-02-24 18:50:37 +00:00
Claude	add2ba1dbd	chore: sync workspace dependency versions Run go work sync to align dependency versions across workspace. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 21:41:04 +00:00
Claude	3c756771ec	feat: llamacpp health check client Add internal/llamacpp package with Client type and Health() method. Client communicates with llama-server via HTTP; Health checks the /health endpoint and reports readiness. Foundation type for the streaming methods (Tasks 2-3). Co-Authored-By: Virgil <virgil@lethean.io> Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 20:50:36 +00:00

4 commits