Commit graph

  • ed9bdd1306 fix: migrate module paths from forge.lthn.ai to dappco.re (main, v0.0.2, dev) Snider 2026-04-04 16:21:13 +01:00
  • 661d37c5c1 style(ax): rename loop variable e→envEntry for AX naming compliance (ax/review-fixes-2) Claude 2026-03-31 08:25:10 +01:00
  • 3073c019f8 feat(ax): pass 1 — remove banned imports, AX naming, test coverage Claude 2026-03-31 08:24:58 +01:00
  • 523abc6509 feat(ax): pass 2 — replace banned imports, rename variables, add AX comments Claude 2026-03-31 08:24:34 +01:00
  • 41b34b6779 feat(ax): apply RFC-025 AX compliance review (ax/review-fixes) Claude 2026-03-31 07:33:47 +01:00
  • 2fa87bfeb6 Merge pull request '[agent/claude:opus] DX audit and fix. 1) Review CLAUDE.md — update any outdate...' (#1) from agent/dx-audit-and-fix--1--review-claude-md into main Virgil 2026-03-17 08:50:45 +00:00
  • 9aaa404397 fix(dx): audit coding standards and add tests for untested paths Snider 2026-03-17 08:50:17 +00:00
  • 5dc79971e2 chore: sync dependencies for v0.0.1 (v0.0.1) Snider 2026-03-16 22:20:49 +00:00
  • 4669cc503d refactor: replace fmt.Errorf/errors.New with coreerr.E() Snider 2026-03-16 21:08:52 +00:00
  • c0b7485129 docs: archive completed phase 1-4 plans Claude 2026-02-24 19:42:07 +00:00
  • 402bdc2205 test: add integration tests for Classify, BatchGenerate, Info, Metrics Claude 2026-02-24 18:52:10 +00:00
  • b03f357f5d feat: implement Classify, BatchGenerate, Info, Metrics on rocmModel Claude 2026-02-24 18:50:37 +00:00
  • 197c537e9f ci: add Forgejo Actions test and security scan workflows Claude 2026-02-23 03:28:08 +00:00
  • add2ba1dbd chore: sync workspace dependency versions Claude 2026-02-22 21:41:04 +00:00
  • 76b843e116 docs: add README with quick start and docs links Snider 2026-02-20 15:11:26 +00:00
  • 7915f7ad3c docs: graduate TODO/FINDINGS into production documentation Snider 2026-02-20 15:03:17 +00:00
  • 61a95e4d4f docs: Phase 4 complete — benchmarks, flash attention, parallel slots Claude 2026-02-19 23:22:04 +00:00
  • 870ee232bf feat: benchmark suite for decode speed, TTFT, and concurrent throughput Claude 2026-02-19 23:16:40 +00:00
  • 72120bb200 feat: pass --parallel N to llama-server for concurrent inference slots Claude 2026-02-19 23:13:19 +00:00
  • 4b6cffb9c4 docs: Phase 4 performance implementation plan Claude 2026-02-19 23:11:30 +00:00
  • 31bf0e8850 docs: Phase 4 performance design Claude 2026-02-19 23:09:56 +00:00
  • d7db2d6e95 docs: Phase 3 complete — GGUF metadata, discovery, auto context Claude 2026-02-19 22:24:52 +00:00
  • 2c77f6f968 feat: use GGUF metadata for model type and context window auto-detection Claude 2026-02-19 22:23:07 +00:00
  • af235653ca feat: model discovery scanning directories for GGUF files Claude 2026-02-19 22:21:48 +00:00
  • c7c9389749 feat: GGUF metadata parser for model discovery Claude 2026-02-19 22:15:10 +00:00
  • f8b091f511 docs: Phase 3 model support implementation plan Claude 2026-02-19 22:12:31 +00:00
  • 2454761e34 docs: Phase 3 model support design — approved Claude 2026-02-19 22:04:18 +00:00
  • 34f02fdcd8 docs: Phase 2 complete — robustness features implemented Claude 2026-02-19 21:54:34 +00:00
  • a6e647c5b7 test: graceful shutdown and concurrent request integration tests Claude 2026-02-19 21:50:47 +00:00
  • 954c57071a fix: clamp VRAM Free to prevent uint64 underflow Claude 2026-02-19 21:48:19 +00:00
  • 501de83d3b feat: VRAM monitoring via sysfs with dGPU auto-detection Claude 2026-02-19 21:45:02 +00:00
  • b7342ec819 fix: only retry startServer on process exit, not timeout Claude 2026-02-19 21:43:06 +00:00
  • c50a8e9e9b feat: retry port selection in startServer on process failure Claude 2026-02-19 21:40:05 +00:00
  • c07f37afe9 fix: guard nil exitErr wrapping, document concurrency invariant Claude 2026-02-19 21:38:01 +00:00
  • 2c4966e652 feat: detect server crash before Generate/Chat calls Claude 2026-02-19 21:34:46 +00:00
  • d963cbf787 docs: Phase 2 robustness implementation plan Claude 2026-02-19 21:31:24 +00:00
  • 2f743c5772 docs: Phase 2 robustness design — approved Claude 2026-02-19 21:26:41 +00:00
  • 6744a7c78f docs: mark Phase 1 tasks complete Claude 2026-02-19 21:16:11 +00:00
  • 0e68d71c8a test: integration tests for full ROCm inference pipeline Claude 2026-02-19 21:15:02 +00:00
  • 1d8d65f55b feat: Backend Available() and LoadModel() with GPU detection Claude 2026-02-19 21:12:02 +00:00
  • a8c494771d feat: TextModel implementation wrapping llama-server Claude 2026-02-19 21:11:55 +00:00
  • 9aa7f624ba feat: server lifecycle and helpers for llama-server subprocess Claude 2026-02-19 21:08:07 +00:00
  • 5778f1f011 fix: guard response body lifecycle in SSE streaming client Claude 2026-02-19 21:04:02 +00:00
  • 1bc8c9948b test: completion streaming tests for llamacpp client Claude 2026-02-19 20:59:21 +00:00
  • def3167199 feat: llamacpp SSE streaming client for chat completions Claude 2026-02-19 20:58:46 +00:00
  • d5a92c7212 fix: health check includes response body in errors, adds 503 test Claude 2026-02-19 20:54:52 +00:00
  • 3c756771ec feat: llamacpp health check client Claude 2026-02-19 20:50:36 +00:00
  • 9dda860df4 docs: incorporate Charon review — safer serverEnv() filtering Claude 2026-02-19 20:47:16 +00:00
  • 78e244f26f docs: Phase 1 plan review — approved with notes Claude 2026-02-19 20:44:37 +00:00
  • ff9cf550e8 docs: flag Token.ID and StopTokens interface questions for Virgil Claude 2026-02-19 20:41:53 +00:00
  • acf79e3351 docs: Phase 1 implementation plan Claude 2026-02-19 20:38:11 +00:00
  • 34407a69ca docs: Phase 1 core implementation design Claude 2026-02-19 20:32:22 +00:00
  • 68bc7300aa docs: Phase 0 complete — environment validated, llama-server built Claude 2026-02-19 19:57:14 +00:00
  • aa42cff417 feat: scaffold go-rocm AMD GPU inference package Snider 2026-02-19 19:39:40 +00:00
  • 252e28e81e Initial commit Virgil 2026-02-19 19:35:55 +00:00