No description
Find a file
Snider 6f6844a8a5 test: Phase 5 — testing gaps (process/RAG/ML mocks, metrics bench)
Process tools CI tests:
- Full lifecycle tests using real process.Service with echo/sleep/cat/pwd/env
- Handler validation: empty command, empty ID, nonexistent ID, empty input
- Start → list → output → kill → list lifecycle test
- Working directory and environment variable passthrough tests
- stdin/stdout round-trip via cat process

RAG tools mock tests:
- Handler validation: empty question, empty path, nonexistent path
- Default collection and topK application verification
- Graceful error when Qdrant/Ollama unavailable (no panic)
- Expanded struct round-trip tests for all RAG types

ML tools mock tests:
- Mock ml.Backend for Generate/Chat without real inference
- Mock inference.Backend for registry testing
- Handler validation: empty prompt, empty response, missing backend
- Heuristic scoring without live services
- Semantic scoring fails gracefully without judge
- Content suite redirects to ml_probe
- Capability probes run against mock backend (23 probes)
- ml_backends lists mock inference registry entries

Metrics benchmarks:
- BenchmarkMetricsRecord: ~22μs/op single-threaded
- BenchmarkMetricsRecord_Parallel: ~13μs/op with 32 goroutines
- BenchmarkMetricsQuery_10K: ~15ms/op reading 10K JSONL events
- BenchmarkMetricsQuery_50K: ~75ms/op reading 50K JSONL events
- BenchmarkMetricsSummary_10K: ~235μs/op aggregating 10K events
- TestMetricsRecordAndRead_10K_Good: write+read+summarise 10K events

Co-Authored-By: Virgil <virgil@lethean.io>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-20 06:48:31 +00:00
ai test: Phase 5 — testing gaps (process/RAG/ML mocks, metrics bench) 2026-02-20 06:48:31 +00:00
mcp test: Phase 5 — testing gaps (process/RAG/ML mocks, metrics bench) 2026-02-20 06:48:31 +00:00
.gitignore chore: update module paths and add gitignore 2026-02-17 19:19:38 +00:00
CLAUDE.md fix: post-split cleanup — remove test script, fix tests, update docs 2026-02-20 01:18:27 +00:00
FINDINGS.md docs: add CLAUDE.md, TODO.md, FINDINGS.md for fleet delegation 2026-02-19 21:34:55 +00:00
go.mod feat(ml): Phase 2 — migrate ml_backends to go-inference registry 2026-02-20 06:01:30 +00:00
go.sum fix: post-split cleanup — remove test script, fix tests, update docs 2026-02-20 01:18:27 +00:00
TEST-RESULTS.md test: validate MLX inference and scoring pipeline on M3 Ultra 2026-02-16 17:24:36 +00:00
TODO.md test: Phase 5 — testing gaps (process/RAG/ML mocks, metrics bench) 2026-02-20 06:48:31 +00:00