Process tools CI tests:
- Full lifecycle tests using real process.Service with echo/sleep/cat/pwd/env
- Handler validation: empty command, empty ID, nonexistent ID, empty input
- Start → list → output → kill → list lifecycle test
- Working directory and environment variable passthrough tests
- stdin/stdout round-trip via cat process

RAG tools mock tests:
- Handler validation: empty question, empty path, nonexistent path
- Default collection and topK application verification
- Graceful error when Qdrant/Ollama unavailable (no panic)
- Expanded struct round-trip tests for all RAG types

ML tools mock tests:
- Mock ml.Backend for Generate/Chat without real inference
- Mock inference.Backend for registry testing
- Handler validation: empty prompt, empty response, missing backend
- Heuristic scoring without live services
- Semantic scoring fails gracefully without judge
- Content suite redirects to ml_probe
- Capability probes run against mock backend (23 probes)
- ml_backends lists mock inference registry entries

Metrics benchmarks:
- BenchmarkMetricsRecord: ~22μs/op single-threaded
- BenchmarkMetricsRecord_Parallel: ~13μs/op with 32 goroutines
- BenchmarkMetricsQuery_10K: ~15ms/op reading 10K JSONL events
- BenchmarkMetricsQuery_50K: ~75ms/op reading 50K JSONL events
- BenchmarkMetricsSummary_10K: ~235μs/op aggregating 10K events
- TestMetricsRecordAndRead_10K_Good: write+read+summarise 10K events

Co-Authored-By: Virgil <virgil@lethean.io>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
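The stdin/stdout round-trip via a cat process, together with the empty-input validation, can be sketched with the standard library alone. This is an illustration under stated assumptions, not the repository's code: it uses `os/exec` in place of `process.Service` (whose API is not shown here), the helper name `roundTripCat` is hypothetical, and it assumes a Unix-like system with `cat` on the `PATH`.

```go
package main

import (
	"bytes"
	"errors"
	"fmt"
	"io"
	"os/exec"
)

// roundTripCat is a hypothetical helper (not part of the repository).
// It starts `cat`, writes input to its stdin, closes stdin so that cat
// sees EOF, and returns everything cat wrote to stdout.
func roundTripCat(input string) (string, error) {
	// Mirrors the "empty input" handler validation listed in the commit;
	// the real handler's error text is an unknown, so this one is made up.
	if input == "" {
		return "", errors.New("input is empty")
	}

	cmd := exec.Command("cat")
	stdin, err := cmd.StdinPipe()
	if err != nil {
		return "", fmt.Errorf("stdin pipe: %w", err)
	}
	var stdout bytes.Buffer
	cmd.Stdout = &stdout

	if err := cmd.Start(); err != nil {
		return "", fmt.Errorf("start: %w", err)
	}

	// Write the input, then close stdin; cat exits once it reads EOF.
	_, writeErr := io.WriteString(stdin, input)
	closeErr := stdin.Close()

	// Always reap the process, even if the write failed.
	if err := cmd.Wait(); err != nil {
		return "", fmt.Errorf("wait: %w", err)
	}
	if writeErr != nil {
		return "", fmt.Errorf("write: %w", writeErr)
	}
	if closeErr != nil {
		return "", fmt.Errorf("close stdin: %w", closeErr)
	}
	return stdout.String(), nil
}
```

A test built on this helper would pass a known string and compare it with the returned output, and would separately check that an empty string is rejected without starting a process.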
| Name |
|---|
| ai |
| mcp |
| .gitignore |
| CLAUDE.md |
| FINDINGS.md |
| go.mod |
| go.sum |
| TEST-RESULTS.md |
| TODO.md |