Claude
|
f5f1e68c5c
|
feat(bench): add Phase 4 GPU benchmarks on RX 7800 XT
Benchmarks for embedding (Ollama ROCm), chunking (pure CPU), and search
(Qdrant) latency. Key results: 97 embeds/sec single (10.3ms), Qdrant
search 152µs (6.5K QPS), chunking 11µs per 50-section doc.
EmbedBatch confirmed sequential — Ollama has no batch API.
Co-Authored-By: Charon <developers@lethean.io>
|
2026-02-20 11:51:25 +00:00 |
|