Commit graph

1 commit

Author SHA1 Message Date
Claude
f5f1e68c5c
feat(bench): add Phase 4 GPU benchmarks on RX 7800 XT
Benchmarks for embedding (Ollama ROCm), chunking (pure CPU), and search
(Qdrant) latency. Key results: 97 embeds/sec single (10.3ms), Qdrant
search 152µs (6.5K QPS), chunking 11µs per 50-section doc.

EmbedBatch confirmed sequential — Ollama has no batch API.

Co-Authored-By: Charon <developers@lethean.io>
2026-02-20 11:51:25 +00:00