Move both plans to docs/plans/completed/ with summaries. MLX backend implements shared interfaces and batch inference at 5K sentences/sec. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| 2026-02-19-backend-abstraction-design-original.md | ||
| 2026-02-19-backend-abstraction-plan-original.md | ||
| 2026-02-19-batch-inference-design-original.md | ||
| backend-abstraction.md | ||
| batch-inference.md | ||