Token counting, model quotas, and sliding window rate limiter
Add 4 concurrent race tests: multi-model (5 models), Persist+Load filesystem race, AllStats+RecordUsage, WaitForCapacity+RecordUsage. Add 4 benchmarks: CanSendWithPrune, Stats, AllStats, Persist. Total: 80 tests, 7 benchmarks, all pass under go test -race. Co-Authored-By: Virgil <virgil@lethean.io> |
||
|---|---|---|
| CLAUDE.md | ||
| FINDINGS.md | ||
| go.mod | ||
| go.sum | ||
| ratelimit.go | ||
| ratelimit_test.go | ||
| TODO.md | ||