# CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

## Overview

Provider-agnostic sliding window rate limiter for LLM API calls. Single Go package (no sub-packages) with two persistence backends: YAML (single-process, default) and SQLite (multi-process, WAL mode). Enforces RPM, TPM, and RPD quotas per model. Ships default profiles for Gemini, OpenAI, Anthropic, and Local providers.

Module: `forge.lthn.ai/core/go-ratelimit` — Go 1.26, no CGO required.

## Commands

```bash
go test ./...                                              # run all tests
go test -race ./...                                        # race detector (required before commit)
go test -v -run TestCanSend ./...                          # single test
go test -v -run "TestCanSend/RPM_at_exact_limit" ./...     # single subtest
go test -bench=. -benchmem ./...                           # benchmarks
go vet ./...                                               # vet check
golangci-lint run ./...                                    # lint
```

Pre-commit gate: `go test -race ./...` and `go vet ./...` must both pass.

## Standards

- **UK English** everywhere: colour, organisation, serialise, initialise, behaviour
- **Conventional commits**: `type(scope): description` — scopes: `ratelimit`, `sqlite`, `persist`, `config`
- **Co-Author line** on every commit: `Co-Authored-By: Virgil <virgil@lethean.io>`
- **Coverage** must not drop below 95%
- **Error format**: `coreerr.E("ratelimit.FunctionName", "what", err)` via `go-log` — lowercase, no trailing punctuation
- **No `init()` functions**, no global mutable state
- **Mutex discipline**: lock at the top of public methods, never inside helpers. Helpers that need the lock document "Caller must hold the lock". `prune()` mutates state, so even "read-only" methods that call it take the write lock. Never call a public method from another public method while holding the lock.

## Architecture

All code lives in the root package. Key files:

- `ratelimit.go` — core types (`RateLimiter`, `ModelQuota`, `UsageStats`, `Config`, `Provider`), sliding window logic (`prune`, `CanSend`, `RecordUsage`), YAML persistence, `CountTokens` (Gemini-specific), iterators (`Models`, `Iter`)
- `sqlite.go` — `sqliteStore` internal type, schema creation, load/save for quotas and state

Constructor matrix: `New()` / `NewWithConfig()` for YAML, `NewWithSQLite()` / `NewWithSQLiteConfig()` for SQLite. Always `defer rl.Close()` with SQLite.

### Sliding window

The 1-minute window is pruned on every `CanSend`/`Stats`/`RecordUsage` call. The daily counter is a rolling 24-hour window that starts at the first request, not at a calendar boundary. `prune()` also garbage-collects empty state entries to prevent memory leaks.

## Test Organisation

White-box tests (`package ratelimit`), all assertions via `testify` (`require` for fatal, `assert` for non-fatal). Do not use `t.Error`/`t.Fatal` directly.

| File | Scope |
|------|-------|
| `ratelimit_test.go` | Core logic, provider profiles, concurrency, benchmarks |
| `sqlite_test.go` | SQLite backend, migration, concurrent persistence |
| `error_test.go` | Error paths for SQLite and YAML |
| `iter_test.go` | Iterators, `CountTokens` edge cases |

SQLite tests use `_Good`/`_Bad`/`_Ugly` suffixes (happy path / expected errors / edge cases). Core tests use plain descriptive names with table-driven subtests. Use `t.TempDir()` for all file paths.

## Dependencies

Five direct dependencies — do not add more without justification:

- `forge.lthn.ai/core/go-io` — file I/O abstraction
- `forge.lthn.ai/core/go-log` — structured error handling (`coreerr.E`)
- `gopkg.in/yaml.v3` — YAML backend
- `modernc.org/sqlite` — pure Go SQLite (no CGO)
- `github.com/stretchr/testify` — test-only

## Docs

- `docs/architecture.md` — sliding window algorithm, provider quotas, YAML/SQLite backends, concurrency model
- `docs/development.md` — prerequisites, test patterns, coding standards
- `docs/history.md` — completed phases with commit hashes, known limitations