go-ratelimit/README.md

[![Go Reference](https://pkg.go.dev/badge/forge.lthn.ai/core/go-ratelimit.svg)](https://pkg.go.dev/forge.lthn.ai/core/go-ratelimit)
[![License: EUPL-1.2](https://img.shields.io/badge/License-EUPL--1.2-blue.svg)](LICENSE.md)
[![Go Version](https://img.shields.io/badge/Go-1.26-00ADD8?style=flat&logo=go)](go.mod)

# go-ratelimit

Provider-agnostic sliding window rate limiter for LLM API calls. Enforces requests per minute (RPM), tokens per minute (TPM), and requests per day (RPD) quotas per model using an in-memory sliding window. Ships with default quota profiles for Gemini, OpenAI, Anthropic, and a local inference provider. State persists across process restarts via YAML (single-process) or SQLite (multi-process, WAL mode). Includes a Gemini-specific token counting helper and a YAML-to-SQLite migration path.

**Module**: `forge.lthn.ai/core/go-ratelimit`
**Licence**: EUPL-1.2
**Language**: Go 1.25

## Quick Start

```go
import "forge.lthn.ai/core/go-ratelimit"

// YAML backend (default, single-process)
rl, err := ratelimit.New()

// SQLite backend (multi-process)
rl, err := ratelimit.NewWithSQLite("~/.core/ratelimits.db")
defer rl.Close()

ok, reason := rl.CanSend("gemini-2.0-flash", 1500)
if ok {
    rl.RecordUsage("gemini-2.0-flash", 1500)
}
```

## Documentation

- [Architecture](docs/architecture.md) — sliding window algorithm, provider quotas, YAML and SQLite backends
- [Development Guide](docs/development.md) — prerequisites, test patterns, coding standards
- [Project History](docs/history.md) — completed phases with commit hashes, known limitations

## Build & Test

```bash
go test ./...
go test -race ./...
go vet ./...
go build ./...
```

## Licence

European Union Public Licence 1.2 — see [LICENCE](LICENCE) for details.
chore: add Go repo norms (badges, contributing, lint, taskfile, editorconfig) Co-Authored-By: Virgil <virgil@lethean.io> 2026-02-23 06:45:45 +00:00			`[![Go Reference](https://pkg.go.dev/badge/forge.lthn.ai/core/go-ratelimit.svg)](https://pkg.go.dev/forge.lthn.ai/core/go-ratelimit)`
			`[![License: EUPL-1.2](https://img.shields.io/badge/License-EUPL--1.2-blue.svg)](LICENSE.md)`
			`[![Go Version](https://img.shields.io/badge/Go-1.26-00ADD8?style=flat&logo=go)](go.mod)`

docs: add README with quick start and docs links Co-Authored-By: Virgil <virgil@lethean.io> 2026-02-20 15:11:19 +00:00			`# go-ratelimit`

			`Provider-agnostic sliding window rate limiter for LLM API calls. Enforces requests per minute (RPM), tokens per minute (TPM), and requests per day (RPD) quotas per model using an in-memory sliding window. Ships with default quota profiles for Gemini, OpenAI, Anthropic, and a local inference provider. State persists across process restarts via YAML (single-process) or SQLite (multi-process, WAL mode). Includes a Gemini-specific token counting helper and a YAML-to-SQLite migration path.`

			Module: `forge.lthn.ai/core/go-ratelimit`
			`Licence: EUPL-1.2`
			`Language: Go 1.25`

			`## Quick Start`

			```go
			`import "forge.lthn.ai/core/go-ratelimit"`

			`// YAML backend (default, single-process)`
			`rl, err := ratelimit.New()`

			`// SQLite backend (multi-process)`
			`rl, err := ratelimit.NewWithSQLite("~/.core/ratelimits.db")`
			`defer rl.Close()`

			`ok, reason := rl.CanSend("gemini-2.0-flash", 1500)`
			`if ok {`
			`rl.RecordUsage("gemini-2.0-flash", 1500)`
			`}`
			```

			`## Documentation`

			`- [Architecture](docs/architecture.md) — sliding window algorithm, provider quotas, YAML and SQLite backends`
			`- [Development Guide](docs/development.md) — prerequisites, test patterns, coding standards`
			`- [Project History](docs/history.md) — completed phases with commit hashes, known limitations`

			`## Build & Test`

			```bash
			`go test ./...`
			`go test -race ./...`
			`go vet ./...`
			`go build ./...`
			```

			`## Licence`

			`European Union Public Licence 1.2 — see [LICENCE](LICENCE) for details.`