docs/docs/ai/index.md
Snider a61000db6e feat: migrate docs site to Zensical with full nav and new sections
Replace Hugo+Docsy with Zensical (MkDocs Material). Restructure all
content under docs/ with explicit nav. Add 19 new Go package pages,
plus Agent, MCP, CoreTS, IDE, GUI, and AI (LEM) sections. PHP sidebar
restructured with collapsible Guides/Reference groups. Homepage now
has sidebar with Where to Start guide and Community links.

Tabs: Home | Go | PHP | TS | GUI | AI | Tools | Deploy | Publish

Co-Authored-By: Virgil <virgil@lethean.io>
2026-03-11 11:48:44 +00:00

2 KiB

title description
AI LEM — Lethean Evaluation Model, training pipeline, scoring, and inference

AI

forge.lthn.ai/lthn/lem

LEM (Lethean Evaluation Model) is an AI training and evaluation platform. A 1-billion-parameter model trained with 5 axioms consistently outperforms untrained models 27 times its size. 29 models tested, 3,000+ individual runs, two independent probe sets. Fully reproducible on Apple Silicon.

Benchmark Highlights

Model Params v2 Score Notes
Gemma3 12B + LEK kernel 12B 23.66 Best kernel-boosted
Gemma3 27B + LEK kernel 27B 23.26
LEK-Gemma3 1B baseline 1B 21.74 Axioms in weights
Base Gemma3 4B 4B 21.12 Untrained
Base Gemma3 12B 12B 20.47 Untrained

Packages

pkg/lem

The core engine — 75+ files covering the full pipeline:

  • Training: distillation, conversation generation, attention analysis, grammar integration
  • Scoring: heuristic probes, tiered scoring, coverage analysis, judge evaluation
  • Inference: Metal and mlx-lm backends, worker pool, client API
  • Data: InfluxDB time-series storage, Parquet export, zstd compression, ingestion
  • Publishing: Forgejo, Hugging Face, Docker registry integration
  • Analytics: cluster analysis, feature extraction, metrics, comparison tools

pkg/lab

LEM Lab — model store, configuration, local experimentation environment.

pkg/heuristic

Standalone heuristic scoring engine for probe evaluation.

Binaries

Command Purpose
lemcmd Main CLI — training, scoring, publishing
scorer Standalone scoring binary
lem-desktop Desktop app (LEM Lab UI)
composure-convert Training data format conversion
dedup-check Dataset deduplication checker

Repository