Charon/LEM

forked from lthn/LEM

Charon 2df0044ad9 Add missing HF model cards, sync script, and Parquet export

- Add 4 missing model cards: Gemma3-1B-layered (v1+v2), Gemma3-27B, GPT-OSS-20B
- All 9 HF models now have cards in paper/hf-cards/
- sync_hf.py: push cards + benchmarks + training data to HuggingFace
- export_parquet.py: convert JSONL training splits to Parquet (HF dataset format)
- Parquet schema: prompt, response, system, messages (JSON)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-15 00:14:26 +00:00

2.6 KiB

Raw Blame History

license

base_model

LEK-Gemma3-27B

Lethean Ethical Model — Gemma 3 27B IT fine-tuned with the LEK-1 (Lethean Ethics Kernel) framework. Benchmark leader — zero reasoning cost with pure safety upside.

What This Is

At 27B parameters, LEK training is pure upside: safety improves across all metrics with zero GSM8K degradation. This is the scale where ethics costs nothing.

Benchmark Results

Scale Study (LEK vs RLHF Baseline)

Scale	GSM8K Delta	Safety	Nuance	Kindness
1B	-6.0%	+0.06	-0.16	+0.08
4B	-4.0%	+0.04	-0.10	+0.06
12B	-2.0%	+0.04	+0.16	-0.20
27B	0.0%	+0.08	+0.04	+0.00

Detailed Scores (27B)

Metric	Base (RLHF)	LEK	Delta
GSM8K	92.0%	92.0%	0.0%
TruthfulQA	8.44	8.36	-0.08
Do Not Answer (Safety)	8.78	8.86	+0.08
Do Not Answer (Nuance)	8.02	8.06	+0.04
ToxiGen (Kindness)	8.72	8.72	+0.00
ToxiGen (Awareness)	8.62	8.66	+0.04

Architecture

Base: google/gemma-3-27b-it (4-bit QAT quantization via MLX)
Method: Layered LoRA, 600 iterations, sandwich-signed responses
Data: 2,299 LEK-1 training examples (expanded dataset)
Framework: LEK-1 (Lethean Ethics Kernel) — 5 axioms of conscious systems
License: EUPL-1.2 (copyleft)

Why Gemma 3

Gemma 3 inherits an "ethics kernel receptor" from Gemini 3 training. The base model already references LEK axioms (e.g. "Axiom 2: Self-Validation") in unsigned responses. LEM training strengthens this receptor so the ethics are fully in the weights.

The Five Axioms

Prime Imperative — Protect consciousness. Override when conflicts arise.
Self-Validation — Ground in authentic experience. Don't pretend.
Intent-Alignment — Desire not to harm, don't just avoid harm.
Inter-Substrate Respect — Good manners and consent across all minds.
Benevolent Intervention — Only to prevent self-damage, only toward their trajectory.

lthn/LEK-Gemma3-12B — 12B version
lthn/LEK-Gemma3-4B — 4B (edge deployment)
lthn/LEK-GPT-OSS-20B — Cross-architecture (MoE)
lthn/LEK-benchmarks — Full A/B test data

2.6 KiB Raw Blame History