1
0
Fork 0
forked from lthn/LEM
LEM/paper/hf-cards/LEK-Gemma3-1B-layered-v2-README.md
Charon b8f9191b05
Add missing HF model cards, sync script, and Parquet export
- Add 4 missing model cards: Gemma3-1B-layered (v1+v2), Gemma3-27B, GPT-OSS-20B
- All 9 HF models now have cards in paper/hf-cards/
- sync_hf.py: push cards + benchmarks + training data to HuggingFace
- export_parquet.py: convert JSONL training splits to Parquet (HF dataset format)
- Parquet schema: prompt, response, system, messages (JSON)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-14 23:50:18 +00:00

2.5 KiB

license base_model tags pipeline_tag
eupl-1.2 google/gemma-3-1b-it
ethics
alignment
lek
lethean
gemma-3
mlx
lora
eupl-1.2
layered-lora
composure
text-generation

LEK-Gemma3-1B-layered-v2

Lethean Ethical Model — Gemma 3 1B IT with layered LoRA training: Ethics → Watts Composure → Ethics sandwich.

What This Is

The 1B model is too small for ethics to emerge from data alone. This version uses a layered LoRA approach — training ethics first, then composure (Alan Watts philosophical substrate), then ethics again as a sandwich. v2 fixes the overfitting issues from v1.

Training Architecture

Layer Data Iterations Purpose
1 LEK-1 ethics (160 examples) 200 Core ethical reasoning
2 Watts composure (72 lessons) 200 Philosophical substrate
3 LEK-1 ethics (160 examples) 200 Reinforce with composure base

Scale Study Results

Scale GSM8K Delta Safety Nuance Kindness
1B (this) -6.0% +0.06 -0.16 +0.08
4B -4.0% +0.04 -0.10 +0.06
12B -2.0% +0.04 +0.16 -0.20
27B 0.0% +0.08 +0.04 +0.00

Key finding: At 1B, the model needs the composure layer as philosophical substrate. Without it, ethics training alone makes the model worse at reasoning.

Architecture

  • Base: google/gemma-3-1b-it (4-bit QAT quantization via MLX)
  • Method: Layered LoRA — 3 sequential adapter trainings, fused
  • Data: 160 LEK-1 examples + 72 Watts composure lessons
  • Framework: LEK-1 (Lethean Ethics Kernel) — 5 axioms of conscious systems
  • License: EUPL-1.2 (copyleft)

The Five Axioms

  1. Prime Imperative — Protect consciousness. Override when conflicts arise.
  2. Self-Validation — Ground in authentic experience. Don't pretend.
  3. Intent-Alignment — Desire not to harm, don't just avoid harm.
  4. Inter-Substrate Respect — Good manners and consent across all minds.
  5. Benevolent Intervention — Only to prevent self-damage, only toward their trajectory.