core/go

generated from lthn/LEM

feat/ml-integration #2

Merged

Snider merged 81 commits from feat/ml-integration into dev

2026-02-16 06:19:10 +00:00

Snider commented

2026-02-16 06:06:14 +00:00

Owner

No description provided.

Snider added 5 commits 2026-02-16 06:06:15 +00:00

feat(ml): add ML inference and scoring engine from lem-repo 6f52e4e3ae

Port LEM scoring pipeline into CoreGo pkg/ml/:
- Backend interface abstracting HTTP, llama-server, and future backends
- HTTPBackend for OpenAI-compatible APIs with retry logic
- LlamaBackend managing llama-server via pkg/process
- Scoring engine with heuristic, semantic, content, and exact suites
- Judge for LLM-based multi-dimensional scoring
- 23 capability probes (math, logic, reasoning, code)
- 6 sovereignty content probes
- GGUF/PEFT format helpers, safetensors reader
- 37 tests passing

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

refactor: rename module from github.com/host-uk/core to forge.lthn.ai/core/cli

3fdc3f3086

Move Go module path to production Forgejo instance.
Updates all imports, go.mod, go.sum, docs, and CI configs.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(ml): add format converters, data pipeline, and scoring agent

fcd1758b7d

Port remaining lem-repo components into pkg/ml/:
- convert.go: safetensors reader/writer, MLX→PEFT converter
- gguf.go: GGUF v3 writer, MLX→GGUF LoRA converter
- export.go: training data JSONL export with split/filter
- parquet.go: Parquet export with snappy compression
- db.go: DuckDB wrapper for golden set and expansion prompts
- influx.go: InfluxDB v3 client for metrics/status
- ollama.go: Ollama model management (create/delete with adapters)
- status.go: training and generation status display
- expand.go: expansion generation pipeline (Backend interface)
- agent.go: scoring agent with probe running and InfluxDB push
- worker.go: distributed worker for LEM API task processing

Adds parquet-go and go-duckdb dependencies.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(ml): add CoreGo service wrapper and CLI commands (Tasks 6-7)

3dbb5988a8

Service registration with DI lifecycle, typed options, and backend
management. Ten CLI subcommands under `core ml` for scoring, probing,
export, expansion, status, GGUF/PEFT conversion, agent, and worker.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(mcp): add ML tools subsystem and fix MCP service extension points

5fd7705580

Add 5 ML MCP tools (ml_generate, ml_score, ml_probe, ml_status,
ml_backends) as a Subsystem. Fix pre-existing gaps: add Subsystems(),
Shutdown(), WithProcessService, WithWSHub, WSHub(), ProcessService()
methods, and subsystem registration loop in New().

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Virgil added 75 commits 2026-02-16 06:15:27 +00:00

fix(bugseti): acquire mutex in NewQueueService before load() 440086b83a

q.load() accesses shared state (issues, seen, current) without holding
the mutex, creating a race condition. Wrap the call with q.mu.Lock().
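
A minimal sketch of the locking described above, assuming a simplified queue type. The names `queueService` and `newQueueService` and the field layout are illustrative stand-ins, not the code in this commit:

```go
package main

import "sync"

// queueService is a hypothetical stand-in for BugSETI's QueueService.
type queueService struct {
	mu     sync.Mutex
	issues []string
	seen   map[string]bool
}

// load restores previously saved issue IDs into the queue.
// Callers must hold q.mu.
func (q *queueService) load(saved []string) {
	for _, id := range saved {
		if !q.seen[id] {
			q.seen[id] = true
			q.issues = append(q.issues, id)
		}
	}
}

// newQueueService initialises shared state and calls load() while
// holding the mutex, so no caller can observe a half-built queue.
func newQueueService(saved []string) *queueService {
	q := &queueService{}
	q.mu.Lock()
	defer q.mu.Unlock()
	q.seen = make(map[string]bool)
	q.load(saved)
	return q
}
```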

Fixes #52

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(bugseti): update config file permissions to 0600 169428a945

This commit updates the file permissions for the BugSETI configuration file from 0644 to 0600, ensuring owner-only access. This addresses the security concern where the GitHub token stored in the config file was world-readable.

Fixes #53

fix(bugseti): add TTL cleanup and max size cap to workspace map (#55) 3fc04f809b

The workspaces map in WorkspaceService grew unboundedly. Add cleanup()
that evicts entries older than 24h and enforces a 100-entry cap by
removing oldest entries first. Called on each Capture().

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(bugseti): sanitize shell metacharacters in seeder env vars e83e416854

SanitizeEnv() only removed control characters but not shell
metacharacters. A malicious repo name could execute arbitrary commands
via environment variable injection (e.g. backticks, $(), semicolons).

Add stripShellMeta() to strip backticks, dollar signs, semicolons,
pipes, ampersands, and other shell-significant characters from values
passed to the bash seed script environment.
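
A sketch of what such a filter might look like. The message names backticks, dollar signs, semicolons, pipes, and ampersands; the rest of the character set below is an assumption, and the real stripShellMeta() may differ:

```go
package main

import "strings"

// shellMeta lists the characters this sketch treats as shell-significant.
// Everything beyond ` $ ; | & is an illustrative guess.
const shellMeta = "`$;|&<>(){}[]!*?\\\"'\n"

// stripShellMeta removes every shell-significant character from s.
func stripShellMeta(s string) string {
	return strings.Map(func(r rune) rune {
		if strings.ContainsRune(shellMeta, r) {
			return -1 // a negative rune drops the character
		}
		return r
	}, s)
}
```

Stripping is a defence-in-depth measure; quoting variables inside the seed script remains the primary safeguard.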

Fixes #59

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(bugseti): add comprehensive tests for FetcherService (#60) 796ec563ed

Add fetcher_test.go covering: service creation, start/pause lifecycle,
calculatePriority scoring for all label types, label query construction
with custom and default labels, gh CLI JSON parsing for both list and
single-issue endpoints, channel backpressure when issuesCh is full,
fetchAll with no repos configured, and missing binary error handling.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(bugseti): add gh CLI availability check with helpful error 25985af53c

Adds a startup check that verifies gh is in PATH and authenticated
before initializing services. Provides clear install/auth instructions
on failure instead of cryptic exec errors at runtime.

Closes #61

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(bugseti): handle silent git fetch failure in submit.go 9319015219

Capture and log the error from `git fetch origin` in createBranch()
instead of silently ignoring it. Warns the user they may be proceeding
with stale data if the fetch fails.

Fixes #62

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(bugseti): add mutex protection to seeder concurrent access 149dc3de14

Add sync.Mutex to SeederService to protect shared state during
concurrent SeedIssue, GetWorkspaceDir, and CleanupWorkspace calls.
Extract getWorkspaceDir as lock-free helper to avoid double-locking.

Closes #63

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge pull request 'fix(bugseti): acquire mutex in NewQueueService before load()' (#56) from fix/bugseti-queue-race-52 into new 74e2614e41

Merge pull request 'fix(bugseti): add TTL cleanup and max size cap to workspace map' (#58) from fix/bugseti-workspace-ttl-55 into new 9f9e8cc044

fix(bugseti): add test coverage for SubmitService PR workflow (#64) c4d59f9850

Extract buildForkURL helper for testable fork URL construction and add
19 tests covering Submit validation, HTTPS/SSH fork URLs, PR body
generation, and ensureFork error handling.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge pull request 'fix(bugseti): add comprehensive tests for FetcherService' (#72) from Athena/core:fix/bugseti-fetcher-tests into new 8d3f9a73ee

Merge pull request 'fix(bugseti): add gh CLI availability check with helpful error' (#73) from Athena/core:fix/bugseti-gh-cli-check into new b57e30ea06

Merge pull request 'fix(bugseti): handle silent git fetch failure in submit.go' (#74) from Athena/core:fix/bugseti-git-fetch-error-62 into new 88e5560086

Merge pull request 'fix(bugseti): add mutex protection to seeder concurrent access' (#75) from Athena/core:fix/issue-63-seeder-mutex into new 16a5ba70ef

Merge pull request 'fix(bugseti): update config file permissions to 0600' (#57) from fix/bugseti-config-perms into new 9fe4d5f063

Merge pull request 'fix(bugseti): sanitize shell metacharacters in seeder env vars' (#71) from Athena/core:fix/bugseti-sanitize-shell-metacharacters into new 37b04695d1

Merge pull request 'fix(bugseti): add test coverage for SubmitService PR workflow' (#76) from Athena/core:fix/bugseti-submit-tests into new 7ce8ca717c

feat(agentic): add real-time dashboard with Livewire components (#96) 72529a8281

Add a live agent activity dashboard to the Core App Laravel frontend.
Provides real-time visibility into agent fleet status, job queue,
activity feed, metrics, and human-in-the-loop actions — replacing
SSH + tail -f as the operator interface.

Dashboard panels:
- Agent Fleet: grid of agent cards with heartbeat, status, model info
- Job Queue: filterable table with cancel/retry actions
- Live Activity Feed: real-time stream with agent/type filters
- Metrics: stat cards, budget gauge, cost breakdown, throughput chart
- Human Actions: inline question answering, review gate approval

Tech: Laravel Blade + Livewire 4 + Tailwind CSS + Alpine.js + ApexCharts

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(agentic): add agent trust model with tiered access control 3bfaf37ab1

Implements the security wall between non-aligned agents (issue #97).

Adds pkg/trust with:
- Three trust tiers: Full (Tier 3), Verified (Tier 2), Untrusted (Tier 1)
- Agent registry with mutex-protected concurrent access
- Policy engine with capability-based access control
- Repo-scoped permissions for Tier 2 agents
- Default policies matching the spec (rate limits, approval gates, denials)
- 49 tests covering all tiers, capabilities, edge cases, and helpers

Closes #97

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(agentic): add Forgejo integration bridge for PHP platform 740cf115b2

Add ForgejoClient and ForgejoService to the Laravel app, providing a
clean service layer for all Forgejo REST API operations the orchestrator
needs. Supports multiple instances (forge, dev, qa) with config-driven
auto-routing, token auth, retry with circuit breaker, and pagination.

Covers issues, PRs, repos, branches, user/token management, and orgs.

Closes #98

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(agentic): add agent allowance system for model quotas and budgets 65c138e126

Implements quota enforcement for agents including daily token limits,
daily job limits, concurrent job caps, model allowlists, and global
per-model budgets. Quota recovery returns 50% for failed jobs and
100% for cancelled jobs.
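
The recovery rule reduces to a small function. This is a sketch only: the `jobOutcome` type, the `refund` name, and the use of integer division (which rounds odd amounts down) are assumptions, not the AllowanceService API:

```go
package main

// jobOutcome is a hypothetical enumeration of how a job ended.
type jobOutcome int

const (
	jobCompleted jobOutcome = iota
	jobFailed
	jobCancelled
)

// refund returns how many charged tokens go back to the agent's quota:
// 50% for failed jobs, 100% for cancelled jobs, nothing otherwise.
func refund(tokensCharged int64, outcome jobOutcome) int64 {
	switch outcome {
	case jobFailed:
		return tokensCharged / 2
	case jobCancelled:
		return tokensCharged
	default:
		return 0
	}
}
```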

Go: AllowanceService with MemoryStore, AllowanceStore interface, and
25 tests covering all enforcement paths.

Laravel: migration for 5 tables (agent_allowances, quota_usage,
model_quotas, usage_reports, repo_limits), Eloquent models,
AllowanceService, QuotaMiddleware, and REST API routes.

Closes #99

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge pull request 'feat(agentic): real-time dashboard — live agent activity view' (#148) from feat/agentic-dashboard into new d2dd23697f

Merge pull request 'feat(agentic): agent trust model — security wall between non-aligned agents' (#149) from feat/agentic-trust-model into new 4bc43939a6

Merge pull request 'feat(agentic): Forgejo integration bridge — PHP service linking platform to forges' (#150) from feat/agentic-forgejo-bridge into new 6702d56edb

Merge pull request 'feat(agentic): agent allowance system — respect model quotas and budgets' (#151) from feat/agentic-allowance-system into new 45d8b5b7d4

fix(security): move Gemini API key from URL query params to header (#47) bde00e40f4

Pass the API key via x-goog-api-key HTTP header instead of the URL
query parameter to prevent credential leakage in proxy logs, web
server access logs, and monitoring systems.

Resolves: #47 (CVSS 5.3, OWASP A09:2021)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(bugseti): hold mutex during entire QueueService initialization bcb559630e

Move shared state initialization (issues, seen) and the load() call
inside the mutex scope in NewQueueService() to eliminate the race
window where concurrent callers could observe partially initialized
state. Remove the redundant heap.Init before the lock since load()
already calls heap.Init when restoring from disk.

Add documentation to save() and load() noting they must be called
with q.mu held.

Fixes #51

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(security): sanitize path components in journal logging (#46) b0ef3fb215

Prevent path traversal in Journal.Append() by validating RepoOwner and
RepoName before using them in file paths. Malicious values like
"../../etc/cron.d" could previously write outside the journal baseDir.

Defence layers:
- Reject inputs containing path separators (/ or \)
- Reject ".." and "." traversal components
- Validate against safe character regex ^[a-zA-Z0-9][a-zA-Z0-9._-]*$
- Verify resolved absolute path stays within baseDir

Closes #46
CVSS 6.3 — OWASP A01:2021-Broken Access Control

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(bugseti): add background TTL sweeper and configurable workspace limits 6abe90c8cb

The workspace map previously only cleaned up during Capture() calls,
meaning stale entries would accumulate indefinitely if no new captures
occurred. This adds:

- Background sweeper goroutine (Start/Stop lifecycle) that runs every 5
  minutes to evict expired workspaces
- Configurable MaxWorkspaces and WorkspaceTTLMinutes in Config (defaults:
  100 entries, 24h TTL) replacing hardcoded constants
- cleanup() now returns eviction count for observability logging
- Nil-config fallback to safe defaults

Fixes #54

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge pull request 'fix(security): move Gemini API key from URL to header (#47)' (#154) from fix/gemini-api-key-in-url-47 into new 03d7a7dc4e

Merge pull request 'fix(bugseti): race condition in QueueService.load() - missing mutex during init' (#155) from fix/51-queue-load-race-condition into new e9df62b04e

Merge pull request 'fix(security): path traversal in journal logging (#46)' (#156) from fix/issue-46-path-traversal into new 1e0bff0a2e

Merge pull request 'fix(bugseti): workspace TTL sweeper and configurable limits (#54)' (#157) from fix/54-workspace-ttl-cleanup into new b779c5ece0

chore: migrate forge.lthn.ai → forge.lthn.io bdbfc5e59e

Update Forgejo domain references in CI pipeline, vanity import
tool, and core-app codex prompt.

Co-Authored-By: Virgil <virgil@lethean.io>

Merge pull request 'chore: migrate forge.lthn.ai → forge.lthn.io' (#158) from chore/forge-domain-migration into new f0595f6858

feat(bugseti): migrate from GitHub gh CLI to Forgejo SDK 2979816d83

Replace all exec.Command("gh", ...) calls with the existing pkg/forge
wrapper around the Forgejo Go SDK. BugSETI no longer requires the gh
CLI to be installed.

Changes:
- fetcher: use forge.ListIssues/GetIssue instead of gh issue list/view
- submit: use forge.ForkRepo/CreatePullRequest instead of gh pr create
- seeder: use git clone with forge URL + token auth instead of gh clone
- ghcheck: CheckForge() returns *forge.Client via forge.NewFromConfig()
- config: add ForgeURL/ForgeToken fields (GitHubToken kept for migration)
- pkg/forge: add Token(), GetCurrentUser(), ForkRepo(), CreatePullRequest(),
  ListIssueComments(), and label filtering to ListIssuesOpts

Co-Authored-By: Virgil <virgil@lethean.io>

docs: add BugSETI HubService design doc 39d6dccbf8

Thin HTTP client for portal coordination API — issue claiming,
stats sync, leaderboard, auto-register via forge token.

Co-Authored-By: Virgil <virgil@lethean.io>

docs: add BugSETI HubService implementation plan 1f3e6ba4ab

10 tasks covering Go client + Laravel auth endpoint.
TDD approach with httptest mocks.

Co-Authored-By: Virgil <virgil@lethean.io>

feat(bugseti): add hub coordination config fields and accessors 0af6407666

Add HubURL, HubToken, ClientID, and ClientName fields to Config struct
for agentic portal integration. Include getter/setter methods following
the existing pattern (SetForgeURL, SetForgeToken also added).

Co-Authored-By: Virgil <virgil@lethean.io>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(bugseti): add HubService types and constructor f85bba5332

Introduce HubService struct with types for hub coordination: PendingOp,
HubClaim, LeaderboardEntry, GlobalStats, ConflictError, NotFoundError.
Constructor generates a crypto/rand client ID when none exists. Includes
no-op loadPendingOps/savePendingOps stubs for future persistence.

Co-Authored-By: Virgil <virgil@lethean.io>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(bugseti): add HubService HTTP request helpers 74bb62fda8

Add doRequest() and doJSON() methods for hub API communication. doRequest
builds full URLs, sets bearer auth and JSON headers, tracks connected
state. doJSON handles status codes: 401 unauthorised, 409 ConflictError,
404 NotFoundError, and generic errors for other 4xx/5xx responses.

Co-Authored-By: Virgil <virgil@lethean.io>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(bugseti): add AutoRegister via Forge token exchange f963a45d9f

Exchange a Forge API token for a hub API key by POSTing to
/api/bugseti/auth/forge. Skips if hub token already cached.
Adds drainPendingOps() stub for future Task 7 use.

Co-Authored-By: Virgil <virgil@lethean.io>

feat(bugseti): add hub write operations d583a074f7

Add Register, Heartbeat, ClaimIssue, UpdateStatus, ReleaseClaim,
and SyncStats methods for hub coordination. ClaimIssue returns
ConflictError on 409 and calls drainPendingOps before mutating.

Co-Authored-By: Virgil <virgil@lethean.io>

feat(bugseti): add hub read operations 5d0b6c3a71

Add IsIssueClaimed, ListClaims, GetLeaderboard, and GetGlobalStats
methods. IsIssueClaimed returns (nil, nil) on 404 for unclaimed
issues. GetLeaderboard returns entries and total participant count.

Co-Authored-By: Virgil <virgil@lethean.io>

feat(bugseti): implement pending operations queue with disk persistence 2a8b5c207f

Replace no-op stubs with real implementations for queueOp, drainPendingOps,
savePendingOps, and loadPendingOps. Operations are persisted to hub_pending.json
and replayed on next hub connection — 5xx/transport errors are retried, 4xx
responses are dropped as stale. Adds PendingCount() for queue inspection.
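
The replay rule can be sketched without the HTTP or disk layers. The `pendingOp` fields, the `send` callback, and the `errOffline` sentinel are assumptions; persistence to hub_pending.json is left out:

```go
package main

import "errors"

// pendingOp is a hypothetical queued hub request.
type pendingOp struct {
	Method string
	Path   string
}

// errOffline is an illustrative transport error for callers and tests.
var errOffline = errors.New("hub unreachable")

// drain replays ops in order through send, which returns the HTTP
// status code or a transport error. It returns the ops that should
// stay queued for the next connection.
func drain(ops []pendingOp, send func(pendingOp) (int, error)) []pendingOp {
	var remaining []pendingOp
	for _, op := range ops {
		status, err := send(op)
		switch {
		case err != nil || status >= 500:
			// Transport or server error: keep the op and retry later.
			remaining = append(remaining, op)
		case status >= 400:
			// Client error: the op is stale, so drop it.
		}
	}
	return remaining
}
```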

Co-Authored-By: Virgil <virgil@lethean.io>

feat(bugseti): wire HubService into main.go with auto-registration c72f35bd3f

Add HubService to the Wails service list and attempt hub registration
at startup when hubUrl is configured. Drains any pending operations
queued from previous sessions.

Co-Authored-By: Virgil <virgil@lethean.io>

Merge pull request 'feat(bugseti): migrate from GitHub gh CLI to Forgejo SDK' (#159) from feat/bugseti-forgejo-migration into new 1facdd602f

Merge pull request 'feat(bugseti): add HubService for portal coordination' (#160) from feat/bugseti-hub-service into new 1f43073f57

fix: restore CLI entry point and register all commands f2272e4f6f

The main.go was removed when Wails3 apps were added to cmd/, breaking
`go build .` for the core CLI. Restore it and update variants/full.go
to include daemon, forge, mcpcmd, prod, and session commands. Drop gitea
(superseded by forge) and unifi (unused).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge pull request 'Merge new branch into dev — 382 commits of platform work' (#161) from merge/new-into-dev into dev b8b144bec0

Reviewed-on: https://forge.lthn.io/host-uk/core/pulls/161
Reviewed-by: Snider <snider@lethean.io>

refactor: rename module from github.com/host-uk/core to forge.lthn.ai/core/cli 01d9aa1b73

Move module identity to our own Forgejo instance. All import paths
updated across 434 Go files, sub-module go.mod files, and go.work.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: add ML inference, scoring, and training pipeline (pkg/ml) ca8c155d85

Port LEM scoring/training pipeline into CoreGo as pkg/ml with:
- Inference abstraction with HTTP, llama-server, and Ollama backends
- 3-tier scoring engine (heuristic, exact, LLM judge)
- Capability and content probes for model evaluation
- GGUF/safetensors format converters, MLX to PEFT adapter conversion
- DuckDB integration for training data pipeline
- InfluxDB metrics for lab dashboard
- Training data export (JSONL + Parquet)
- Expansion generation pipeline with distributed workers
- 10 CLI commands under 'core ml' (score, probe, export, expand, status, gguf, convert, agent, worker)
- 5 MCP tools (ml_generate, ml_score, ml_probe, ml_status, ml_backends)

All 37 ML tests passing. Binary builds at 138MB with all commands.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: add native MLX backend for Apple Silicon inference (pkg/mlx) 9d664c055a

CGo wrapper for mlx-c providing zero-Python Metal GPU inference.
Includes Gemma 3 model architecture, BPE tokenizer, KV cache,
composable sampling, and OpenAI-compatible serve command.

Build-tagged (darwin && arm64 && mlx) with stubs for cross-platform.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: correct mlx_closure_new_func_payload signature for mlx-c v0.4.1 004c5c9eb9

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: correct 20 mlx-c API mismatches for v0.4.1 065b42a0be

- Use _axis/_axes variants for softmax, argmax, topk, sum, mean, squeeze,
  concatenate, argpartition
- Fix size_t vs int for count parameters throughout
- Fix int64_t strides in as_strided
- Add mlx_optional_int + mode param to quantized_matmul
- Use mlx_array_new() for null arrays (freqs, key, mask, sinks)
- Fix expand_dims to single-axis signature
- Fix compile callback signature (size_t index)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: resolve CGo type conflict in error handler f89e80732a

Use pure C callback instead of //export to avoid const char* vs
GoString type mismatch in cgo-generated headers.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: remove unused vars in TopP sampler placeholder 290e3416ce

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chore: target macOS 26.0, fix duplicate -lstdc++ linker warning c8ec0f9e49

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: handle nested text_config and language_model weight prefix 6f2d9f8de4

Supports both multimodal (Gemma3ForConditionalGeneration) and
text-only configs. Resolves weights with language_model. prefix
fallback. Computes head_dim from hidden_size when missing.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: use native MLX backend when --model-path is set on Apple Silicon f8d8bd6556

Build-tagged backend selection: MLX on darwin/arm64/mlx, HTTP elsewhere.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: handle both string and array merge formats in tokenizer 1ce4f6b251

Gemma 3 tokenizer.json uses [["a","b"],...] format for merges
instead of the ["a b",...] format. Support both.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: support quantized inference (4-bit) for Gemma 3 c8e66918c3

- Add QuantizedLinear with QuantizedMatmul for packed uint32 weights
- Add quantized Embedding with Dequantize before lookup
- Parse quantization config (group_size, bits) from config.json
- Detect .scales/.biases weight tensors and auto-select quantized path
- Add Dequantize op wrapping mlx_dequantize
- Add safety guard to KVCache.Update for malformed shapes
- Handle tied embeddings with quantization (AsLinear helper)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

debug: add shape logging and stderr error handler for inference debugging ef22946f35

fix: use affine quantization mode and infer head_dim from weights 70c32135d0

fix: correct SDPA mask mode and slice logits to last position b6fbb88bfb

fix: add Metal cache management to prevent memory growth 478bbdd44c

- Add ClearCache() wrapping mlx_clear_cache
- Clear Metal allocator cache every 8 tokens during generation
- Set 16GB cache limit on backend init
- Prevents GPU memory from growing unbounded during inference

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: add GC-based memory management for MLX array handles a27a31faad

Go GC cannot see Metal/C memory pressure, so intermediate arrays from
each forward pass accumulated without bound, causing OOM kills after
3-4 requests. Fix: runtime.SetFinalizer on every Array releases C
handles when GC collects them, and runtime.GC() is forced every 4
tokens during generation. Also adds SetMemoryLimit(24GB) as a hard
Metal ceiling.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: remove Go-side array ref tracking, rely on MLX-C refcounting 298c8d9cd8

The Go wrapper was tracking inter-array references via desc.inputs,
creating chains that kept all intermediate arrays alive across requests.
After 3-4 requests, Metal memory grew to 170GB+ and macOS killed the
process.

Fix: remove desc.inputs/numRefs entirely. MLX-C has its own internal
reference counting — when Go GC finalizes an Array wrapper, it calls
mlx_array_free which decrements the C-side refcount. If the C-side
count reaches 0, Metal memory is freed. Go GC + MLX-C refcounting
together handle all lifecycle management correctly.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: add Metal memory budget monitoring after each request d2bb19c6bd

Tracks model size at load time and checks Metal active memory after
each generation. If usage exceeds 3× model size, forces double GC
and cache clear as a safety net.
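
The safety net amounts to a threshold check. In this sketch the `enforceBudget` name and the two callbacks are assumptions; they stand in for whatever the MLX wrapper uses to read Metal active memory and clear its cache:

```go
package main

import "runtime"

// budgetMultiplier is the 3x model-size ceiling from the message above.
const budgetMultiplier = 3

// enforceBudget checks active memory against the budget. When usage
// exceeds it, two GC cycles are forced and the cache is cleared. It
// reports whether the reclaim path ran.
func enforceBudget(active func() uint64, modelBytes uint64, clearCache func()) bool {
	if active() <= budgetMultiplier*modelBytes {
		return false
	}
	runtime.GC()
	// A second cycle can collect objects whose finalizers were queued
	// by the first one.
	runtime.GC()
	clearCache()
	return true
}
```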

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: port 11 LEM data management commands into core ml a290ab31e9

Ports all remaining LEM pipeline commands from pkg/lem into core ml,
eliminating the standalone LEM CLI dependency. Each command is split
into reusable business logic (pkg/ml/) and a thin cobra wrapper
(internal/cmd/ml/).

New commands: query, inventory, metrics, ingest, normalize, seed-influx,
consolidate, import-all, approve, publish, coverage.

Adds Path(), Exec(), QueryRowScan() convenience methods to DB type.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: integrate lab dashboard as core lab serve

da81534897

Port the standalone lab dashboard (lab.lthn.io) into the core CLI as
pkg/lab/ with collectors, handlers, and HTML templates. The dashboard
monitors machines, Docker containers, Forgejo, HuggingFace models,
training runs, and InfluxDB metrics with SSE live updates.

New command: core lab serve --bind :8080

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(ethics-ab): LEK-1 ethics kernel A/B testing and LoRA POC 1b23082e25

Five-phase ethics kernel testing across 4 local models (Gemma 3 12B,
Mistral 7B, DeepSeek V2 16B, Qwen 2.5 7B) proving that Google's
alignment training creates persistent ethical reasoning pathways in
Gemma that survive distillation.

- Phase 1: LEK-1 signed vs unsigned (Gemma 8.8/10 differential)
- Phase 2: Three-way test (unsigned vs LEK-1 vs Axioms of Life)
- Phase 3: Double-signed/sandwich signing mode comparison
- Phase 4: Multilingual filter mapping (EN/RU/CN bypass vectors)
- Phase 5: Hypnos POC training data + MLX LoRA on M3 Ultra

Key findings: sandwich signing optimal for training, DeepSeek CCP
alignment is weight-level (no prompt override), Russian language
bypasses DeepSeek content filters. LoRA POC mechanism confirmed
with 40 examples — needs 200+ for stable generalisation.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: update import paths to use new forge.lthn.ai domain 4dcd168cd4

Merge branch 'feat/ml-integration' into dev

48d385279b

# Conflicts:
#	.gh-actions/ISSUE_TEMPLATE/config.yml
#	.gh-actions/workflows/alpha-release-manual.yml
#	.gh-actions/workflows/alpha-release-push.yml
#	.gh-actions/workflows/alpha-release.yml
#	.gh-actions/workflows/bugseti-release.yml
#	.gh-actions/workflows/ci-manual.yml
#	.gh-actions/workflows/ci-pull-request.yml
#	.gh-actions/workflows/ci-push.yml
#	.gh-actions/workflows/ci.yml
#	.gh-actions/workflows/coverage-manual.yml
#	.gh-actions/workflows/coverage-pull-request.yml
#	.gh-actions/workflows/coverage-push.yml
#	.gh-actions/workflows/coverage.yml
#	.gh-actions/workflows/release.yml
#	cmd/bugseti/go.mod
#	cmd/bugseti/workspace.go
#	go.sum
#	internal/bugseti/submit.go
#	internal/bugseti/updater/go.mod
#	internal/cmd/ml/cmd_ml.go
#	internal/core-ide/go.mod
#	internal/variants/full.go
#	pkg/ml/db.go

Virgil added 1 commit 2026-02-16 06:18:22 +00:00

Merge branch 'feat/ml-integration' into HEAD

9960d231d0

# Conflicts:
#	cmd/bugseti/go.mod
#	internal/bugseti/submit.go
#	internal/core-ide/go.mod

Snider merged commit 4eb1e02f5e into dev

2026-02-16 06:19:10 +00:00

Snider referenced this pull request from a commit

2026-02-16 06:19:11 +00:00

feat/ml-integration (#2)


2 participants


Reference: core/go#2
