core-agent-ide/codex-rs
Charley Cunningham ec4a2d07e4
Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786)
## Summary
- Stream proposed plans in Plan Mode using `<proposed_plan>` tags parsed
in core, emitting plan deltas plus a plan `ThreadItem`, while stripping
tags from normal assistant output.
- Persist plan items and rebuild them on resume so proposed plans show
in thread history.
- Wire plan items/deltas through app-server protocol v2 and render a
dedicated proposed-plan view in the TUI, including the “Implement this
plan?” prompt only when a plan item is present.

## Changes

### Core (`codex-rs/core`)
- Added a generic, line-based tag parser that buffers each line until it
can disprove a tag prefix; implements auto-close on `finish()` for
unterminated tags. `codex-rs/core/src/tagged_block_parser.rs`
- Refactored proposed plan parsing to wrap the generic parser.
`codex-rs/core/src/proposed_plan_parser.rs`
- In plan mode, stream assistant deltas as:
  - **Normal text** → `AgentMessageContentDelta`
  - **Plan text** → `PlanDelta` + `TurnItem::Plan` start/completion  
  (`codex-rs/core/src/codex.rs`)
- Final plan item content is derived from the completed assistant
message (authoritative), not necessarily the concatenated deltas.
- Strips `<proposed_plan>` blocks from assistant text in plan mode so
tags don’t appear in normal messages.
(`codex-rs/core/src/stream_events_utils.rs`)
- Persist `ItemCompleted` events only for plan items for rollout replay.
(`codex-rs/core/src/rollout/policy.rs`)
- Guard `update_plan` tool in Plan Mode with a clear error message.
(`codex-rs/core/src/tools/handlers/plan.rs`)
- Updated Plan Mode prompt to:  
  - keep `<proposed_plan>` out of non-final reasoning/preambles  
  - require exact tag formatting  
  - allow only one `<proposed_plan>` block per turn  
  (`codex-rs/core/templates/collaboration_mode/plan.md`)

### Protocol / App-server protocol
- Added `TurnItem::Plan` and `PlanDeltaEvent` to core protocol items.
(`codex-rs/protocol/src/items.rs`, `codex-rs/protocol/src/protocol.rs`)
- Added v2 `ThreadItem::Plan` and `PlanDeltaNotification` with
EXPERIMENTAL markers and note that deltas may not match the final plan
item. (`codex-rs/app-server-protocol/src/protocol/v2.rs`)
- Added plan delta route in app-server protocol common mapping.
(`codex-rs/app-server-protocol/src/protocol/common.rs`)
- Rebuild plan items from persisted `ItemCompleted` events on resume.
(`codex-rs/app-server-protocol/src/protocol/thread_history.rs`)

### App-server
- Forward plan deltas to v2 clients and map core plan items to v2 plan
items. (`codex-rs/app-server/src/bespoke_event_handling.rs`,
`codex-rs/app-server/src/codex_message_processor.rs`)
- Added v2 plan item tests.
(`codex-rs/app-server/tests/suite/v2/plan_item.rs`)

### TUI
- Added a dedicated proposed plan history cell with special background
and padding, and moved “• Proposed Plan” outside the highlighted block.
(`codex-rs/tui/src/history_cell.rs`, `codex-rs/tui/src/style.rs`)
- Only show “Implement this plan?” when a plan item exists.
(`codex-rs/tui/src/chatwidget.rs`,
`codex-rs/tui/src/chatwidget/tests.rs`)

<img width="831" height="847" alt="Screenshot 2026-01-29 at 7 06 24 PM"
src="https://github.com/user-attachments/assets/69794c8c-f96b-4d36-92ef-c1f5c3a8f286"
/>

### Docs / Misc
- Updated protocol docs to mention plan deltas.
(`codex-rs/docs/protocol_v1.md`)
- Minor plumbing updates in exec/debug clients to tolerate plan deltas.
(`codex-rs/debug-client/src/reader.rs`, `codex-rs/exec/...`)

## Tests
- Added core integration tests:
  - Plan mode strips plan from agent messages.
  - Missing `</proposed_plan>` closes at end-of-message.  
  (`codex-rs/core/tests/suite/items.rs`)
- Added unit tests for generic tag parser (prefix buffering, non-tag
lines, auto-close). (`codex-rs/core/src/tagged_block_parser.rs`)
- Existing app-server plan item tests in v2.
(`codex-rs/app-server/tests/suite/v2/plan_item.rs`)

## Notes / Behavior
- Plan output no longer appears in standard assistant text in Plan Mode;
it streams via `PlanDelta` and completes as a `TurnItem::Plan`.
- The final plan item content is authoritative and may diverge from
streamed deltas (documented as experimental).
- Reasoning summaries are not filtered; prompt instructs the model not
to include `<proposed_plan>` outside the final plan message.

## Codex Author
`codex fork 019bec2d-b09d-7450-b292-d7bcdddcdbfb`
2026-01-30 18:59:30 +00:00
..
.cargo chore(ci): add cargo audit workflow and policy (#7108) 2025-11-24 12:20:55 -08:00
.config fix flaky test: approval_matrix_covers_all_modes (#7028) 2025-11-20 14:37:42 -08:00
.github/workflows chore(ci): add cargo audit workflow and policy (#7108) 2025-11-24 12:20:55 -08:00
ansi-escape feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
app-server Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786) 2026-01-30 18:59:30 +00:00
app-server-protocol Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786) 2026-01-30 18:59:30 +00:00
app-server-test-client Chore: add cmd related info to exec approval request (#9659) 2026-01-21 23:58:53 -08:00
apply-patch feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
arg0 fix: harden arg0 helper PATH handling (#8766) 2026-01-09 12:35:54 -08:00
async-utils feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
backend-client add error messages for the go plan type (#10181) 2026-01-30 01:17:25 +00:00
chatgpt MCP tool call approval (simplified version) (#10200) 2026-01-29 23:40:32 -08:00
cli feat: refactor CodexAuth so invalid state cannot be represented (#10208) 2026-01-30 09:33:23 -08:00
cloud-requirements chore: fix the build breakage that came from a merge race (#10239) 2026-01-30 10:29:54 -08:00
cloud-tasks add codex cloud list (#9324) 2026-01-16 08:56:38 -08:00
cloud-tasks-client add codex cloud list (#9324) 2026-01-16 08:56:38 -08:00
codex-api chore(personality) new schema with fallbacks (#10147) 2026-01-30 00:10:12 -07:00
codex-backend-openapi-models backend-client: add get_config_requirements_file (#10001) 2026-01-27 16:59:53 +00:00
codex-client Log headers in trace mode (#9214) 2026-01-14 18:38:12 +00:00
common feat(tui) /permissions flow (#9561) 2026-01-21 21:38:46 -08:00
core Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786) 2026-01-30 18:59:30 +00:00
debug-client Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786) 2026-01-30 18:59:30 +00:00
docs Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786) 2026-01-30 18:59:30 +00:00
exec Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786) 2026-01-30 18:59:30 +00:00
exec-server Fetch Requirements from cloud (#10167) 2026-01-30 12:03:29 +00:00
execpolicy feat(core) RequestRule (#9489) 2026-01-28 08:43:17 +00:00
execpolicy-legacy feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
feedback feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
file-search file-search: improve file query perf (#9939) 2026-01-28 10:54:43 -08:00
keyring-store feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
linux-sandbox remove sandbox globals. (#9797) 2026-01-27 11:04:23 -08:00
lmstudio chore(deps): bump tracing from 0.1.43 to 0.1.44 in /codex-rs (#9880) 2026-01-26 15:48:45 -08:00
login feat(app-server): support external auth mode (#10012) 2026-01-29 23:46:04 +00:00
mcp-server Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786) 2026-01-30 18:59:30 +00:00
mcp-types feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
network-proxy Fetch Requirements from cloud (#10167) 2026-01-30 12:03:29 +00:00
ollama ollama: default to Responses API for built-ins (#8798) 2026-01-13 09:51:41 -08:00
otel feat: backfill timing metric (#10218) 2026-01-30 10:19:41 +01:00
process-hardening feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
protocol Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786) 2026-01-30 18:59:30 +00:00
responses-api-proxy fix: remove references to corepack (#10138) 2026-01-28 23:31:25 -08:00
rmcp-client Show OAuth error descriptions in callback responses (#9654) 2026-01-27 13:22:54 -08:00
scripts Upgrade to rust 1.93 (#10080) 2026-01-28 17:46:18 +00:00
state nit: fix db with multiple metadata lines (#10237) 2026-01-30 17:32:10 +00:00
stdio-to-uds feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
tui Plan mode: stream proposed plans, emit plan items, and render in TUI (#9786) 2026-01-30 18:59:30 +00:00
utils [bazel] Improve runfiles handling (#10098) 2026-01-29 00:15:44 +00:00
windows-sandbox-rs error code/msg details for failed elevated setup (#9941) 2026-01-27 23:06:10 -08:00
.gitignore [MCP] Prefix MCP tools names with mcp__ (#5309) 2025-10-19 20:41:55 -04:00
BUILD.bazel feat: add support for building with Bazel (#8875) 2026-01-09 11:09:43 -08:00
Cargo.lock chore: fix the build breakage that came from a merge race (#10239) 2026-01-30 10:29:54 -08:00
Cargo.toml Fetch Requirements from cloud (#10167) 2026-01-30 12:03:29 +00:00
clippy.toml fix: switch rate limit reset handling to timestamps (#5304) 2025-10-17 17:39:37 -07:00
code Send text parameter for non-gpt-5 models (#4195) 2025-09-24 22:00:06 +00:00
config.md Fix link to MCP Servers config section (#5301) 2025-10-17 14:58:27 -07:00
default.nix fix: drop stale filedescriptor output hash for nix (#7865) 2025-12-11 10:43:50 -08:00
deny.toml feat: introducing a network sandbox proxy (#8442) 2026-01-23 17:47:09 -08:00
README.md chore: update outdated docs (#8701) 2026-01-03 02:19:52 -08:00
rust-toolchain.toml Upgrade to rust 1.93 (#10080) 2026-01-28 17:46:18 +00:00
rustfmt.toml Update cargo to 2024 edition (#842) 2025-05-07 08:37:48 -07:00

Codex CLI (Rust Implementation)

We provide Codex CLI as a standalone, native executable to ensure a zero-dependency install.

Installing Codex

Today, the easiest way to install Codex is via npm:

npm i -g @openai/codex
codex

You can also install via Homebrew (brew install --cask codex) or download a platform-specific release directly from our GitHub Releases.

Documentation quickstart

What's new in the Rust CLI

The Rust implementation is now the maintained Codex CLI and serves as the default experience. It includes a number of features that the legacy TypeScript CLI never supported.

Config

Codex supports a rich set of configuration options. Note that the Rust CLI uses config.toml instead of config.json. See docs/config.md for details.

Model Context Protocol Support

MCP client

Codex CLI functions as an MCP client that allows the Codex CLI and IDE extension to connect to MCP servers on startup. See the configuration documentation for details.

MCP server (experimental)

Codex can be launched as an MCP server by running codex mcp-server. This allows other MCP clients to use Codex as a tool for another agent.

Use the @modelcontextprotocol/inspector to try it out:

npx @modelcontextprotocol/inspector codex mcp-server

Use codex mcp to add/list/get/remove MCP server launchers defined in config.toml, and codex mcp-server to run the MCP server directly.

Notifications

You can enable notifications by configuring a script that is run whenever the agent finishes a turn. The notify documentation includes a detailed example that explains how to get desktop notifications via terminal-notifier on macOS. When Codex detects that it is running under WSL 2 inside Windows Terminal (WT_SESSION is set), the TUI automatically falls back to native Windows toast notifications so approval prompts and completed turns surface even though Windows Terminal does not implement OSC 9.

codex exec to run Codex programmatically/non-interactively

To run Codex non-interactively, run codex exec PROMPT (you can also pass the prompt via stdin) and Codex will work on your task until it decides that it is done and exits. Output is printed to the terminal directly. You can set the RUST_LOG environment variable to see more about what's going on.

Experimenting with the Codex Sandbox

To test to see what happens when a command is run under the sandbox provided by Codex, we provide the following subcommands in Codex CLI:

# macOS
codex sandbox macos [--full-auto] [--log-denials] [COMMAND]...

# Linux
codex sandbox linux [--full-auto] [COMMAND]...

# Windows
codex sandbox windows [--full-auto] [COMMAND]...

# Legacy aliases
codex debug seatbelt [--full-auto] [--log-denials] [COMMAND]...
codex debug landlock [--full-auto] [COMMAND]...

Selecting a sandbox policy via --sandbox

The Rust CLI exposes a dedicated --sandbox (-s) flag that lets you pick the sandbox policy without having to reach for the generic -c/--config option:

# Run Codex with the default, read-only sandbox
codex --sandbox read-only

# Allow the agent to write within the current workspace while still blocking network access
codex --sandbox workspace-write

# Danger! Disable sandboxing entirely (only do this if you are already running in a container or other isolated env)
codex --sandbox danger-full-access

The same setting can be persisted in ~/.codex/config.toml via the top-level sandbox_mode = "MODE" key, e.g. sandbox_mode = "workspace-write".

Code Organization

This folder is the root of a Cargo workspace. It contains quite a bit of experimental code, but here are the key crates:

  • core/ contains the business logic for Codex. Ultimately, we hope this to be a library crate that is generally useful for building other Rust/native applications that use Codex.
  • exec/ "headless" CLI for use in automation.
  • tui/ CLI that launches a fullscreen TUI built with Ratatui.
  • cli/ CLI multitool that provides the aforementioned CLIs via subcommands.