core-agent-ide

Author	SHA1	Message	Date
jif-oai	284c03ceab	chore: enable sub agents (#11173 )	2026-02-09 11:25:37 +00:00
jif-oai	13de744296	fix: do not show closed agents in `/agent` (#11175 )	2026-02-09 11:25:31 +00:00
jif-oai	753821c90f	chore: enable shell snapshot (#11172 )	2026-02-09 11:23:59 +00:00
jif-oai	6cf61725d0	feat: do not close unified exec processes across turns (#10799 ) With this PR we do not close the unified exec processes (i.e. background terminals) at the end of a turn unless: * The user interrupt the turn * The user decide to clean the processes through `app-server` or `/clean` I made sure that `codex exec` correctly kill all the processes	2026-02-09 10:27:46 +00:00
rakan-oai	4e9e6ca243	tui: avoid no-op status-line redraws (#11155 ) Rate-limit snapshots are polled every 60s, which causes unconditional redraws. This causes spurious "tab changed" indicators in terminal apps.	2026-02-09 00:13:19 -08:00
Michael Bolin	383b45279e	feat: include NetworkConfig through ExecParams (#11105 ) This PR adds the following field to `Config`: ```rust pub network: Option<NetworkProxy>, ``` Though for the moment, it will always be initialized as `None` (this will be addressed in a subsequent PR). This PR does the work to thread `network` through to `execute_exec_env()`, `process_exec_tool_call()`, and `UnifiedExecRuntime.run()` to ensure it is available whenever we span a process.	2026-02-09 03:32:17 +00:00
Michael Bolin	ff74aaae21	chore: reverse the codex-network-proxy -> codex-core dependency (#11121 )	2026-02-08 17:03:24 -08:00
Matthew Zeng	45b7763c3f	[apps] Improve app loading. (#10994 ) There are two concepts of apps that we load in the harness: - Directory apps, which is all the apps that the user can install. - Accessible apps, which is what the user actually installed and can be $ inserted and be used by the model. These are extracted from the tools that are loaded through the gateway MCP. Previously we wait for both sets of apps before returning the full apps list. Which causes many issues because accessible apps won't be available to the UI or the model if directory apps aren't loaded or failed to load. In this PR we are separating them so that accessible apps can be loaded separately and are instantly available to be shown in the UI and to be provided in model context. We also added an app-server event so that clients can subscribe to also get accessible apps without being blocked on the full app list. - [x] Separate accessible apps and directory apps loading. - [x] `app/list` request will also emit `app/list/updated` notifications that app-server clients can subscribe. Which allows clients to get accessible apps list to render in the $ menu without being blocked by directory apps. - [x] Cache both accessible and directory apps with 1 hour TTL to avoid reloading them when creating new threads. - [x] TUI improvements to redraw $ menu and /apps menu when app list is updated.	2026-02-08 15:24:56 -08:00
Michael Bolin	181b721ba5	feat: include [experimental_network] in <environment_context> (#11044 ) If `NetworkConstraints` is set, then include the relevant settings on `<environment_context>`. Example: ```xml <environment_context> <cwd>/repo</cwd> <shell>bash</shell> <network enabled="true"> <allowed>api.example.com</allowed> <allowed>*.openai.com</allowed> <denied>blocked.example.com</denied> </network> </environment_context> ```	2026-02-08 15:16:50 -08:00
Matthew Zeng	9f1009540b	Upgrade rmcp to 0.14 (#10718 ) - [x] Upgrade rmcp to 0.14	2026-02-08 15:07:53 -08:00
Michael Bolin	ef5d26e586	chore: refactor network-proxy so that ConfigReloader is injectable behavior (#11114 ) Currently, `codex-network-proxy` depends on `codex-core`, but this should be the other way around. As a first step, refactor out `ConfigReloader`, which should make it easier to move `codex-rs/network-proxy/src/state.rs` to `codex-core` in a subsequent commit.	2026-02-08 22:28:20 +00:00
zbarsky-openai	44a1355133	[bazel] Upgrade some rulesets in preparation for enabling windows (#11109 )	2026-02-08 13:40:32 -08:00
Tom	409ec76fbc	Gate view_image tool by model input_modalities (#11051 ) - Plumb input modalities from model catalog through the openai model protocol. Default to text and image. - Conditionally add the view_image tool only if input modalities support image.	2026-02-08 10:45:26 -08:00
Eric Traut	b3de6c7f2b	Defer persistence of rollout file (#11028 ) - Defer rollout persistence for fresh threads (`InitialHistory::New`): keep rollout events in memory and only materialize rollout file + state DB row on first `EventMsg::UserMessage`. - Keep precomputed rollout path available before materialization. - Change `thread/start` to build thread response from live config snapshot and optional precomputed path. - Improve pre-materialization behavior in app-server/TUI: clearer invalid-request errors for file-backed ops and a friendlier `/fork` “not ready yet” UX. - Update tests to match deferred semantics across start/read/archive/unarchive/fork/resume/review flows. - Improved resilience of user_shell test, which should be unrelated to this change but must be affected by timing changes For Reviewers: * The primary change is in recorder.rs * Most of the other changes were to fix up broken assumptions in existing tests Testing: * Manually tested CLI * Exercised app server paths by manually running IDE Extension with rebuilt CLI binary * Only user-visible change is that `/fork` in TUI generates visible error if used prior to first turn	2026-02-07 23:05:03 -08:00
pakrym-oai	6d08298f4e	Fallback to HTTP on UPGRADE_REQUIRED (#10824 ) Allow the server to trigger a connection downgrade in case the protocol changes in incompatible ways.	2026-02-08 05:06:33 +00:00
Chriss4123	d68e9c0f19	fix(tui): rehydrate drafts and restore image placeholders (#9040 ) Fixes #9050 When a draft is stashed with Ctrl+C, we now persist the full draft state (text elements, local image paths, and pending paste payloads) in local history. Up/Down recall rehydrates placeholder elements and attachments so styling remains correct and large pastes still expand on submit. Persistent (cross‑session) history remains text‑only. Backtrack prefills now reuse the selected user message’s text elements and local image paths, so image placeholders/attachments rehydrate when rolling back. External editor replacements keep only attachments whose placeholders remain and then normalize image placeholders to `[Image #1]..[Image #N]` to keep the attachment mapping consistent. Docs: - docs/tui-chat-composer.md Testing: - just fix -p codex-tui - cargo test -p codex-tui Co-authored-by: Eric Traut <etraut@openai.com>	2026-02-07 20:08:45 -08:00
Anton Panasenko	a94505a92a	feat: enable premessage-deflate for websockets (#10966 ) note: unfortunately, tokio-tungstenite / tungstenite upgrade triggers some problems with linker of rama-tls-boring with openssl: ``` error: linking with `/Users/apanasenko/Library/Caches/cargo-zigbuild/0.20.1/zigcc-x86_64-unknown-linux-musl-ff6a.sh` failed: exit status: 1 \| = note: "/Users/apanasenko/Library/Caches/cargo-zigbuild/0.20.1/zigcc-x86_64-unknown-linux-musl-ff6a.sh" "-m64" "<sysroot>/lib/rustlib/x86_64-unknown-linux-musl/lib/self-contained/rcrt1.o" "<sysroot>/lib/rustlib/x86_64-unknown-linux-musl/lib/self-contained/crti.o" "<sysroot>/lib/rustlib/x86_64-unknown-linux-musl/lib/self-contained/crtbeginS.o" "<1 object files omitted>" "-Wl,--as-needed" "-Wl,-Bstatic" "/var/folders/kt/52y_g75x3ng8ktvk3rfwm6400000gp/T/rustcyGQdYm/{liblzma_sys-662a82316f96ec30,libbzip2_sys-bf78a2d58d5cbce6,liblibsqlite3_sys-6c004987fd67a36a,libtree_sitter_bash-220b99a97d331ab7,libtree_sitter-858f0a1dbfea58bd,libzstd_sys-6eb237deec748c5b,libring-2a87376483bf916f,libopenssl_sys-7c189e68b37fe2bb,liblibz_sys-4344eef4345520b1,librama_boring_sys-0414e98115015ee0}.rlib" "-lc++" "-lc++abi" "-lunwind" "-lc" "<sysroot>/lib/rustlib/x86_64-unknown-linux-musl/lib/libcompiler_builtins-*.rlib" "-L" "/var/folders/kt/52y_g75x3ng8ktvk3rfwm6400000gp/T/rustcyGQdYm/raw-dylibs" "-Wl,-Bdynamic" "-Wl,--eh-frame-hdr" "-Wl,-z,noexecstack" "-nostartfiles" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/libz-sys-ff5ea50d88c28ffb/out/lib" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/ring-bdec3dddc19f5a5e/out" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/openssl-sys-96e0870de3ca22bc/out/openssl-build/install/lib" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/zstd-sys-0cc37a5da1481740/out" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/tree-sitter-72d2418073317c0f/out" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/tree-sitter-bash-bfd293a9f333ce6a/out" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/libsqlite3-sys-b78b2cfb81a330fc/out" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/bzip2-sys-69a145cc859ef275/out/lib" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/lzma-sys-07e92d0b6baa6fd4/out" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/rama-boring-sys-0bc2dfbf669addc4/out/build/crypto/" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/rama-boring-sys-0bc2dfbf669addc4/out/build/ssl/" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/rama-boring-sys-0bc2dfbf669addc4/out/build/" "-L" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/rama-boring-sys-0bc2dfbf669addc4/out/build" "-L" "<sysroot>/lib/rustlib/x86_64-unknown-linux-musl/lib/self-contained" "-L" "<sysroot>/lib/rustlib/x86_64-unknown-linux-musl/lib" "-o" "/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/deps/codex_network_proxy-d08268b863517761" "-Wl,--gc-sections" "-static-pie" "-Wl,-z,relro,-z,now" "-Wl,-O1" "-Wl,--strip-all" "-nodefaultlibs" "<sysroot>/lib/rustlib/x86_64-unknown-linux-musl/lib/self-contained/crtendS.o" "<sysroot>/lib/rustlib/x86_64-unknown-linux-musl/lib/self-contained/crtn.o" = note: some arguments are omitted. use `--verbose` to show all linker arguments = note: warning: ignoring deprecated linker optimization setting '1' warning: unable to open library directory '/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/rama-boring-sys-0bc2dfbf669addc4/out/build/crypto/': FileNotFound ld.lld: error: duplicate symbol: SSL_export_keying_material >>> defined at ssl_lib.c:3816 (ssl/ssl_lib.c:3816) >>> libssl-lib-ssl_lib.o:(SSL_export_keying_material) in archive /var/folders/kt/52y_g75x3ng8ktvk3rfwm6400000gp/T/rustcyGQdYm/libopenssl_sys-7c189e68b37fe2bb.rlib >>> defined at t1_enc.cc:205 (/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/rama-boring-sys-0bc2dfbf669addc4/out/boringssl/ssl/t1_enc.cc:205) >>> t1_enc.cc.o:(.text.SSL_export_keying_material+0x0) in archive /var/folders/kt/52y_g75x3ng8ktvk3rfwm6400000gp/T/rustcyGQdYm/librama_boring_sys-0414e98115015ee0.rlib ld.lld: error: duplicate symbol: d2i_ASN1_TIME >>> defined at a_time.c:27 (crypto/asn1/a_time.c:27) >>> libcrypto-lib-a_time.o:(d2i_ASN1_TIME) in archive /var/folders/kt/52y_g75x3ng8ktvk3rfwm6400000gp/T/rustcyGQdYm/libopenssl_sys-7c189e68b37fe2bb.rlib >>> defined at a_time.cc:34 (/Users/apanasenko/code/codex/codex-rs/target/x86_64-unknown-linux-musl/release/build/rama-boring-sys-0bc2dfbf669addc4/out/boringssl/crypto/asn1/a_time.cc:34) >>> a_time.cc.o:(.text.d2i_ASN1_TIME+0x0) in archive /var/folders/kt/52y_g75x3ng8ktvk3rfwm6400000gp/T/rustcyGQdYm/librama_boring_sys-0414e98115015ee0.rlib ``` that force me to migrate away from rama-tls-boring to rama-tls-rustls and pin `ring` for rustls.	2026-02-07 17:59:34 -08:00
pakrym-oai	8fe5066bcc	Simplify pre-connect (#11040 )	2026-02-07 15:52:03 -08:00
Michael Bolin	2e89cb9117	feat: include state of [experimental_network] in /debug-config output (#11039 ) #10958 introduced experimental support for a network config in `/etc/codex/requirements.toml`, so this extends `/debug-config` to surface this information, if set, which should make it easier to debug.	2026-02-07 21:38:12 +00:00
Charley Cunningham	e6662d6387	app-server: treat null mode developer instructions as built-in defaults (#10983 ) ## Summary - make `turn/start` normalize `collaborationMode.settings.developer_instructions: null` to the built-in instructions for the selected mode - prevent app-server clients from accidentally clearing mode-switch developer instructions by sending `null` - document this behavior in the v2 protocol and app-server docs ## What changed - `codex-rs/app-server/src/codex_message_processor.rs` - added a small `normalize_turn_start_collaboration_mode` helper - in `turn_start`, apply normalization before `OverrideTurnContext` - `codex-rs/app-server/tests/suite/v2/turn_start.rs` - extended `turn_start_accepts_collaboration_mode_override_v2` to assert the outgoing request includes default-mode instruction text when the client sends `developer_instructions: null` - `codex-rs/app-server-protocol/src/protocol/v2.rs` - clarified `TurnStartParams.collaboration_mode` docs: `settings.developer_instructions: null` means use built-in mode instructions - regenerated schema fixture: - `codex-rs/app-server-protocol/schema/typescript/v2/TurnStartParams.ts` - docs: - `codex-rs/app-server/README.md` - `codex-rs/docs/codex_mcp_interface.md`	2026-02-07 12:59:41 -08:00
viyatb-oai	739908a12c	feat(core): add network constraints schema to requirements.toml (#10958 ) ## Summary Add `requirements.toml` schema support for admin-defined network constraints in the requirements layer example config: ``` [experimental_network] enabled = true allowed_domains = ["api.openai.com"] denied_domains = ["example.com"] ```	2026-02-07 19:48:24 +00:00
Eric Traut	16e7cf05d2	Fixed a flaky Windows test that is consistently causing a CI failure (#10987 ) Loop wait_for_complete/wait_for_updates_at_least until deadline to prevent Windows CI false timeouts in query-change session tests.	2026-02-07 09:08:13 -08:00
Eric Traut	10336068db	Fix flaky windows CI test (#10993 ) Hardens PTY Python REPL test and make MCP test startup deterministic Summary - `utils/pty/src/tests.rs` - Added a REPL readiness handshake (`wait_for_python_repl_ready`) that repeatedly sends a marker and waits for it in PTY output before sending test commands. - Updated `pty_python_repl_emits_output_and_exits` to: - wait for readiness first, - preserve startup output, - append output collected through process exit. - Reduces Windows/ConPTY flakiness from early stdin writes racing REPL startup. - `mcp-server/tests/suite/codex_tool.rs` - Avoid remote model refresh during MCP test startup, reducing timeout-prone nondeterminism.	2026-02-07 08:55:42 -08:00
jif-oai	83c74125bc	Bootstrap shell commands via user shell snapshot (#10909 ) Summary - wrap `shell -lc` executions that use a snapshot with the session shell so the saved environment is sourced before delegating to the original shell - escape single quotes in the generated wrapper and add tests covering Bash/Zsh/sh session bootstrapping Testing - Not run (not requested)	2026-02-07 17:36:44 +01:00
jif-oai	62605fa471	Add resume_agent collab tool (#10903 ) Summary - add the new resume_agent collab tool path through core, protocol, and the app server API, including the resume events - update the schema/TypeScript definitions plus docs so resume_agent appears in generated artifacts and README - note that resumed agents rehydrate rollout history without overwriting their base instructions Testing - Not run (not requested)	2026-02-07 17:31:45 +01:00
Michael Bolin	4cd0c42a28	fix: normalize line endings when reading file on Windows (#10988 ) I did not wait for CI on https://github.com/openai/codex/pull/10980 because it was blocking an alpha release, but apparently it broken the Windows build.	2026-02-06 23:49:19 -08:00
Charley Cunningham	f3f35526a8	Show left/right arrows to navigate in tui request_user_input (#10921 ) <img width="785" height="185" alt="Screenshot 2026-02-06 at 10 25 13 AM" src="https://github.com/user-attachments/assets/402a6e79-4626-4df9-b3da-bc2f28e64611" /> <img width="784" height="213" alt="Screenshot 2026-02-06 at 10 26 37 AM" src="https://github.com/user-attachments/assets/cf9614b2-aa1e-4c61-8579-1d2c7e1c7dc1" /> "left/right to navigate questions" in request_user_input footer	2026-02-06 23:41:08 -08:00
Eric Traut	3779b52e2d	Do not poll for usage when using API Key auth (#10973 ) Fixes #10869 - Gate TUI rate-limit polling on ChatGPT-auth providers only. - `prefetch_rate_limits()` now checks `should_prefetch_rate_limits()`. - New gate requires: - `config.model_provider.requires_openai_auth` - cached auth is ChatGPT (`CodexAuth::is_chatgpt_auth`) - Prevents `/wham/usage` polling in API/custom-endpoint profiles.	2026-02-06 23:26:44 -08:00
Michael Bolin	18bb25557c	fix: use expected line ending in codex-rs/core/config.schema.json (#10977 ) Fixes a line ending that was altered in https://github.com/openai/codex/pull/10861. This is breaking the release due to: `a118494323/.github/workflows/rust-release.yml (L54-L55)` This PR updates the test to check for this so we should catch it in CI (or when running tests locally): `a118494323/codex-rs/core/src/config/schema.rs (L105-L131)`	2026-02-06 22:30:57 -08:00
Michael Bolin	a118494323	feat: add support for allowed_web_search_modes in requirements.toml (#10964 ) This PR makes it possible to disable live web search via an enterprise config even if the user is running in `--yolo` mode (though cached web search will still be available). To do this, create `/etc/codex/requirements.toml` as follows: ```toml # "live" is not allowed; "disabled" is allowed even though not listed explicitly. allowed_web_search_modes = ["cached"] ``` Or set `requirements_toml_base64` MDM as explained on https://developers.openai.com/codex/security/#locations. ### Why - Enforce admin/MDM/`requirements.toml` constraints on web-search behavior, independent of user config and per-turn sandbox defaults. - Ensure per-turn config resolution and review-mode overrides never crash when constraints are present. ### What - Add `allowed_web_search_modes` to requirements parsing and surface it in app-server v2 `ConfigRequirements` (`allowedWebSearchModes`), with fixtures updated. - Define a requirements allowlist type (`WebSearchModeRequirement`) and normalize semantics: - `disabled` is always implicitly allowed (even if not listed). - An empty list is treated as `["disabled"]`. - Make `Config.web_search_mode` a `Constrained<WebSearchMode>` and apply requirements via `ConstrainedWithSource<WebSearchMode>`. - Update per-turn resolution (`resolve_web_search_mode_for_turn`) to: - Prefer `Live → Cached → Disabled` when `SandboxPolicy::DangerFullAccess` is active (subject to requirements), unless the user preference is explicitly `Disabled`. - Otherwise, honor the user’s preferred mode, falling back to an allowed mode when necessary. - Update TUI `/debug-config` and app-server mapping to display normalized `allowed_web_search_modes` (including implicit `disabled`). - Fix web-search integration tests to assert cached behavior under `SandboxPolicy::ReadOnly` (since `DangerFullAccess` legitimately prefers `live` when allowed).	2026-02-07 05:55:15 +00:00
Eric Traut	82c981cafc	Process-group cleanup for stdio MCP servers to prevent orphan process storms (#10710 ) This PR changes stdio MCP child processes to run in their own process group * Add guarded teardown in codex-rmcp-client: send SIGTERM to the group first, then SIGKILL after a short grace period. * Add terminate_process_group helper in process_group.rs. * Add Unix regression test in process_group_cleanup.rs to verify wrapper + grandchild are reaped on client drop. Addresses reported MCP process/thread storm: #10581	2026-02-06 21:26:36 -08:00
Eric Traut	4d52428fa2	Fixed a flaky test (#10970 ) ## Summary Stabilize v2 review integration tests by making them hermetic with respect to model discovery. `app-server` review tests were intermittently timing out in CI (especially on Windows runners) because their test config allowed remote model refresh. During `thread/start`, the test process could issue live `/v1/models` requests, introducing external network latency and nondeterministic timing before review flow assertions. This change disables remote model fetching in the review test config helper used by these tests.	2026-02-06 21:26:26 -08:00
viyatb-oai	8cd46ebad6	refactor(network-proxy): flatten network config under [network] (#10965 ) Summary: - Rename config table from network_proxy to network. - Flatten allowed_domains, denied_domains, allow_unix_sockets, and allow_local_binding onto NetworkProxySettings. - Update runtime, state constraints, tests, and README to the new config shape.	2026-02-07 05:22:44 +00:00
sayan-oai	5d2702f6b8	fix(tui): conditionally restore status indicator using message phase (#10947 ) TLDR: use new message phase field emitted by preamble-supported models to determine whether an AgentMessage is mid-turn commentary. if so, restore the status indicator afterwards to indicate the turn has not completed. ### Problem `commit_tick` hides the status indicator while streaming assistant text. For preamble-capable models, that text can be commentary mid-turn, so hiding was correct during streaming but restore timing mattered: - restoring too aggressively caused jitter/flashing - not restoring caused indicator to stay hidden before subsequent work (tool calls, web search, etc.) ### Fix - Add optional `phase` to `AgentMessageItem` and propagate it from `ResponseItem::Message` - Keep indicator hidden during streamed commit ticks, restore only when: - assistant item completes as `phase=commentary`, and - stream queues are idle + task is still running. - Treat `phase=None` as final-answer behavior (no restore) to keep existing behavior for non-preamble models ### Tests Add/update tests for: - no idle-tick restore without commentary completion - commentary completion restoring status before tool begin - snapshot coverage for preamble/status behavior --------- Co-authored-by: Josh McKinney <joshka@openai.com>	2026-02-07 02:39:52 +00:00
canvrno-oai	1446bd2b23	Mark Config.apps as experimental, correct schema generation issue (#10938 ) This PR makes `Config.apps `experimental-only and fixes a TS schema post-processing bug that removed needed imports. The bug happened because import pruning only checked the inner type body after filtering, not the full alias, so `JsonValue` got dropped from `Config.ts`. We now prune against the full alias body and added a regression test for this scenario.	2026-02-06 16:30:41 -08:00
Javi	87ce50f118	app-server: print help message to console when starting websockets server (#10943 ) Follow-up to https://github.com/openai/codex/pull/10693 <img width="596" height="77" alt="image" src="https://github.com/user-attachments/assets/9140df70-01d1-4c5a-85ee-ca15a09a0e77" />	2026-02-07 00:18:42 +00:00
daniel-oai	84bce2b8e6	TUI/Core: preserve duplicate skill/app mention selection across submit + resume (#10855 ) ## What changed - In `codex-rs/core/src/skills/injection.rs`, we now honor explicit `UserInput::Skill { name, path }` first, then fall back to text mentions only when safe. - In `codex-rs/tui/src/bottom_pane/chat_composer.rs`, mention selection is now token-bound (selected mention is tied to the specific inserted `$token`), and we snapshot bindings at submit time so selection is not lost. - In `codex-rs/tui/src/chatwidget.rs` and `codex-rs/tui/src/bottom_pane/mod.rs`, submit/queue paths now consume the submit-time mention snapshot (instead of rereading cleared composer state). - In `codex-rs/tui/src/mention_codec.rs` and `codex-rs/tui/src/bottom_pane/chat_composer_history.rs`, history now round-trips mention targets so resume restores the same selected duplicate. - In `codex-rs/tui/src/bottom_pane/skill_popup.rs` and `codex-rs/tui/src/bottom_pane/chat_composer.rs`, duplicate labels are normalized to `[Repo]` / `[App]`, app rows no longer show `Connected -`, and description space is a bit wider. <img width="550" height="163" alt="Screenshot 2026-02-05 at 9 56 56 PM" src="https://github.com/user-attachments/assets/346a7eb2-a342-4a49-aec8-68dfec0c7d89" /> <img width="550" height="163" alt="Screenshot 2026-02-05 at 9 57 09 PM" src="https://github.com/user-attachments/assets/5e04d9af-cccf-4932-98b3-c37183e445ed" /> ## Before vs now - Before: selecting a duplicate could still submit the default/repo match, and resume could lose which duplicate was originally selected. - Now: the exact selected target (skill path or app id) is preserved through submit, queue/restore, and resume. ## Manual test 1. Build and run this branch locally: - `cd /Users/daniels/code/codex/codex-rs` - `cargo build -p codex-cli --bin codex` - `./target/debug/codex` 2. Open mention picker with `$` and pick a duplicate entry (not the first one). 3. Confirm duplicate UI: - repo duplicate rows show `[Repo]` - app duplicate rows show `[App]` - app description does not start with `Connected -` 4. Submit the prompt, then press Up to restore draft and submit again. Expected: it keeps the same selected duplicate target. 5. Use `/resume` to reopen the session and send again. Expected: restored mention still resolves to the same duplicate target.	2026-02-06 15:59:00 -08:00
alexsong-oai	daeef06bec	add originator to otel (#10826 )	2026-02-06 15:13:56 -08:00
Brian Yu	1fbf5ed06f	Support alternative websocket API (#10861 ) Test plan ``` cargo build -p codex-cli && RUST_LOG='codex_api::endpoint::responses_websocket=trace,codex_core::client=debug,codex_core::codex=debug' \ ./target/debug/codex \ --enable responses_websockets_v2 \ --profile byok \ --full-auto ```	2026-02-06 14:40:50 -08:00
Ahmed Ibrahim	ba8b5d9018	Treat compaction failure as failure state (#10927 ) - Return compaction errors from local and remote compaction flows.\n- Stop turns/tasks when auto-compaction fails instead of continuing execution.	2026-02-06 13:51:46 -08:00
Owen Lin	1751116ec6	chore(app-server): add experimental annotation to relevant fields (#10928 ) These fields had always been documented as experimental/unstable with docstrings, but now let's actually use the `experimental` annotation to be more explicit. - thread/start.experimentalRawEvents - thread/resume.history - thread/resume.path - thread/fork.path - turn/start.collaborationMode - account/login/start.chatgptAuthTokens	2026-02-06 20:48:04 +00:00
Charley Cunningham	143daadb31	core: refresh developer instructions after compaction replacement history (#10574 ) ## Summary When replaying compacted history (especially `replacement_history` from remote compaction), we should not keep stale developer messages from older session state. This PR trims developer- role messages from compacted replacement history and reinjects fresh developer instructions derived from current turn/session state. This aligns compaction replay behavior with the intended "fresh instructions after summary" model. ## Problem Compaction replay had two paths: - `Compacted { replacement_history: None }`: rebuilt with fresh initial context - `Compacted { replacement_history: Some(...) }`: previously used raw replacement history as-is The second path could carry stale developer instructions (permissions/personality/collab-mode guidance) across session changes. ## What Changed ### 1) Added helper to refresh compacted developer instructions - File: `codex-rs/core/src/compact.rs` - Function: `refresh_compacted_developer_instructions(...)` Behavior: - remove all `ResponseItem::Message { role: "developer", .. }` from compacted history - append fresh developer messages from current `build_initial_context(...)` ### 2) Applied helper in remote compaction flow - File: `codex-rs/core/src/compact_remote.rs` - After receiving compact endpoint output, refresh developer instructions before replacing history and persisting `replacement_history`. ### 3) Applied helper while reconstructing history from rollout - File: `codex-rs/core/src/codex.rs` - In `reconstruct_history_from_rollout(...)`, when processing `Compacted` entries with `replacement_history`, refresh developer instructions instead of directly replacing with raw history. ## Non-Goals / Follow-up This PR does not address the existing first-turn-after-resume double-injection behavior. A follow-up PR will handle resume-time dedup/idempotence separately. If you want, I can also give you a shorter “squash-merge friendly” version of the description. ## Codex author `codex fork 019c25e6-706e-75d1-9198-688ec00a8256`	2026-02-06 12:25:08 -08:00
Josh McKinney	e416e578bb	core: preconnect Responses websocket for first turn (#10698 ) ## Problem The first user turn can pay websocket handshake latency even when a session has already started. We want to reduce that initial delay while preserving turn semantics and avoiding any prompt send during startup. Reviewer feedback also called out duplicated connect/setup paths and unnecessary preconnect state complexity. ## Mental model `ModelClient` owns session-scoped transport state. During session startup, it can opportunistically warm one websocket handshake slot. A turn-scoped `ModelClientSession` adopts that slot once if available, restores captured sticky turn-state, and otherwise opens a websocket through the same shared connect path. If startup preconnect is still in flight, first turn setup awaits that task and treats it as the first connection attempt for the turn. Preconnect is handshake-only. The first `response.create` is still sent only when a turn starts. ## Non-goals This change does not make preconnect required for correctness and does not change prompt/turn payload semantics. It also does not expand fallback behavior beyond clearing preconnect state when fallback activates. ## Tradeoffs The implementation prioritizes simpler ownership and shared connection code over header-match gating for reuse. The single-slot cache keeps lifecycle straightforward but only benefits the immediate next turn. Awaiting in-flight preconnect has the same app-level connect-timeout semantics as existing websocket connect behavior (no new timeout class introduced by this PR). ## Architecture `core/src/client.rs`: - Added session-level preconnect lifecycle state (`Idle` / `InFlight` / `Ready`) carrying one warmed websocket plus optional captured turn-state. - Added `pre_establish_connection()` startup warmup and `preconnect()` handshake-only setup. - Deduped auth/provider resolution into `current_client_setup()` and websocket handshake wiring into `connect_websocket()` / `build_websocket_headers()`. - Updated turn websocket path to adopt preconnect first, await in-flight preconnect when present, then create a new websocket only when needed. - Ensured fallback activation clears warmed preconnect state. - Added documentation for lifecycle, ownership, sticky-routing invariants, and timeout semantics. `core/src/codex.rs`: - Session startup invokes `model_client.pre_establish_connection(...)`. - Turn metadata resolution uses the shared timeout helper. `core/src/turn_metadata.rs`: - Centralized shared timeout helper used by both turn-time metadata resolution and startup preconnect metadata building. `core/tests/common/responses.rs` + websocket test suites: - Added deterministic handshake waiting helper (`wait_for_handshakes`) with bounded polling. - Added startup preconnect and in-flight preconnect reuse coverage. - Fallback expectations now assert exactly two websocket attempts in covered scenarios (startup preconnect + turn attempt before fallback sticks). ## Observability Preconnect remains best-effort and non-fatal. Existing websocket/fallback telemetry remains in place, and debug logs now make preconnect-await behavior and preconnect failures easier to reason about. ## Tests Validated with: 1. `just fmt` 2. `cargo test -p codex-core websocket_preconnect -- --nocapture` 3. `cargo test -p codex-core websocket_fallback -- --nocapture` 4. `cargo test -p codex-core websocket_first_turn_waits_for_inflight_preconnect -- --nocapture`	2026-02-06 19:08:24 +00:00
viyatb-oai	8896ca0ee6	fix(linux-sandbox): block io_uring syscalls in no-network seccomp policy (#10814 ) ## Summary - Add seccomp deny rules for `io_uring` syscalls in the Linux sandbox network policy. - Specifically deny: - `SYS_io_uring_setup` - `SYS_io_uring_enter` - `SYS_io_uring_register`	2026-02-06 11:00:54 -08:00
viyatb-oai	db0d8710d5	feat(network-proxy): add structured policy decision to blocked errors (#10420 ) ## Summary Add explicit, model-visible network policy decision metadata to blocked proxy responses/errors. Introduces a standardized prefix line: `CODEX_NETWORK_POLICY_DECISION {json}` and wires it through blocked paths for: - HTTP requests - HTTPS CONNECT - SOCKS5 TCP/UDP denials ## Why The model should see why a request was blocked (reason/source/protocol/host/port) so it can choose the correct next action. ## Notes - This PR is intentionally independent of config-layering/network-rule runtime integration. - Focus is blocked decision surface only.	2026-02-06 10:46:50 -08:00
canvrno-oai	36c16e0c58	Add app configs to config.toml (#10822 ) Adds app configs to config.toml + tests	2026-02-06 10:29:08 -08:00
Charley Cunningham	b7ecd166a6	Queue nudges while plan generating (#10457 ) ## Summary This PR fixes a UI/streaming race when nudged or steer-enabled messages are queued during an active Plan stream. Previously, `submit_user_message_with_mode` switched collaboration mode immediately (via `set_collaboration_mask`) even when the message was queued. If that happened mid-Plan stream, `active_mode_kind` could flip away from Plan before the turn finished, causing subsequent `on_plan_delta` updates to be ignored in the UI. Now, mode switching is deferred until the queued message is actually submitted. ## What changed - Added a per-message deferred mode override on `UserMessage`: - `collaboration_mode_override: Option<CollaborationModeMask>` - Updated `submit_user_message_with_mode` to: - create a `UserMessage` carrying the mode override - queue or submit that message without mutating global mode immediately - Updated `submit_user_message` to: - apply `collaboration_mode_override` just before constructing/sending `Op::UserTurn` - Kept queueing condition scoped to active Plan stream rendering: - queue only while plan output is actively streaming in TUI (`plan_stream_controller.is_some()`) ## Why This preserves Plan mode for the remainder of the in-flight Plan turn, so streamed plan deltas continue rendering correctly, while still ensuring the follow-up queued message is sent with the intended collaboration mode. ## Behavior after this change - If a nudged/steer submission happens while Plan output is actively streaming: - message is queued - UI stays in Plan mode for the running turn - once dequeued/submitted, mode override is applied and the message is sent in the intended mode - If no Plan stream is active: - submission proceeds immediately and mode override is applied as before ## Tests Added/updated coverage in `tui/src/chatwidget/tests.rs`: - `submit_user_message_with_mode_queues_while_plan_stream_is_active` - asserts mode remains Plan while queued - asserts mode switches to Code when queued message is actually submitted - `submit_user_message_with_mode_submits_when_plan_stream_is_not_active` - `steer_enter_queues_while_plan_stream_is_active` - `steer_enter_submits_when_plan_stream_is_not_active` Also updated existing `UserMessage { ... }` test fixtures to include the new field. ## Codex author `codex fork 019c1047-d5d5-7c92-a357-6009604dc7e8`	2026-02-06 09:43:00 -08:00
Eric Traut	4521a6e852	Removed "exec_policy" feature flag (#10851 ) This is no longer needed because it's on by default	2026-02-06 08:59:47 -08:00
jif-oai	aab61934af	Handle required MCP startup failures across components (#10902 ) Summary - add a `required` flag for MCP servers everywhere config/CLI data is touched so mandatory helpers can be round-tripped - have `codex exec` and `codex app-server` thread start/resume fail fast when required MCPs fail to initialize	2026-02-06 17:14:37 +01:00
jif-oai	3800173459	feat: backfill async again (#10894 )	2026-02-06 15:41:52 +01:00

1 2 3 4 5 ...

2843 commits