core-agent-ide

Author	SHA1	Message	Date
pakrym-oai	30ca89424c	Always fallback to real shell (#6953 ) Either cmd.exe or `/bin/sh`.	2025-11-20 10:58:46 -08:00
Eric Traut	d909048a85	Added feature switch to disable animations in TUI (#6870 ) This PR adds support for a new feature flag `tui.animations`. By default, the TUI uses animations in its welcome screen, "working" spinners, and "shimmer" effects. These animations can interfere with screen readers, so it's good to provide a way to disable them. This change is inspired by [a PR](https://github.com/openai/codex/pull/4014) contributed by @Orinks. That PR has faltered a bit, but I think the core idea is sound. This version incorporates feedback from @aibrahim-oai. In particular: 1. It uses a feature flag (`tui.animations`) rather than the unqualified CLI key `no-animations`. Feature flags are the preferred way to expose boolean switches. They are also exposed via CLI command switches. 2. It includes more complete documentation. 3. It disables a few animations that the other PR omitted.	2025-11-20 10:40:08 -08:00
jif-oai	888c6dd9e7	fix: command formatting for user commands (#7002 )	2025-11-20 17:29:15 +01:00
hanson-openai	b5dd189067	Allow unified_exec to early exit (if the process terminates before yield_time_ms) (#6867 ) Thread an `exit_notify` tokio `Notify` through to the `UnifiedExecSession` so that we can return early if the command terminates before `yield_time_ms`. As Codex review correctly pointed out below 🙌 we also need an `exit_signaled` flag so that commands which finish before we start waiting can also exit early. Since the default `yield_time_ms` is now 10s, this means that we don't have to wait 10s for trivial commands like ls, sed, etc (which are the majority of agent commands 😅) --------- Co-authored-by: jif-oai <jif@openai.com>	2025-11-20 13:34:41 +01:00
Michael Bolin	54e6e4ac32	fix: when displaying execv, show file instead of arg0 (#6966 ) After merging https://github.com/openai/codex/pull/6958, I realized that the `command` I was displaying was not quite right. Since we know it, we should show the _exact_ program being executed (the first arg to `execve(3)`) rather than `arg0` to be more precise. Below is the same command I used to test https://github.com/openai/codex/pull/6958, but now you can see it shows `/Users/mbolin/.openai/bin/git` instead of just `git`. <img width="1526" height="1444" alt="image" src="https://github.com/user-attachments/assets/428128d1-c658-456e-a64e-fc6a0009cb34" />	2025-11-19 22:42:58 -08:00
Michael Bolin	e8af41de8a	fix: clean up elicitation used by exec-server (#6958 ) Using appropriate message/title fields, I think this looks better now: <img width="3370" height="3208" alt="image" src="https://github.com/user-attachments/assets/e9bbf906-4ba8-4563-affc-62cdc6c97342" /> Though note that in the current version of the Inspector (`0.17.2`), you cannot hit Submit until you fill out the field. I believe this is a bug in the Inspector, as it does not properly handle the case when all fields are optional. I put up a fix: https://github.com/modelcontextprotocol/inspector/pull/926	2025-11-20 04:59:17 +00:00
Owen Lin	d6c30ed25e	[app-server] feat: v2 apply_patch approval flow (#6760 ) This PR adds the API V2 version of the apply_patch approval flow, which centers around `ThreadItem::FileChange`. This PR wires the new RPC (`item/fileChange/requestApproval`, V2 only) and related events (`item/started`, `item/completed` for `ThreadItem::FileChange`, which are emitted in both V1 and V2) through the app-server protocol. The new approval RPC is only sent when the user initiates a turn with the new `turn/start` API so we don't break backwards compatibility with VSCE. Similar to https://github.com/openai/codex/pull/6758, the approach I took was to make as few changes to the Codex core as possible, leveraging existing `EventMsg` core events, and translating those in app-server. I did have to add a few additional fields to `EventMsg::PatchApplyBegin` and `EventMsg::PatchApplyEnd`, but those were fairly lightweight. However, the `EventMsg`s emitted by core are the following: ``` 1) Auto-approved (no request for approval) - EventMsg::PatchApplyBegin - EventMsg::PatchApplyEnd 2) Approved by user - EventMsg::ApplyPatchApprovalRequest - EventMsg::PatchApplyBegin - EventMsg::PatchApplyEnd 3) Declined by user - EventMsg::ApplyPatchApprovalRequest - EventMsg::PatchApplyBegin - EventMsg::PatchApplyEnd ``` For a request triggering an approval, this would result in: ``` item/fileChange/requestApproval item/started item/completed ``` which is different from the `ThreadItem::CommandExecution` flow introduced in https://github.com/openai/codex/pull/6758, which does the below and is preferable: ``` item/started item/commandExecution/requestApproval item/completed ``` To fix this, we leverage `TurnSummaryStore` on codex_message_processor to store a little bit of state, allowing us to fire `item/started` and `item/fileChange/requestApproval` whenever we receive the underlying `EventMsg::ApplyPatchApprovalRequest`, and no-oping when we receive the `EventMsg::PatchApplyBegin` later. This is much less invasive than modifying the order of EventMsg within core (I tried). The resulting payloads: ``` { "method": "item/started", "params": { "item": { "changes": [ { "diff": "Hello from Codex!\n", "kind": "add", "path": "/Users/owen/repos/codex/codex-rs/APPROVAL_DEMO.txt" } ], "id": "call_Nxnwj7B3YXigfV6Mwh03d686", "status": "inProgress", "type": "fileChange" } } } ``` ``` { "id": 0, "method": "item/fileChange/requestApproval", "params": { "grantRoot": null, "itemId": "call_Nxnwj7B3YXigfV6Mwh03d686", "reason": null, "threadId": "019a9e11-8295-7883-a283-779e06502c6f", "turnId": "1" } } ``` ``` { "id": 0, "result": { "decision": "accept" } } ``` ``` { "method": "item/completed", "params": { "item": { "changes": [ { "diff": "Hello from Codex!\n", "kind": "add", "path": "/Users/owen/repos/codex/codex-rs/APPROVAL_DEMO.txt" } ], "id": "call_Nxnwj7B3YXigfV6Mwh03d686", "status": "completed", "type": "fileChange" } } } ```	2025-11-19 20:13:31 -08:00
zhao-oai	fb9849e1e3	migrating execpolicy -> execpolicy-legacy and execpolicy2 -> execpolicy (#6956 )	2025-11-19 19:14:10 -08:00
Celia Chen	72a1453ac5	Revert "[core] add optional status_code to error events (#6865 )" (#6955 ) This reverts commit `c2ec477d93`.	2025-11-20 01:26:14 +00:00
Ahmed Ibrahim	6d67b8b283	stop model migration screen after first time. (#6954 ) it got serialized wrong.	2025-11-19 17:17:04 -08:00
zhao-oai	74a75679d9	update execpolicy quickstart readme (#6952 )	2025-11-19 16:57:27 -08:00
pakrym-oai	92e3046733	Single pass truncation (#6914 )	2025-11-19 16:56:37 -08:00
zhao-oai	65c13f1ae7	execpolicy2 core integration (#6641 ) This PR threads execpolicy2 into codex-core: - activated via the feature flag `exec_policy` (on by default) - reads and parses all .codexpolicy files in `codex_home/codex` - refactored the tool runtime API to integrate execpolicy logic --------- Co-authored-by: Michael Bolin <mbolin@openai.com>	2025-11-19 16:50:43 -08:00
Dylan Hurd	b00a7cf40d	fix(shell) fallback shells (#6948 ) ## Summary Add fallbacks when user_shell_path does not resolve to a known shell type ## Testing - [x] Tests still pass	2025-11-19 16:41:38 -08:00
Michael Bolin	13d378f2ce	chore: refactor exec-server to prepare it for standalone MCP use (#6944 ) This PR reorganizes things slightly so that: - Instead of a single multitool executable, `codex-exec-server`, we now have two executables: - `codex-exec-mcp-server` to launch the MCP server - `codex-execve-wrapper` is the `execve(2)` wrapper to use with the `BASH_EXEC_WRAPPER` environment variable - `BASH_EXEC_WRAPPER` must be a single executable: it cannot be a command string composed of an executable with args (i.e., it no longer adds the `escalate` subcommand, as before) - `codex-exec-mcp-server` takes `--bash` and `--execve` as options. Though if `--execve` is not specified, the MCP server will check the directory containing `std::env::current_exe()` and attempt to use the file named `codex-execve-wrapper` within it. In development, this works out since these executables are side-by-side in the `target/debug` folder. With respect to testing, this also fixes an important bug in `dummy_exec_policy()`, as I was using `ends_with()` as if it applied to a `String`, but in this case, it is used with a `&Path`, so the semantics are slightly different. Putting this all together, I was able to test this by running the following: ``` ~/code/codex/codex-rs$ npx @modelcontextprotocol/inspector \ ./target/debug/codex-exec-mcp-server --bash ~/code/bash/bash ``` If I try to run `git status` in `/Users/mbolin/code/codex` via the `shell` tool from the MCP server: <img width="1589" height="1335" alt="image" src="https://github.com/user-attachments/assets/9db6aea8-7fbc-4675-8b1f-ec446685d6c4" /> then I get prompted with the following elicitation, as expected: <img width="1589" height="1335" alt="image" src="https://github.com/user-attachments/assets/21b68fe0-494d-4562-9bad-0ddc55fc846d" /> Though a current limitation is that the `shell` tool defaults to a timeout of 10s, which means I only have 10s to respond to the elicitation. Ideally, the time spent waiting for a response from a human should not count against the timeout for the command execution. I will address this in a subsequent PR. --- Note `~/code/bash/bash` was created by doing: ``` cd ~/code git clone https://github.com/bminor/bash cd bash git checkout a8a1c2fac029404d3f42cd39f5a20f24b6e4fe4b <apply the patch below> ./configure make ``` The patch: ``` diff --git a/execute_cmd.c b/execute_cmd.c index 070f5119..d20ad2b9 100644 --- a/execute_cmd.c +++ b/execute_cmd.c @@ -6129,6 +6129,19 @@ shell_execve (char *command, char **args, char **env) char sample[HASH_BANG_BUFSIZ]; size_t larray; + char *exec_wrapper = getenv("BASH_EXEC_WRAPPER"); + if (exec_wrapper && *exec_wrapper && !whitespace (*exec_wrapper)) + { + char *orig_command = command; + + larray = strvec_len (args); + + memmove (args + 2, args, (++larray) * sizeof (char *)); + args[0] = exec_wrapper; + args[1] = orig_command; + command = exec_wrapper; + } + ```	2025-11-19 16:38:14 -08:00
Lionel Cheng	a6597a9958	Fix/correct reasoning display (#6749 ) This closes #6748 by implementing fallback to `model_family.default_reasoning_effort` in `reasoning_effort` display of `/status` when no `model_reasoning_effort` is set in the configuration. ## common/src/config_summary.rs - `create_config_summary_entries` now fills the "reasoning effort" entry with the explicit `config.model_reasoning_effort` when present and falls back to `config.model_family.default_reasoning_effort` when it is `None`, instead of emitting the literal string `none`. - This ensures downstream consumers such as `tui/src/status/helpers.rs` continue to work unchanged while automatically picking up model-family defaults when the user has not selected a reasoning effort. ## tui/src/status/helpers.rs / core/src/model_family.rs `ModelFamily::default_reasoning_effort` metadata is set to `medium` for both `gpt-5*-codex` and `gpt-5` models following the default behaviour of the API and recommendation of the codebase: - per https://platform.openai.com/docs/api-reference/responses/create `gpt-5` defaults to `medium` reasoning when no preset is passed - there is no mention of the preset for `gpt-5.1-codex` in the API docs but `medium` is the default setting for `gpt-5.1-codex` as per `codex-rs/tui/src/chatwidget/snapshots/codex_tui__chatwidget__tests__model_reasoning_selection_popup.snap` --------- Signed-off-by: lionelchg <lionel.cheng@hotmail.fr> Co-authored-by: Eric Traut <etraut@openai.com>	2025-11-19 15:52:24 -08:00
Beehive Innovations	692989c277	fix(context left after review): review footer context after `/review` (#5610 ) ## Summary - show live review token usage while `/review` runs and restore the main session indicator afterward - add regression coverage for the footer behavior ## Testing - just fmt - cargo test -p codex-tui Fixes #5604 --------- Signed-off-by: Fahad <fahad@2doapp.com>	2025-11-19 22:50:07 +00:00
iceweasel-oai	2fde03b4a0	stop over-reporting world-writable directories (#6936 ) Fix world-writable audit false positives by expanding generic permissions with MapGenericMask and then checking only concrete write bits. The earlier check looked for FILE_GENERIC_WRITE/generic masks directly, which shares bits with read permissions and could flag an Everyone read ACE as writable.	2025-11-19 13:59:17 -08:00
Michael Bolin	056c8f8279	fix: prepare ExecPolicy in exec-server for execpolicy2 cutover (#6888 ) This PR introduces an extra layer of abstraction to prepare us for the migration to execpolicy2: - introduces a new trait, `EscalationPolicy`, whose `determine_action()` method is responsible for producing the `EscalateAction` - the existing `ExecPolicy` typedef is changed to return an intermediate `ExecPolicyOutcome` instead of `EscalateAction` - the default implementation of `EscalationPolicy`, `McpEscalationPolicy`, composes `ExecPolicy` - the `ExecPolicyOutcome` includes `codex_execpolicy2::Decision`, which has a `Prompt` variant - when `McpEscalationPolicy` gets `Decision::Prompt` back from `ExecPolicy`, it prompts the user via an MCP elicitation and maps the result into an `ElicitationAction` - now that the end user can reply to an elicitation with `Decline` or `Cancel`, we introduce a new variant, `EscalateAction::Deny`, which the client handles by returning exit code `1` without running anything Note the way the elicitation is created is still not quite right, but I will fix that once we have things running end-to-end for real in a follow-up PR.	2025-11-19 13:55:29 -08:00
Celia Chen	c2ec477d93	[core] add optional status_code to error events (#6865 ) We want to better surface error status codes to clients. Add an optional status_code to error events (thread error, error, stream error) so that the app server can surface the status code to clients later. In the event log: ``` < { < "method": "codex/event/stream_error", < "params": { < "conversationId": "019a9a32-f576-7292-9711-8e57e8063536", < "id": "0", < "msg": { < "message": "Reconnecting... 5/5", < "status_code": 401, < "type": "stream_error" < } < } < } < { < "method": "codex/event/error", < "params": { < "conversationId": "019a9a32-f576-7292-9711-8e57e8063536", < "id": "0", < "msg": { < "message": "exceeded retry limit, last status: 401 Unauthorized, request id: 9a0cb03a485067f7-SJC", < "status_code": 401, < "type": "error" < } < } < } ```	2025-11-19 19:51:21 +00:00
Dylan Hurd	20982d5c6a	fix(app-server) move windows world writable warning (#6916 ) ## Summary Move the app-server warning into the process_new_conversation ## Testing - [x] Tested locally	2025-11-19 11:24:49 -08:00
pakrym-oai	64ae9aa3c3	Keep gpt-5.1-codex the default (#6922 )	2025-11-19 11:08:10 -08:00
zhao-oai	72af589398	storing credits (#6858 ) Expand the rate-limit cache/TUI: store credit snapshots alongside primary and secondary windows, render “Credits” when the backend reports they exist (unlimited vs rounded integer balances)	2025-11-19 10:49:35 -08:00
iceweasel-oai	b3d320433f	have world_writable_warning_details accept cwd as a param (#6913 ) this enables app-server to pass in the correct workspace cwd for the current conversation	2025-11-19 10:10:03 -08:00
jif-oai	91a1d20e2d	use another prompt (#6912 )	2025-11-19 17:47:47 +00:00
jif-oai	87716e7cd0	NITs (#6911 )	2025-11-19 17:43:51 +00:00
jif-oai	8976551f0d	Fix ordering 2 (#6910 )	2025-11-19 17:40:27 +00:00
jif-oai	f1d6767685	fix: ordering (#6909 )	2025-11-19 17:39:07 +00:00
Ahmed Ibrahim	d62cab9a06	fix: don't truncate at new lines (#6907 )	2025-11-19 17:05:48 +00:00
Ahmed Ibrahim	d5dfba2509	feat: arcticfox in the wild (#6906 ) <img width="485" height="600" alt="image" src="https://github.com/user-attachments/assets/4341740d-dd58-4a3e-b69a-33a3be0606c5" /> --------- Co-authored-by: jif-oai <jif@openai.com>	2025-11-19 16:31:06 +00:00
Owen Lin	1924500250	[app-server] populate thread>turns>items on thread/resume (#6848 ) This PR allows clients to render historical messages when resuming a thread via `thread/resume` by reading from the list of `EventMsg` payloads loaded from the rollout, and then transforming them into Turns and ThreadItems to be returned on the `Thread` object. This is implemented by leveraging `SessionConfiguredNotification` which returns this list of `EventMsg` objects when resuming a conversation, and then applying a stateful `ThreadHistoryBuilder` that parses from this EventMsg log and transforms it into Turns and ThreadItems. Note that we only persist a subset of `EventMsg`s in a rollout as defined in `policy.rs`, so we lose fidelity whenever we resume a thread compared to when we streamed the thread's turns originally. However, this behavior is at parity with the legacy API.	2025-11-19 15:58:09 +00:00
jif-oai	cfc57e14c7	nit: useless log to debug (#6898 ) When you type too fast in most terminals, it gets interpreted as paste, making this log spam	2025-11-19 12:32:53 +00:00
Dylan Hurd	15b5eb30ed	fix(core) Support changing /approvals before conversation (#6836 ) ## Summary Setting `/approvals` before the start of a conversation was not updating the environment_context for a conversation. Not sure exactly when this problem was introduced, but this should reduce model confusion dramatically. ## Testing - [x] Added unit test to reproduce bug, confirmed fix with update - [x] Tested locally	2025-11-19 11:32:48 +00:00
jif-oai	3e9e1d993d	chore: consolidate compaction token usage (#6894 )	2025-11-19 11:26:01 +00:00
Dylan Hurd	44c747837a	chore(app-server) world-writable windows notification (#6880 ) ## Summary On app-server startup, detect whether the experimental sandbox is enabled, and send a notification. Note: new conversations will not respect the feature because we [ignore cli overrides in NewConversation](`a75321a64c/codex-rs/app-server/src/codex_message_processor.rs (L1237-L1252)`). However, this should be okay, since we don't actually use config for this; we use a [global variable](`87cce88f48/codex-rs/core/src/safety.rs (L105-L110)`). We should carefully unwind this setup at some point. ## Testing - [ ] In progress: testing locally --------- Co-authored-by: jif-oai <jif@openai.com>	2025-11-19 11:19:34 +00:00
jif-oai	4985a7a444	fix: parallel tool call instruction injection (#6893 )	2025-11-19 11:01:57 +00:00
jif-oai	10d571f236	nit: stable (#6895 )	2025-11-19 10:43:43 +00:00
jif-oai	956d3bfac6	feat: warning large commits (#6838 )	2025-11-19 10:22:10 +00:00
Thibault Sottiaux	73488657cb	fix label (#6892 )	2025-11-19 10:11:30 +00:00
Ahmed Ibrahim	efebc62fb7	Move shell to use `truncate_text` (#6842 ) Move shell to use the configurable `truncate_text` --------- Co-authored-by: pakrym-oai <pakrym@openai.com>	2025-11-19 01:56:08 -08:00
pakrym-oai	75f38f16dd	Run remote auto compaction (#6879 )	2025-11-19 00:43:58 -08:00
Ahmed Ibrahim	0440a3f105	flaky-unified_exec_formats_large_output_summary (#6884 )	2025-11-19 00:00:37 -08:00
pakrym-oai	ee0484a98c	shell_command returns freeform output (#6860 ) Instead of returning structured output and then re-formatting it into freeform, return the freeform output directly from the shell_command tool. Keep `shell` as the default tool for GPT-5.	2025-11-18 23:38:43 -08:00
Dylan Hurd	7e0e675db4	chore(core) arcticfox (#6876 ) ..	2025-11-18 23:38:08 -08:00
Dylan Hurd	84458f12f6	fix(tui) ghost snapshot notifications (#6881 ) ## Summary - avoid surfacing ghost snapshot warnings in the TUI when snapshot creation fails, logging the conditions instead - continue to capture successful ghost snapshots without changing existing behavior ## Testing - `cargo test -p codex-core` (fails: default_client::tests::test_create_client_sets_default_headers, default_client::tests::test_get_codex_user_agent, exec::tests::kill_child_process_group_kills_grandchildren_on_timeout) ------ [Codex Task](https://chatgpt.com/codex/tasks/task_i_691c02238db08322927c47b8c2d72c4c)	2025-11-18 23:23:00 -08:00
Ahmed Ibrahim	793063070b	fix: typos in model picker (#6859 )	2025-11-19 06:29:02 +00:00
ae	030d1d5b1c	chore: update windows docs url (#6877 ) - Testing: None	2025-11-19 06:24:17 +00:00
ae	7e6316d4aa	feat: tweak windows sandbox strings (#6875 ) New strings: 1. Approval mode picker just says "Select Approval Mode" 2. Updated "Auto" to "Agent" 3. When you select "Agent", you get "Agent mode on Windows uses an experimental sandbox to limit network and filesystem access. [Learn more]" 4. Updated world-writable warning to "The Windows sandbox cannot protect writes to folders that are writable by Everyone. Consider removing write access for Everyone from the following folders: {folders}" --------- Co-authored-by: iceweasel-oai <iceweasel@openai.com>	2025-11-19 06:00:06 +00:00
Michael Bolin	a75321a64c	fix: add more fields to ThreadStartResponse and ThreadResumeResponse (#6847 ) This adds the following fields to `ThreadStartResponse` and `ThreadResumeResponse`: ```rust pub model: String, pub model_provider: String, pub cwd: PathBuf, pub approval_policy: AskForApproval, pub sandbox: SandboxPolicy, pub reasoning_effort: Option<ReasoningEffort>, ``` This is important because these fields are optional in `ThreadStartParams` and `ThreadResumeParams`, so the caller needs to be able to determine what values were ultimately used to start/resume the conversation. (Though note that any of these could be changed later between turns in the conversation.) Though to get this information reliably, it must be read from the internal `SessionConfiguredEvent` that is created in response to the start of a conversation. Because `SessionConfiguredEvent` (as defined in `codex-rs/protocol/src/protocol.rs`) did not have all of these fields, a number of them had to be added as part of this PR. Because `SessionConfiguredEvent` is referenced in many tests, test instances of `SessionConfiguredEvent` had to be updated, as well, which is why this PR touches so many files.	2025-11-18 21:18:43 -08:00
ae	7508e4fd2d	chore: update windows sandbox docs (#6872 )	2025-11-18 21:02:04 -08:00
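
Commit 13d378f2ce above mentions that `ends_with()` behaves differently on a `&Path` than on a `String`. The sketch below illustrates that difference using only the Rust standard library. The helper names and the sample path are hypothetical and are not taken from `dummy_exec_policy()`.

```rust
use std::path::Path;

/// String suffix match: any trailing run of characters matches.
/// (Hypothetical helper, for illustration only.)
fn str_suffix_matches(program: &str, suffix: &str) -> bool {
    program.ends_with(suffix)
}

/// Path suffix match: only whole trailing path components match.
/// (Hypothetical helper, for illustration only.)
fn path_suffix_matches(program: &Path, suffix: &str) -> bool {
    program.ends_with(suffix)
}

fn main() {
    // Sample path chosen for illustration; it is not from the PR.
    let program = "/usr/local/bin/git";
    for suffix in ["git", "it", "bin/git"] {
        println!(
            "suffix {:?}: str = {}, path = {}",
            suffix,
            str_suffix_matches(program, suffix),
            path_suffix_matches(Path::new(program), suffix),
        );
    }
}
```

A policy check that is meant to match a program by file name would typically want the component-wise `Path` behavior, since a plain string suffix such as `it` would also match `git`.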


2079 commits