core-agent-ide

Author	SHA1	Message	Date
Ahmed Ibrahim	b7fa7ca8e9	Update Model Info (#7853 )	2025-12-11 14:06:07 -08:00
iceweasel-oai	3e81ed4b91	Elevated Sandbox 3 (#7809 ) dedicated sandbox command runner exe.	2025-12-11 13:51:27 -08:00
Ahmed Ibrahim	c4f3f566a5	remove release script (#7885 )	2025-12-11 13:40:48 -08:00
Ahmed Ibrahim	b9fb3b81e5	Chore: limit find family visability (#7891 ) a little bit more code quality of life	2025-12-11 13:30:56 -08:00
Anton Panasenko	0af7e4a195	fix: omit reasoning summary when ReasoningSummary::None (#7845 ) ``` { "error": { "message": "Invalid value: 'none'. Supported values are: 'concise', 'detailed', and 'auto'.", "type": "invalid_request_error", "param": "reasoning.summary", "code": "invalid_value" } } ```	2025-12-11 11:59:40 -08:00
Tyler Anton	8c4c6a19e0	fix: drop stale filedescriptor output hash for nix (#7865 ) Fixes: #7863 - Remove the `filedescriptor-0.8.3` entry from `codex-rs/default.nix` output hashes because the crate now comes from crates.io.	2025-12-11 10:43:50 -08:00
sayan-oai	703bf12b36	fix: dont quit on 'q' in onboarding ApiKeyEntry state (#7869 ) ### What Don't treat `q` as a special quit character on the API key paste page in the onboarding flow. This addresses #7413, where pasting API keys with `q` would cause codex to quit on Windows. ### Test Plan Tested on Windows and MacOS.	2025-12-11 09:57:59 -08:00
pakrym-oai	bb8fdb20dc	Revert "Only show Worked for after the final assistant message" (#7884 ) Reverts openai/codex#7854	2025-12-11 09:11:42 -08:00
Ahmed Ibrahim	238ce7dfad	feat: robin (#7882 ) <img width="554" height="554" alt="image" src="https://github.com/user-attachments/assets/aa86f4c8-fb34-4b0e-8b03-3a9980dfdb08" /> --------- Co-authored-by: Dylan Hurd <dylan.hurd@openai.com>	2025-12-11 09:04:08 -08:00
jif-oai	d4554ce6c8	fix: flaky tests 4 (#7875 )	2025-12-11 14:26:27 +00:00
jif-oai	29381ba5c2	feat: add shell snapshot for shell command (#7786 )	2025-12-11 13:46:43 +00:00
jif-oai	b2280d6205	feat: warning for long snapshots (#7870 )	2025-12-11 12:42:47 +00:00
Dylan Hurd	dca7f4cb60	fix(stuff) (#7855 ) Co-authored-by: Ahmed Ibrahim <aibrahim@openai.com>	2025-12-11 00:39:47 -08:00
iceweasel-oai	13c0919bff	Elevated Sandbox 2 (#7792 ) - DPAPI helpers for storing Sandbox user passwords securely - creation of Offline/Online sandbox users - ACL setup for sandbox users - firewall rule setup	2025-12-10 21:23:16 -08:00
pakrym-oai	83aac0f985	Only show Worked for after the final assistant message (#7854 ) Before: <img width="1908" height="246" alt="image" src="https://github.com/user-attachments/assets/f4d5993a-8d37-4982-a6fd-d37f449215b2" /> After: <img width="1102" height="586" alt="image" src="https://github.com/user-attachments/assets/e833140d-690a-4c33-8bc7-e2b69b9dc92d" />	2025-12-10 21:13:13 -08:00
Eric Traut	057250020a	Fixed regression that broke fuzzy matching for slash commands (#7859 ) This addresses bug #7857 which was introduced recently as part of PR #7704.	2025-12-10 20:42:45 -08:00
Michael Bolin	3fc8b2894f	fix: remove inaccurate `#[allow(dead_code)]` marker (#7851 ) Me reading this clippy warning: <img width="263" height="191" alt="image" src="https://github.com/user-attachments/assets/3a936a17-f91d-47bc-a08a-cafb154e9e32" />	2025-12-10 17:48:46 -08:00
Celia Chen	ce19dbbb22	[app-server] Update readme to include mcp endpoints (#7850 ) n/a	2025-12-11 01:08:31 +00:00
Michael Bolin	038767af69	fix: add a hopefully-temporary sleep to reduce test flakiness (#7848 ) Let's see if this `sleep()` call is good enough to fix the test flakiness we currently see in CI. It will take me some time to upstream a proper fix, and I would prefer not to disable this test in the interim.	2025-12-11 00:51:33 +00:00
Celia Chen	7cabe54fc7	[app-server] make app server not throw error when login id is not found (#7831 ) Our previous design of cancellation endpoint is not idempotent, which caused a bunch of flaky tests. Make app server just returned a not_found status instead of throwing an error if the login id is not found. Keep V1 endpoint behavior the same.	2025-12-10 16:19:40 -08:00
zhao-oai	c1367808fb	fixing typo in execpolicy docs (#7847 )	2025-12-10 16:11:46 -08:00
Michael Bolin	87f5b69b24	fix: ensure accept_elicitation_for_prompt_rule() test passes locally (#7832 ) When I originally introduced `accept_elicitation_for_prompt_rule()` in https://github.com/openai/codex/pull/7617, it worked for me locally because I had run `codex-rs/exec-server/tests/suite/bash` once myself, which had the side-effect of installing the corresponding DotSlash artifact. In CI, I added explicit logic to do this as part of `.github/workflows/rust-ci.yml`, which meant the test also passed in CI, but this logic should have been done as part of the test so that it would work locally for devs who had not installed the DotSlash artifact for `codex-rs/exec-server/tests/suite/bash` before. This PR updates the test to do this (and deletes the setup logic from `rust-ci.yml`), creating a new `DOTSLASH_CACHE` in a temp directory so that this is handled independently for each test. While here, also added a check to ensure that the `codex` binary has been built prior to running the test, as we have to ensure it is symlinked as `codex-linux-sandbox` on Linux in order for the integration test to work on that platform.	2025-12-10 15:17:13 -08:00
Javi	e2559ab28d	fix: thread/list returning fewer than the requested amount due to filtering CXA-293 (#7509 ) This caused some conversations to not appear when they otherwise should. Prior to this change, `thread/list`/`list_conversations_common` would: - Fetch N conversations from `RolloutRecorder::list_conversations` - Then it would filter those (like by the provided `model_providers`) - This would make it potentially return less than N items. With this change: - `list_conversations_common` now continues fetching more conversations from `RolloutRecorder::list_conversations` until it "fills up" the `requested_page_size`. - Ultimately this means that clients can rely on getting eg 20 conversations if they request 20 conversations.	2025-12-10 23:06:32 +00:00
Josh McKinney	90f262e9a4	feat(tui2): copy tui crate and normalize snapshots (#7833 ) Introduce a full codex-tui source snapshot under the new codex-tui2 crate so viewport work can be replayed in isolation. This change copies the entire codex-rs/tui/src tree into codex-rs/tui2/src in one atomic step, rather than piecemeal, to keep future diffs vs the original viewport bookmark easy to reason about. The goal is for codex-tui2 to render identically to the existing TUI behind the `features.tui2` flag while we gradually port the viewport/history commits from the joshka/viewport bookmark onto this forked tree. While on this baseline change, we also ran the codex-tui2 snapshot test suite and accepted all insta snapshots for the new crate, so the snapshot files now use the codex-tui2 naming scheme and encode the unmodified legacy TUI behavior. This keeps later viewport commits focused on intentional behavior changes (and their snapshots) rather than on mechanical snapshot renames.	2025-12-10 22:53:46 +00:00
Ahmed Ibrahim	321625072a	Show the default model in model picker (#7838 ) See the snapshot	2025-12-10 14:01:18 -08:00
xl-openai	b36ecb6c32	Inject SKILL.md when it's explicitly mentioned. (#7763 ) 1. Skills load once in core at session start; the cached outcome is reused across core and surfaced to TUI via SessionConfigured. 2. TUI detects explicit skill selections, and core injects the matching SKILL.md content into the turn when a selected skill is present.	2025-12-10 13:59:17 -08:00
pakrym-oai	eb2e5458cc	Disable ansi codes in tui log file (#7836 )	2025-12-10 13:56:48 -08:00
Celia Chen	bfb4d5710b	[app-server-protocol] Add types for config (#7658 ) Currently the config returned by `config/read` in untyped. Add types so it's easier for client to parse the config. Since currently configs are all defined in snake case we'll keep that instead of using camel case like the rest of V2. Sample output by testing using the app server test client: ``` { < "id": "f28449f4-b015-459b-b07b-eef06980165d", < "result": { < "config": { < "approvalPolicy": null, < "compactPrompt": null, < "developerInstructions": null, < "features": { < "experimental_use_rmcp_client": true < }, < "forcedChatgptWorkspaceId": null, < "forcedLoginMethod": null, < "instructions": null, < "model": "gpt-5.1-codex-max", < "modelAutoCompactTokenLimit": null, < "modelContextWindow": null, < "modelProvider": null, < "modelReasoningEffort": null, < "modelReasoningSummary": null, < "modelVerbosity": null, < "model_providers": { < "local": { < "base_url": "http://localhost:8061/api/codex", < "env_http_headers": { < "ChatGPT-Account-ID": "OPENAI_ACCOUNT_ID" < }, < "env_key": "CHATGPT_TOKEN_STAGING", < "name": "local", < "wire_api": "responses" < } < }, < "model_reasoning_effort": "medium", < "notice": { < "hide_gpt-5.1-codex-max_migration_prompt": true, < "hide_gpt5_1_migration_prompt": true < }, < "profile": null, < "profiles": {}, < "projects": { < "/Users/celia/code": { < "trust_level": "trusted" < }, < "/Users/celia/code/codex": { < "trust_level": "trusted" < }, < "/Users/celia/code/openai": { < "trust_level": "trusted" < } < }, < "reviewModel": null, < "sandboxMode": null, < "sandboxWorkspaceWrite": null, < "tools": { < "viewImage": null, < "webSearch": null < } < }, < "origins": { < "features.experimental_use_rmcp_client": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "model": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "model_providers.local.base_url": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "model_providers.local.env_http_headers.ChatGPT-Account-ID": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "model_providers.local.env_key": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "model_providers.local.name": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "model_providers.local.wire_api": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "model_reasoning_effort": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "notice.hide_gpt-5.1-codex-max_migration_prompt": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "notice.hide_gpt5_1_migration_prompt": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "projects./Users/celia/code.trust_level": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "projects./Users/celia/code/codex.trust_level": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "projects./Users/celia/code/openai.trust_level": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < }, < "tools.web_search": { < "name": "user", < "source": "/Users/celia/.codex/config.toml", < "version": "sha256:a1d8eaedb5d9db5dfdfa69f30fa9df2efec66bb4dd46aa67f149fcc67cd0711c" < } < } < } < } ```	2025-12-10 21:35:31 +00:00
Ahmed Ibrahim	4953b2ae09	Error when trying to push a release while another release is in progress (#7834 ) <img width="995" height="171" alt="image" src="https://github.com/user-attachments/assets/7bab541a-a933-4064-a968-26e9566360ec" /> Currently, we just cancel the in progress release which can be annoying	2025-12-10 12:15:39 -08:00
Robby He	1a5809624d	fix: Prevent slash command popup from activating on invalid inputs (#7704 ) ## Slash Command popup issue #7659 When recalling history, the composer(`codex_tui::bottom_pane::chat_composer`) restores the previous prompt text (which may start with `/`) and then calls `sync_command_popup`. The logic in `sync_command_popup` treats any first line that starts with `/` and has the caret inside the initial `/name` token as an active slash command name: ```rust let is_editing_slash_command_name = if first_line.starts_with('/') && caret_on_first_line { let token_end = first_line .char_indices() .find(\|(_, c)\| c.is_whitespace()) .map(\|(i, _)\| i) .unwrap_or(first_line.len()); cursor <= token_end } else { false }; ``` This detection does not distinguish between an actual interactive slash command being typed and a normal historical prompt that happens to begin with `/`. As a result, after history recall, the restored prompt like `/ test` is interpreted as an "editing command name" context and the slash-command popup is (re)activated. Once `active_popup` is `ActivePopup::Command`, subsequent `Up` key presses are handled by `handle_key_event_with_slash_popup` instead of `handle_key_event_without_popup`, so they no longer trigger `history.navigate_up(...)` and the session prompt history cannot be scrolled.	2025-12-10 11:38:15 -08:00
Ahmed Ibrahim	cb9a189857	make `model` optional in config (#7769 ) - Make Config.model optional and centralize default-selection logic in ModelsManager, including a default_model helper (with codex-auto-balanced when available) so sessions now carry an explicit chosen model separate from the base config. - Resolve `model` once in `core` and `tui` from config. Then store the state of it on other structs. - Move refreshing models to be before resolving the default model	2025-12-10 11:19:00 -08:00
Celia Chen	8a71f8b634	[app-server] Make sure that config writes preserve comments & order or configs (#7789 ) Make sure that config writes preserve comments and order of configs by utilizing the ConfigEditsBuilder in core. Tested by running a real example and made sure that nothing in the config file changes other than the configs to edit.	2025-12-10 19:14:27 +00:00
pakrym-oai	4b684c53ae	Remove conversation_id and bring back request ID logging (#7830 )	2025-12-10 10:44:12 -08:00
Koichi Shiraishi	9f40d6eeeb	fix: remove duplicated parallel FeatureSpec (#7823 ) regression: #7589 Signed-off-by: Koichi Shiraishi <zchee.io@gmail.com>	2025-12-10 10:23:01 -08:00
Amit Halfon	bd51d1b103	fix: Upgrade @modelcontextprotocol/sdk to ^1.24.0 (#7817 ) ## What? Upgrades @modelcontextprotocol/sdk from ^1.20.2 to ^1.24.0 in the TypeScript SDK's devDependencies. ## Why? Related to #7737 - keeping development dependencies up to date with the latest MCP SDK version that includes the fix for CVE-2025-66414. Note: This change does not address the CVE for Codex users, as the MCP SDK is only in devDependencies here. The actual MCP integration that would be affected by the CVE is in the Rust codebase. ## How? • Updated dependency version in sdk/typescript/package.json • Ran pnpm install to update lockfile • Fixed formatting (added missing newline in package.json) ## Related Issue Related to #7737 ## Test Status ⚠️ After this upgrade, 2 additional tests timeout (1 test was already failing on main): • tests/run.test.ts: "sends previous items when run is called twice" • tests/run.test.ts: "resumes thread by id" • tests/runStreamed.test.ts: "sends previous items when runStreamed is called twice" Marking as draft to investigate test timeouts. Maintainer guidance would be appreciated. Co-authored-by: HalfonA <amit@miggo.io>	2025-12-10 10:17:00 -08:00
jif-oai	f677d05871	fix: flaky tests 3 (#7826 )	2025-12-10 17:57:53 +00:00
Eric Traut	c4af707e09	Removed experimental "command risk assessment" feature (#7799 ) This experimental feature received lukewarm reception during internal testing. Removing from the code base.	2025-12-10 09:48:11 -08:00
zhao-oai	e0fb3ca1db	refactoring with_escalated_permissions to use SandboxPermissions instead (#7750 ) helpful in the future if we want more granularity for requesting escalated permissions: e.g when running in readonly sandbox, model can request to escalate to a sandbox that allows writes	2025-12-10 17:18:48 +00:00
jif-oai	97b90094cd	feat: use remote branch for review is local trails (#7813 )	2025-12-10 17:04:52 +00:00
jif-oai	463249eff3	fix: flaky test 2 (#7818 )	2025-12-10 16:35:28 +00:00
jif-oai	0ad54982ae	chore: rework unified exec events (#7775 )	2025-12-10 10:30:38 +00:00
Shijie Rao	d1c5db5796	chore: disable trusted signing pkg cache hit (#7807 )	2025-12-09 22:14:14 -08:00
Gav Verma	6fa24d65f5	Express rate limit warning as % remaining (#7795 ) <img width="342" height="264" alt="image" src="https://github.com/user-attachments/assets/f1e932ff-c550-47b3-9035-0299ada4998d" /> Earlier, the warning was expressed as consumed% whereas status was expressed as remaining%. This change brings the two into sync to minimize confusion and improve visual consistency.	2025-12-09 21:17:57 -08:00
Shijie Rao	ab9ddcd50b	Revert "Revert "feat: windows codesign with Azure trusted signing"" (#7806 ) Reverts openai/codex#7804	2025-12-09 20:42:00 -08:00
Shijie Rao	f11520f5f1	Revert "feat: windows codesign with Azure trusted signing" (#7804 ) Reverts openai/codex#7757	2025-12-09 20:19:37 -08:00
Shijie Rao	42e0817398	Revert "Revert "feat: windows codesign with Azure trusted signing"" (#7757 ) Reverts openai/codex#7753 Updated the tag ref matching at https://github.com/openai/openai/pull/594858 so that release with tag change can be picked up correctly.	2025-12-09 19:31:46 -08:00
iceweasel-oai	fc4249313b	Elevated Sandbox 1 (#7788 ) - updating helpers, refactoring some functions that will be used in the elevated sandbox - better logging - better and faster handling of ACL checks/writes - No functional change—legacy restricted-token sandbox remains the only path.	2025-12-09 19:00:33 -08:00
pakrym-oai	967d063f4b	parse rg \| head a search (#7797 )	2025-12-09 18:30:16 -08:00
Shijie Rao	893f5261eb	feat: support mcp in-session login (#7751 ) ### Summary * Added `mcpServer/oauthLogin` in app server for supporting in session MCP server login * Added `McpServerOauthLoginParams` and `McpServerOauthLoginResponse` to support above method with response returning the auth URL for consumer to open browser or display accordingly. * Added `McpServerOauthLoginCompletedNotification` which the app server would emit on MCP server login success or failure (i.e. timeout). * Refactored rmcp-client oath_login to have the ability on starting a auth server which the codex_message_processor uses for in-session auth.	2025-12-09 17:43:53 -08:00
Michael Bolin	fa4cac1e6b	fix: introduce AbsolutePathBuf and resolve relative paths in config.toml (#7796 ) This PR attempts to solve two problems by introducing a `AbsolutePathBuf` type with a special deserializer: - `AbsolutePathBuf` attempts to be a generally useful abstraction, as it ensures, by constructing, that it represents a value that is an absolute, normalized path, which is a stronger guarantee than an arbitrary `PathBuf`. - Values in `config.toml` that can be either an absolute or relative path should be resolved against the folder containing the `config.toml` in the relative path case. This PR makes this easy to support: the main cost is ensuring `AbsolutePathBufGuard` is used inside `deserialize_config_toml_with_base()`. While `AbsolutePathBufGuard` may seem slightly distasteful because it relies on thread-local storage, this seems much cleaner to me than using than my various experiments with https://docs.rs/serde/latest/serde/de/trait.DeserializeSeed.html. Further, since the `deserialize()` method from the `Deserialize` trait is not async, we do not really have to worry about the deserialization work being spread across multiple threads in a way that would interfere with `AbsolutePathBufGuard`. To start, this PR introduces the use of `AbsolutePathBuf` in `OtelTlsConfig`. Note how this simplifies `otel_provider.rs` because it no longer requires `settings.codex_home` to be threaded through. Furthermore, this sets us up better for a world where multiple `config.toml` files from different folders could be loaded and then merged together, as the absolutifying of the paths must be done against the correct parent folder.	2025-12-09 17:37:52 -08:00

1 2 3 4 5 ...

2382 commits