core-agent-ide

Author	SHA1	Message	Date
jif-oai	4971e96a98	nit: shell snapshot retention to 3 days (#10382 )	2026-02-02 12:52:45 +00:00
jif-oai	3cc9122ee2	feat: experimental flags (#10231 ) ## Problem being solved - We need a single, reliable way to mark app-server API surface as experimental so that: 1. the runtime can reject experimental usage unless the client opts in 2. generated TS/JSON schemas can exclude experimental methods/fields for stable clients. Right now that’s easy to drift or miss when done ad-hoc. ## How to declare experimental methods and fields - Experimental method: add `#[experimental("method/name")]` to the `ClientRequest` variant in `client_request_definitions!`. - Experimental field: on the params struct, derive `ExperimentalApi` and annotate the field with `#[experimental("method/name.field")]` + set `inspect_params: true` for the method variant so `ClientRequest::experimental_reason()` inspects params for experimental fields. ## How the macro solves it - The new derive macro lives in `codex-rs/codex-experimental-api-macros/src/lib.rs` and is used via `#[derive(ExperimentalApi)]` plus `#[experimental("reason")]` attributes. - Structs: - Generates `ExperimentalApi::experimental_reason(&self)` that checks only annotated fields. - The “presence” check is type-aware: - `Option<T>`: `is_some_and(...)` recursively checks inner. - `Vec`/`HashMap`/`BTreeMap`: must be non-empty. - `bool`: must be `true`. - Other types: considered present (returns `true`). - Registers each experimental field in an `inventory` with `(type_name, serialized field name, reason)` and exposes `EXPERIMENTAL_FIELDS` for that type. Field names are converted from `snake_case` to `camelCase` for schema/TS filtering. - Enums: - Generates an exhaustive `match` returning `Some(reason)` for annotated variants and `None` otherwise (no wildcard arm). - Wiring: - Runtime gating uses `ExperimentalApi::experimental_reason()` in `codex-rs/app-server/src/message_processor.rs` to reject requests unless `InitializeParams.capabilities.experimental_api == true`. - Schema/TS export filters use the inventory list and `EXPERIMENTAL_CLIENT_METHODS` from `client_request_definitions!` to strip experimental methods/fields when `experimental_api` is false.	2026-02-02 11:06:50 +00:00
jif-oai	9513f18bfe	chore: collab experimental (#10381 )	2026-02-02 10:57:44 +00:00
pap-openai	1644cbfc6d	Session picker shows thread_name if set (#10340 ) - shows names of threads in the ResumePicker used by `/resume` and `codex resume` if set, default to preview (previous behaviour) if none - adds a `find_thread_names_by_ids` that maps names to IDs in `codex-rs/core/src/rollout/session_index.rs`. It reads sequentially in normal (instead of reverse order in `codex resume <name>`) the index mapping file. This function is called from a list of session (default page is 25, pages loaded depends of height of terminal), for which most of them will always have at least one session unnamed and require the whole file to be read therefore. Could be better and sqlite integration will make this better - those reads won't be needed when leveraging sqlite Opened questions: - We could rename the TUI "Conversation" column to "Name" or "Thread" that would feel more accurate. Could be a fast-follow if we implement auto-naming as it'll always be a name instead?	2026-02-02 08:13:17 +00:00
Michael Bolin	974355cfdd	feat: vendor app-server protocol schema fixtures (#10371 ) Similar to what @sayan-oai did in openai/codex#8956 for `config.schema.json`, this PR updates the repo so that it includes the output of `codex app-server generate-json-schema` and `codex app-server generate-ts` and adds a test to verify it is in sync with the current code. Motivation: - This makes any schema changes introduced by a PR transparent during code review. - In particular, this should help us catch PRs that would introduce a non-backwards-compatible change to the app schema (eventually, this should also be enforced by tooling). - Once https://github.com/openai/codex/pull/10231 is in to formalize the notion of "experimental" fields, we can work on ensuring the non-experimental bits are backwards-compatible. `codex-rs/app-server-protocol/tests/schema_fixtures.rs` was added as the test and `just write-app-server-schema` can be use to generate the vendored schema files. Incidentally, when I run: ``` rg _ codex-rs/app-server-protocol/schema/typescript/v2 ``` I see a number of `snake_case` names that should be `camelCase`.	2026-02-01 23:38:43 -08:00
Dylan Hurd	08a5ad95a8	fix(personality) prompt patch (#10375 ) ## Summary We had 2 typos in #10373 ## Testing - [x] unit tests pass	2026-02-01 23:32:07 -08:00
Dylan Hurd	a90ff831e7	chore(core) gpt-5.2-codex personality template (#10373 ) ## Summary Consolidate prompts ## Testing - [x] Existing tests pass	2026-02-01 22:54:12 -08:00
Dylan Hurd	6c22360bcb	fix(core) Deduplicate prefix_rules before appending (#10309 ) ## Summary We ideally shouldn't make it to this point in the first place, but if we do try to append a rule that already exists, we shouldn't append the same rule twice. ## Testing - [x] Added unit test for this case	2026-02-01 20:30:38 -08:00
pakrym-oai	03fcd12e77	Do not append items on override turn context (#10354 )	2026-02-01 18:51:26 -08:00
Dylan Hurd	8b95d3e082	fix(rules) Limit rules listed in conversation (#10351 ) ## Summary We should probably warn users that they have a million rules, and help clean them up. But for now, we should handle this unbounded case. Limit rules listed in conversations, with shortest / broadest rules first. ## Testing - [x] Updated unit tests	2026-02-02 02:26:15 +00:00
Gav Verma	5fb46187b2	fix: System skills marker includes nested folders recursively (#10350 ) Updated system skills bundled with Codex were not correctly replacing the user's skills in their .system folder. - Fix `.codex-system-skills.marker` not updating by hashing embedded system skills recursively (nested dirs + file contents), so updates trigger a reinstall. - Added a build Cargo hook to rerun if there are changes in `src/skills/assets/samples/*`, ensuring embedded skill updates rebuild correctly under caching. - Add a small unit test to ensure nested entries are included in the fingerprint.	2026-02-01 18:17:32 -08:00
Charley Cunningham	d3514bbdd2	Bump thread updated_at on unarchive to refresh sidebar ordering (#10280 ) ## Summary - Touch restored rollout files on `thread/unarchive` so `updatedAt` reflects the unarchive time. - Add a regression test to ensure unarchiving bumps `updated_at` from an old mtime. ## Notes This fixes the UX issue where unarchived old threads don’t reappear near the top of recent threads.	2026-02-01 12:53:47 -08:00
Charley Cunningham	3dd9a37e0b	Improve plan mode interaction rules (#10329 ) ## Summary - Replace the “Hard interaction rule” with a clearer “Response constraints” section that enumerates the allowed exceptions for Plan Mode replies. - Remove the stray Phase 1 exception line about simple questions. - Update plan content requirements to ask for a brief summary section and generalize API/type wording.	2026-01-31 23:20:27 -08:00
Dylan Hurd	ae4eeff440	fix(config) config schema newline (#10323 ) ## Summary Looks like we may have introduced a formatting issue in recent PRs. ## Testing - [x] ran `just write-config-schema`	2026-02-01 05:08:29 +00:00
Gav Verma	e470461a96	Sync system skills from public repo for openai yaml changes (#10322 ) Follow-up to https://github.com/openai/codex/pull/10320 Syncing additional changes from https://github.com/openai/skills/tree/main/skills/.system	2026-01-31 21:07:35 -08:00
Gav Verma	dfba95309f	Sync system skills from public repo (#10320 ) Syncs the system skills included in Codex with the updates in https://github.com/openai/skills/tree/main/skills/.system	2026-01-31 20:44:18 -08:00
Dylan Hurd	11c912c4af	chore(features) Personality => Stable (#10310 ) ## Summary Bump `/personality` to stable ## Testing - [x] unit tests pass	2026-01-31 20:32:32 -08:00
Dylan Hurd	a33fa4bfe5	chore(config) Rename config setting to personality (#10314 ) ## Summary Let's make the setting name consistent with the SlashCommand! ## Testing - [x] Updated tests	2026-01-31 19:38:06 -08:00
Anton Panasenko	101d359cd7	Add websocket telemetry metrics and labels (#10316 ) Summary - expose websocket telemetry hooks through the responses client so request durations and event processing can be reported - record websocket request/event metrics and emit runtime telemetry events that the history UI now surfaces - improve tests to cover websocket telemetry reporting and guard runtime summary updates <img width="824" height="79" alt="Screenshot 2026-01-31 at 5 28 12 PM" src="https://github.com/user-attachments/assets/ea9a7965-d8b4-4e3c-a984-ef4fdc44c81d" />	2026-01-31 19:16:44 -08:00
xl-openai	aab3705c7e	Make skills prompt explicit about relative-path lookup (#10282 ) Fix cases where the model tries to locate skill scripts from the cwd and fails.	2026-01-31 19:08:25 -08:00
Gav Verma	39a6a84097	feat: Support loading skills from .agents/skills (#10317 ) This PR adds support for loading [skills](https://developers.openai.com/codex/skills) from `.agents/skills/`. - Issue: https://github.com/agentskills/agentskills/issues/15 - Motivation: When skills live on the filesystem, sharing them across agents is awkward and often ends up requiring symlinks/duplication. A single location under `.agents/` makes it easier to share skills. - Loading from `.codex/skills/` will remain but will be deprecated soon. The change only applies to the [REPO scope](https://developers.openai.com/codex/skills#where-to-save-skills). - Documentation will be updated before this change is live. Testing with skills in two locations of this repo: <img width="960" height="152" alt="image" src="https://github.com/user-attachments/assets/28975ff9-7363-46dd-ad40-f4c7bfdb8234" /> When starting Codex with CWD in `$repo_root` (should only pick up at root): <img width="513" height="143" alt="image" src="https://github.com/user-attachments/assets/389e1ea7-020c-481e-bda0-ce58562db59f" /> When starting Codex with CWD in `$repo_root/codex-rs` (should pick up at cwd and crawl up to root): <img width="552" height="177" alt="image" src="https://github.com/user-attachments/assets/a5beb8de-11b4-45ed-8660-80707c77006a" />	2026-01-31 18:45:05 -08:00
alexsong-oai	b164ac6d1e	feat: fire tracking events for skill invocation (#10120 )	2026-01-31 18:06:26 -08:00
Ahmed Ibrahim	30ed29a7b3	enable plan mode (#10313 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2026-02-01 00:58:17 +00:00
Dylan Hurd	0f9858394b	feat(core,tui,app-server) personality migration (#10307 ) ## Summary Keep existing users on Pragmatic, to preserve behavior while new users default to Friendly ## Testing - [x] Tested locally - [x] add integration tests	2026-01-31 17:25:14 -07:00
Dylan Hurd	8a461765f3	chore(core) Default to friendly personality (#10305 ) ## Summary Update default personality to friendly ## Testing - [x] Unit tests pass	2026-01-31 17:11:32 -07:00
Ahmed Ibrahim	2d6757430a	plan mode prompt (#10308 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes. Include a link to a bug report or enhancement request.	2026-01-31 13:55:52 -08:00
Dylan Hurd	ed9e02c9dc	chore(app-server) add personality update test (#10306 ) ## Summary Add some additional validation to ensure app-server handles Personality changes ## Testing - [x] These are tests	2026-01-31 14:49:55 -07:00
Fouad Matin	49342b156d	Fix npm README image link (#10303 ) ### Motivation - The image referenced in the package README was 404ing on the npm package page because it used a relative file path that doesn't resolve on npm, so the splash image needs a GitHub-hosted URL to render correctly. ### Description - Update `README.md` to replace the relative image path `./.github/codex-cli-splash.png` with the GitHub-hosted URL `https://github.com/openai/codex/blob/main/.github/codex-cli-splash.png`. ### Testing - No automated tests were run because this is a docs-only change and does not affect code or test behavior. ------ [Codex Task](https://chatgpt.com/codex/tasks/task_i_697e58dbce34832d87c7847779e8f4a5)	2026-01-31 20:33:06 +00:00
Dylan Hurd	28f3a71809	chore(features) remove Experimental tag from UTF8 (#10296 ) ## Summary This has been default on for some time, it should now be the default. ## Testing - [x] Existing tests pass	2026-01-31 13:17:24 -07:00
douglaz	9a10121fd6	fix(nix): update flake for newer Rust toolchain requirements (#10302 ) ## Summary - Add rust-overlay input to provide newer Rust versions (rama crates require rustc 1.91.0+) - Add devShells output with complete development environment - Add missing git dependency hashes to codex-rs/default.nix ## Changes flake.nix: - Added `rust-overlay` input to get newer Rust toolchains - Updated `packages` output to use `rust-bin.stable.latest.minimal` for builds - Added `devShells` output with: - Rust with `rust-src` and `rust-analyzer` extensions for IDE support - Required build dependencies: `pkg-config`, `openssl`, `cmake`, `libclang` - Environment variables: `PKG_CONFIG_PATH`, `LIBCLANG_PATH` codex-rs/default.nix: - Added missing `outputHashes` for git dependencies: - `nucleo-0.5.0`, `nucleo-matcher-0.3.1` - `runfiles-0.1.0` - `tokio-tungstenite-0.28.0`, `tungstenite-0.28.0` ## Test Plan - [x] `nix develop` enters shell successfully - [x] `nix develop -c rustc --version` shows 1.93.0 - [x] `nix develop -c cargo build` completes successfully	2026-01-31 11:34:53 -08:00
willwang-openai	2a299317d2	display promo message in usage error (#10285 ) If a promo message is attached to a rate limit response, then display it in the error message.	2026-01-31 08:13:25 -08:00
Anton Panasenko	8660ad6c64	feat: show runtime metrics in console (#10278 ) Summary of changes: - Adds a new feature flag: runtime_metrics - Declared in core/src/features.rs - Added to core/config.schema.json - Wired into OTEL init in core/src/otel_init.rs - Enables on-demand runtime metric snapshots in OTEL - Adds runtime_metrics: bool to otel/src/config.rs - Enables experimental custom reader features in otel/Cargo.toml - Adds snapshot/reset/summary APIs in: - otel/src/lib.rs - otel/src/metrics/client.rs - otel/src/metrics/config.rs - otel/src/metrics/error.rs - Defines metric names and a runtime summary builder - New files: - otel/src/metrics/names.rs - otel/src/metrics/runtime_metrics.rs - Summarizes totals for: - Tool calls - API requests - SSE/streaming events - Instruments metrics collection in OTEL manager - otel/src/traces/otel_manager.rs now records: - API call counts + durations - SSE event counts + durations (success/failure) - Tool call metrics now use shared constants - Surfaces runtime metrics in the TUI - Resets runtime metrics at turn start in tui/src/chatwidget.rs - Displays metrics in the final separator line in tui/src/history_cell.rs - Adds tests - New OTEL tests: - otel/tests/suite/snapshot.rs - otel/tests/suite/runtime_summary.rs - New TUI test: - final_message_separator_includes_runtime_metrics in tui/src/history_cell.rs Scope: - 19 files changed - ~652 insertions, 38 deletions <img width="922" height="169" alt="Screenshot 2026-01-30 at 4 11 34 PM" src="https://github.com/user-attachments/assets/1efd754d-a16d-4564-83a5-f4442fd2f998" />	2026-01-30 22:20:02 -08:00
Dylan Hurd	a8c9e386e7	feat(core) Smart approvals on (#10286 ) ## Summary Turn on Smart Approvals by default ## Testing - [x] Updated unit tests	2026-01-30 23:12:25 -07:00
Ruyut	9327e99b28	Fix minor typos in comments and documentation (#10287 ) ## Summary I have read the contribution guidelines. All changes in this PR are limited to text corrections and do not modify any business logic, runtime behavior, or user-facing functionality. ## Details This PR fixes several minor typos, including: - `create` -> `crate` - `analagous` -> `analogous` - `apply-patch` -> `apply_patch` - `codecs` -> `codex` - ` '/" ` -> ` '/' ` - `Respesent` -> `Represent`	2026-01-30 22:11:02 -08:00
gt-oai	47faa1594c	Turn on cloud requirements for business too (#10283 ) Need to check "enterprise" and "business"	2026-01-31 02:57:42 +00:00
sayan-oai	eb86663dcb	add missing fields to WebSearchAction and update app-server types (#10276 ) - add `WebSearchAction` to app-server v2 types - add `queries` to `WebSearchAction::Search` type Updated tests.	2026-01-30 16:37:56 -08:00
gt-oai	149f3aa27a	Add enforce_residency to requirements (#10263 ) Add `enforce_residency` to requirements.toml and thread it through to a header on `default_client`.	2026-01-31 00:26:25 +00:00
gt-oai	a046481ad9	Wire up cloud reqs in exec, app-server (#10241 ) We're fetching cloud requirements in TUI in https://github.com/openai/codex/pull/10167. This adds the same fetching in exec and app-server binaries also.	2026-01-30 23:53:41 +00:00
Michael Bolin	10ea117ee1	chore: implement Mul for TruncationPolicy (#10272 ) Codex thought this was a good idea while working on https://github.com/openai/codex/pull/10192.	2026-01-30 15:50:20 -08:00
Eric Traut	8d142fd63d	Validate CODEX_HOME before resolving (#10249 ) Summary - require `CODEX_HOME` to point to an existing directory before canonicalizing and surface clear errors otherwise - share the same helper logic in both `core` and `rmcp-client` and add unit tests that cover missing, non-directory, valid, and default paths This addresses #9222	2026-01-30 15:46:33 -08:00
Yuvraj Angad Singh	13e85b1549	fix: update file search directory when session CWD changes (#9279 ) ## Summary Fixes #9041 - Adds update_search_dir() method to FileSearchManager to allow updating the search directory after initialization - Calls this method when the session CWD changes: new session, resume, or fork ## Problem The FileSearchManager was created once with the initial search_dir and never updated. When a user: 1. Starts Codex in a non-git directory (e.g., /tmp/random) 2. Resumes or forks a session from a different workspace 3. The @filename lookup still searched the original directory This caused no matches to be returned even when files existed in the current workspace. ## Solution Update FileSearchManager.search_dir whenever the session working directory changes: - AppEvent::NewSession: Use current config CWD - SessionSelection::Resume: Use resumed session CWD - SessionSelection::Fork: Use forked session CWD ## Test plan - [ ] Start Codex in /tmp/test-dir (non-git) - [ ] Resume a session from a project with actual files - [ ] Verify @filename returns matches from the resumed session directory --------- Co-authored-by: Eric Traut <etraut@openai.com>	2026-01-30 14:59:20 -08:00
sayan-oai	31d1e49340	fix: dont auto-enable web_search for azure (#10266 ) seeing issues with azure after default-enabling web search: #10071, #10257. need to work with azure to fix api-side, for now turning off default-enable of web_search for azure. diff is big because i moved logic to reuse	2026-01-30 22:52:37 +00:00
Jeremy Rose	d59685f6d4	file-search: multi-root walk (#10240 ) Instead of a separate walker for each root in a multi-root walk, use a single walker.	2026-01-30 22:20:23 +00:00
pakrym-oai	748141bdda	Update announcement_tip.toml (#10267 ) Extend the test for dev version	2026-01-30 14:14:29 -08:00
pakrym-oai	0fac2744f7	Hide /approvals from the slash-command list (#10265 ) `/permissions` is the replacement. `/approvals` still available when typing.	2026-01-30 22:12:50 +00:00
pakrym-oai	5f81e8e70b	Fix main (#10262 )	2026-01-30 21:54:05 +00:00
Skylar Graika	9008a0eff9	core: prevent shell_snapshot from inheriting stdin (#9735 ) Fixes #9559. When `shell_snapshot` runs, it may execute user startup files (e.g. `.bashrc`). If those files read from stdin (or if stdin is an interactive TTY under job control), the snapshot subprocess can block or receive `SIGTTIN` (as reported over SSH). This change explicitly sets `stdin` to `Stdio::null()` for the snapshot subprocess, so it can't read from the terminal. Regression test added that would hang/timeout without this change. Tests: `ulimit -n 4096 && cargo test -p codex-core`. cc @dongdongbh @etraut-openai --------- Co-authored-by: Skylar Graika <sgraika127@gmail.com>	2026-01-30 13:47:10 -08:00
pakrym-oai	aacd530a41	Update copy (#10256 ) <img width="839" height="62" alt="image" src="https://github.com/user-attachments/assets/ca987cdb-9e8c-403e-8856-a9b37baa7673" />	2026-01-30 12:57:19 -08:00
daniel-oai	dd6c1d3787	Skip loading codex home as project layer (#10207 ) Summary: - Fixes issue #9932: https://github.com/openai/codex/issues/9932 - Prevents `$CODEX_HOME` (typically `~/.codex`) from being discovered as a project `.codex` layer by skipping it during project layer traversal. We compare both normalized absolute paths and best-effort canonicalized paths to handle symlinks. - Adds regression tests for home-directory invocation and for the case where `CODEX_HOME` points to a project `.codex` directory (e.g., worktrees/editor integrations). Testing: - `cargo build -p codex-cli --bin codex` - `cargo build -p codex-rmcp-client --bin test_stdio_server` - `cargo test -p codex-core` - `cargo test --all-features` - Manual: ran `target/debug/codex` from `~` and confirmed the disabled-folder warning and trust prompt no longer appear.	2026-01-30 12:42:07 -08:00
Charley Cunningham	83317ed4bf	Make plan highlight use popup grey background (#10253 ) ## Summary - align proposed plan background with popup surface color by reusing `user_message_bg` - remove the custom blue-tinted plan background <img width="1572" height="1568" alt="image" src="https://github.com/user-attachments/assets/63a5341e-4342-4c07-b6b0-c4350c3b2639" />	2026-01-30 12:39:15 -08:00

... 8 9 10 11 12 ...

3718 commits