ChatGPT ARTICLE 8 May 2026

Running Codex safely at OpenAI

A look at the controls, boundaries, and telemetry OpenAI uses to govern coding agents in real workflows.

As AI systems become more capable, they increasingly act on behalf of users. Coding agents can autonomously review repositories, run commands, and interact with development tools, tasks that previously required direct human execution.

With Codex, we’ve designed these capabilities alongside the controls organizations need for safe deployment. Security teams need ways to govern how agents operate: what they can access, when human approval is required, which systems they can interact with, and what telemetry exists to explain their behavior.

At OpenAI, we deploy Codex with a few clear goals: keep the agent inside clear technical boundaries, let developers move quickly on low-risk actions, and make higher-risk actions explicit. We also preserve agent-native telemetry so we can understand and audit what the agent did. In practice, that means managed configuration, constrained execution, network policies, and agent-native logs.

Controlling how Codex operates

We deploy Codex with a simple principle: the agent should be productive inside a bounded environment, low-risk everyday actions should be frictionless, and higher-risk actions should stop for review.

Sandboxing and approvals

Approvals and sandboxing work together. The sandbox defines the technical execution boundary, including where Codex can write, whether it can reach the network, and which paths remain protected. Approval policy determines when Codex must ask to perform an action, such as when it needs to do something outside of the sandbox. Users can approve the action once, or approve that type of action for that session.
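As a rough sketch of how these two layers interact, consider a hypothetical policy check. The `requires_approval` function, the action dicts, and their fields are illustrative only, not the actual Codex policy engine:

```python
from pathlib import Path

def requires_approval(action: dict, writable_roots: list[str]) -> bool:
    """Return True when a planned action falls outside the sandbox boundary.

    `action` is a hypothetical dict like {"type": "write", "path": "..."}
    or {"type": "network", "host": "..."}.
    """
    if action["type"] == "read":
        return False  # reads inside the workspace stay frictionless
    if action["type"] == "write":
        target = Path(action["path"]).expanduser().resolve()
        roots = [Path(r).expanduser().resolve() for r in writable_roots]
        # Writes are allowed only under a declared writable root.
        return not any(target.is_relative_to(root) for root in roots)
    # Anything else (network access, escalated shell) stops for review.
    return True
```

Under this sketch, a write inside a writable root proceeds silently, while a write to a system path or an unexpected network call pauses for the user's decision, once, or for the rest of the session.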

For routine approval requests, we use Auto-review mode, which, when enabled, auto-approves certain kinds of requests to reduce how often users have to stop and approve Codex actions. Codex sends the planned action and recent context to an auto-approval subagent, which can approve low-risk actions instead of interrupting the user. That keeps Codex moving on routine work while still stopping on higher-risk actions or actions with potentially unintended consequences.


```toml
# config.toml

# Turn on auto_review
approvals_reviewer = "auto_review"
# Add known development directories to the sandbox automatically
sandbox_workspace_write.writable_roots = ["~/development"]

# requirements.toml

# Require Codex to operate inside the sandbox
allowed_sandbox_modes = ["read-only", "workspace-write"]
```

Network access

We do not run Codex with open-ended outbound access. Our managed network policy allows expected destinations, blocks destinations we do not want Codex reaching, and requires approval for unfamiliar domains. That lets Codex complete common, known-good workflows without giving it broad network access.

```toml
# requirements.toml

# Ensure web fetch only comes from OpenAI's cache
allowed_web_search_modes = ["cached"]

[experimental_network]
# Turn on Network Proxy
enabled = true
# Allow Codex to interact with localhost
allow_local_binding = true
# Block all requests to this domain
denied_domains = ["pastebin.com"]
# Auto-allow requests to these domains
allowed_domains = ["login.microsoftonline.com", "*.openai.com"]
```
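A minimal sketch of the proxy's decision logic, assuming deny rules are checked before allow rules and unfamiliar domains fall through to an approval prompt (that ordering is our reading of the behavior, not a documented guarantee):

```python
from fnmatch import fnmatch

# Domain lists mirroring the requirements.toml above.
DENIED = ["pastebin.com"]
ALLOWED = ["login.microsoftonline.com", "*.openai.com"]

def network_decision(host: str) -> str:
    """Return "deny", "allow", or "ask" for an outbound request."""
    if any(fnmatch(host, pat) for pat in DENIED):
        return "deny"                 # explicitly blocked destination
    if any(fnmatch(host, pat) for pat in ALLOWED):
        return "allow"                # known-good destination
    return "ask"                      # unfamiliar domain: require approval
```

Wildcard patterns like `*.openai.com` are matched here with glob semantics, so `api.openai.com` is allowed while an unrelated host prompts for review.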

Identity and credentials

We also manage how Codex authenticates. CLI and MCP OAuth credentials are stored in the secure OS keyring, login is forced through ChatGPT, and access is pinned to our ChatGPT enterprise workspace. That keeps Codex usage tied to our workspace-level controls and makes Codex activity available in the ChatGPT Compliance Logs Platform for our enterprise workspace.

```toml
# config.toml

# Store CLI auth credentials in the OS keychain
cli_auth_credentials_store = "keyring"
# Store MCP credentials in the OS keychain
mcp_oauth_credentials_store = "keyring"
# Require auth via ChatGPT
forced_login_method = "chatgpt"
# Require auth to a specific ChatGPT workspace
forced_chatgpt_workspace_id = ""
```

Rules

We use rules so Codex does not treat every shell command as equally safe. Common benign commands that engineers use in day-to-day development are allowed outside the sandbox without approval, while specific dangerous commands can be blocked or made to require approval. That lets Codex move quickly through ordinary engineering tasks while still forcing review of, or blocking, patterns we do not want to run outside the sandbox.

```
# default.rules

prefix_rule(
    pattern = ["gh", "pr", ["view", "list"]],
    decision = "allow",
    justification = "Allows read-only GitHub PR inspection via gh CLI.",
)
prefix_rule(
    pattern = ["kubectl", ["get", "describe", "logs"]],
    decision = "allow",
    justification = "Allows Kubernetes resource inspection for debugging.",
)
```
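A small sketch of how a `prefix_rule` pattern could be matched against a command's argv. The nested-list-means-any-of semantics is inferred from the examples above, not taken from the Codex implementation:

```python
def matches_prefix(pattern: list, argv: list[str]) -> bool:
    """Match argv against a prefix pattern; nested lists are alternatives."""
    if len(argv) < len(pattern):
        return False          # command is shorter than the required prefix
    for part, arg in zip(pattern, argv):
        if isinstance(part, list):
            if arg not in part:       # any-of alternatives, e.g. ["view", "list"]
                return False
        elif arg != part:             # literal token, e.g. "gh"
            return False
    return True                        # trailing args (flags, IDs) are ignored
```

So `gh pr view 123` matches the first rule above and is allowed without approval, while `gh pr merge` does not match and falls back to the default policy.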

Managed configs

We apply this posture through a combination of cloud-managed requirements, macOS managed preferences, and local requirements files. Requirements are admin-enforced controls that users cannot override. The macOS managed preferences and local requirements files allow us to keep a consistent baseline while still testing different configurations by team, user group, or environment. These configurations apply across local Codex surfaces, including the desktop app, CLI, and IDE extension.
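The layering described above can be sketched as a simple precedence merge, assuming admin-enforced requirements always win over managed defaults, which in turn win over a user's local settings (the function and its argument names are illustrative):

```python
def effective_config(user: dict, managed: dict, requirements: dict) -> dict:
    """Merge configuration layers from lowest to highest precedence."""
    config = dict(user)          # user-local config.toml
    config.update(managed)       # org baseline (e.g. macOS managed preferences)
    config.update(requirements)  # admin-enforced requirements: cannot be overridden
    return config
```

A user who turns off the reviewer locally still gets the managed value, while keys the organization does not set (like a UI theme) pass through untouched.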

Agent-native telemetry and audit trails

Control is only half the job. Once agents are deployed, security teams need visibility into what these agents are doing and why. Traditional security logs are still useful when looking at actions taken by Codex, but they mostly answer what happened: a process started, a file changed, a network connection was attempted. Defenders are still left to figure out why Codex did something and what the user intended.

Codex can give security teams a more agent-aware view. Codex supports OpenTelemetry log export for various Codex events such as user prompts, tool approval decisions, tool execution results, MCP server usage, and network proxy allow or deny events. Codex activity logs are also available through the OpenAI Compliance Platform for Enterprise and Edu customers.

```toml
# config.toml

[otel]
log_user_prompt = true
environment = "prod"

[otel.exporter.otlp-http]
endpoint = "http://localhost:14318/v1/logs"
protocol = "binary"
```
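To make the event types concrete, here is a hypothetical shape for one exported record, an approval decision. The field names are illustrative, not the exact Codex OTel schema:

```python
import json

def approval_event(command: str, decision: str, source: str) -> str:
    """Serialize a tool-approval event as a JSON log record."""
    record = {
        "event.name": "codex.tool_approval",  # illustrative event name
        "command": command,                   # the command Codex planned to run
        "decision": decision,                 # "approved" or "denied"
        "decision_source": source,            # "user" or "auto_review"
    }
    return json.dumps(record, sort_keys=True)
```

Records like this, alongside user prompts, tool results, and proxy allow/deny events, are what give a log pipeline the "why" that process-level telemetry lacks.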

At OpenAI, we use Codex logs alongside our AI-powered security triage agent. When an endpoint alert says Codex did something unusual, the endpoint security tool tells us that a suspicious event occurred. Codex logs then help explain the surrounding intent by the user and agent. Our AI security triage agent uses Codex logs to inspect the original request, tool activity, approval decisions, tool results, and any relevant network policy decision or block. The AI security triage agent surfaces its analysis to our security team for review to distinguish between expected agent behavior, benign mistakes, and activity that truly warrants escalation.
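The correlation step can be sketched as a window query over exported events: given an endpoint alert's timestamp, pull the surrounding Codex activity so the triage agent (or a human) can see the prompt, approvals, and tool results that led to the alert. The event shape here is an assumption:

```python
def events_near(alert_ts: float, events: list[dict], window: float = 300.0) -> list[dict]:
    """Return Codex events within `window` seconds of the alert, oldest first."""
    hits = [e for e in events if abs(e["ts"] - alert_ts) <= window]
    return sorted(hits, key=lambda e: e["ts"])
```

Reading the returned slice in order, prompt, then approval decision, then tool result, is usually enough to separate expected behavior from activity worth escalating.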

We also use the same telemetry operationally. We use these logs to understand how internal adoption is changing, which tools and MCP servers are being used, how often the network sandbox is blocking or prompting, and where the rollout still needs tuning. These OpenTelemetry logs can be centralized in SIEM and compliance logging systems.

Looking ahead

As coding agents like Codex become integrated into development workflows, security teams need tools specifically designed for managing this shift. Codex provides the control surfaces, configuration management, sandboxing, and detailed agent-aware telemetry needed to ensure safe adoption. With those capabilities in place, security teams can enable Codex with greater confidence, balancing developer productivity with the visibility and control required for enterprise security. More information is available in the Codex configuration documentation and the Compliance API documentation.


Author

OpenAI



