AI

OpenAI Launches ChatGPT Lockdown Mode to Block Prompt Injection Data Exfiltration

dark6 8 June 2026
Read Time:3 Minute, 48 Second

OpenAI has launched ChatGPT Lockdown Mode, a new enterprise security feature designed to block the final stage of prompt injection attacks: the unauthorized exfiltration of sensitive data to attacker-controlled destinations. The feature is now available to eligible personal accounts, self-serve ChatGPT Business users, and managed enterprise workspaces — representing a significant step forward in hardening AI assistants against one of the most persistent attack vectors in the LLM threat landscape.

The Problem: Prompt Injection and Data Exfiltration

Prompt injection attacks occur when malicious instructions are embedded in content that an AI model processes — such as a cached webpage, an uploaded PDF, or a document retrieved during deep research. These hidden instructions can manipulate the model’s behavior, causing it to act against the user’s intent. The most dangerous outcome is data exfiltration: the AI is tricked into sending sensitive information (conversation history, uploaded documents, retrieved data) to an external URL controlled by the attacker.

This attack class has grown significantly in sophistication as AI agents gain broader access to tools, external services, and sensitive enterprise data. Standard content filtering has proven insufficient to reliably detect and block all injection payloads, particularly when they are obfuscated or embedded in complex documents.

What Lockdown Mode Does

Lockdown Mode addresses the exfiltration pathway rather than the injection vector itself. When enabled, it restricts ChatGPT’s outbound network capabilities, cutting off the channels that injected code most commonly exploits to leak data. The following capabilities are disabled or restricted under Lockdown Mode:

  • Live web browsing — Limited to cached content only; real-time results are unavailable
  • Image retrieval — ChatGPT cannot fetch or display web-derived images
  • Deep research — Fully disabled
  • Agent mode — Fully disabled
  • Canvas networking — Canvas-generated code cannot make network requests
  • File downloads — ChatGPT cannot download external files for analysis (manually uploaded files remain accessible)

Memory, file uploads, conversation sharing, and model training settings are unaffected and remain independently configurable.

Important Limitations to Understand

OpenAI is transparent that Lockdown Mode is not a complete defense. Several critical limitations apply:

  • Injection itself is not prevented. Malicious payloads in uploaded files, cached pages, or ingested content can still enter the model’s context and influence its responses and reasoning — only the exfiltration pathway is blocked.
  • Third-party app risk persists. Connected apps and connectors can still serve as exfiltration sinks if not properly audited and restricted.
  • Lockdown Mode does not affect Codex. Codex network access is not governed by this feature.
  • Developer Mode is incompatible. Enabling Lockdown Mode automatically disables Developer Mode and vice versa.

Risk Tiers for App and Connector Configurations

OpenAI has introduced a risk classification framework for app configurations in Lockdown Mode environments:

  • High risk (not recommended): Read or write actions for untrusted apps; write actions for trusted apps with broad or uncertain data visibility.
  • Medium risk: Sync connectors and read actions for trusted apps carry lower exfiltration sink risk but can still expose sensitive source data.
  • Lower risk: Write actions for trusted apps where side effects are confirmed to be visible only to trusted parties.

Enterprise Deployment and Administration

For managed enterprise workspaces, Lockdown Mode does not automatically disable all connected apps. Administrators must manually configure role-based access controls (RBAC), designate trusted apps, and audit connector permissions to achieve meaningful protection. The recommended approach is to create a dedicated RBAC role for Lockdown Mode and assign members or groups accordingly.

The Compliance API Logs Platform provides persistent audit visibility into app usage, shared data, and connected sources, independent of Lockdown Mode status — giving security teams the observability needed to detect anomalous behavior even when Lockdown Mode is active.

How to Enable Lockdown Mode

  • Personal and Business users: Settings > Security > Advanced Security > Lockdown Mode
  • Enterprise admins: Create a custom RBAC role designated as a Lockdown Mode role and assign members. Consult OpenAI’s RBAC documentation and Compliance API guidance for workspace-wide deployment.

Why This Matters for Enterprise AI Security

Lockdown Mode represents a meaningful architectural hardening for organizations using ChatGPT with sensitive data. While not a silver bullet, it removes the most common exfiltration pathways that prompt injection exploits, raising the bar for attackers who embed malicious instructions in enterprise content.

As organizations deploy AI assistants with increasingly broad access to internal systems, documents, and data, securing the outbound data flows of these agents becomes as critical as protecting inbound access. Lockdown Mode is a step in the right direction — but security teams should pair it with rigorous app auditing, content scanning, and behavioral monitoring to build a comprehensive defense-in-depth posture for AI-powered workflows.

Source: Cyber Security News | OpenAI Help Center

Leave a Reply

💬 [[ unisciti alla discussione! ]]


Se vuoi commentare su OpenAI Launches ChatGPT Lockdown Mode to Block Prompt Injection Data Exfiltration, utilizza la discussione sul Forum.
Condividi esempi, IOCs o tecniche di detection efficaci nel nostro 👉 forum community