In addition, the Responses API now supports WebSockets. This enables persistent connections in which only new information is transmitted, rather than resending the full context with each request. OpenAI says this change can speed up complex AI agents with heavy tool usage by 20% to 40%.