The Codex team at OpenAI is on fire. Less than two weeks after releasing a dedicated agent-based Codex app for Macs, and only a week after shipping the faster, more steerable GPT-5.3-Codex language model, OpenAI is counting on lightning to strike a third time. By David Gewirtz.

OpenAI’s latest release, GPT-5.3-Codex-Spark, is a purpose-built model for real-time coding collaboration. It aims to transform the developer experience from a slow, batch-like interaction into a fluid, conversational one. The model reportedly generates code up to 15x faster, driven by significant latency reductions: an 80% cut in client/server round-trip overhead and a 50% improvement in time-to-first-token.
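To get a feel for how those numbers combine, here is a toy latency budget. The article only gives the percentage reductions and the overall 15x generation claim; every baseline millisecond figure below is a made-up assumption for illustration, not a measured value.

```python
# Toy latency budget showing how the reported cuts compound.
# Baseline numbers are assumed for illustration; only the percentage
# reductions (80% round-trip, 50% TTFT) and the 15x generation claim
# come from the article.

def interaction_latency(roundtrip_ms, ttft_ms, gen_ms):
    """Total wall-clock time for one request: transport + first token + generation."""
    return roundtrip_ms + ttft_ms + gen_ms

# Hypothetical baseline for one small edit request.
base = interaction_latency(roundtrip_ms=200, ttft_ms=800, gen_ms=3000)

# Apply the reported reductions: 80% less round-trip overhead,
# 50% faster time-to-first-token, 15x faster generation.
spark = interaction_latency(roundtrip_ms=200 * 0.2, ttft_ms=800 * 0.5, gen_ms=3000 / 15)

print(f"baseline: {base:.0f} ms, spark: {spark:.0f} ms, speedup: {base / spark:.1f}x")
```

Note that the end-to-end speedup a developer feels is smaller than 15x, because round-trip and time-to-first-token costs still dominate short interactions — which is presumably why Spark attacks all three terms at once.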

Key technical features enabling this include support for mid-task interruption and a persistent WebSocket connection that avoids renegotiation delays. Powered by Cerebras’s WSE-3 wafer-scale chips, Spark is optimized for lightweight, targeted edits. The major caveat is the performance trade-off: on benchmarks such as SWE-Bench Pro, it underperforms the full GPT-5.3-Codex and is explicitly noted as falling short of OpenAI’s "high capability" threshold for cybersecurity. Initially available to Pro-tier users, Spark is positioned not as a replacement but as a complement for rapid iteration, with the main model handling more complex, long-running tasks.
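The persistent-connection idea is worth a quick sketch. The snippet below is a generic illustration using a plain TCP socket and a local echo server, not OpenAI's actual protocol (the article says Spark uses a persistent WebSocket): the point is simply that one long-lived connection serves many small "edit" requests without paying connection setup each time.

```python
# Generic sketch: reusing one connection for many small requests avoids
# repeated connection setup/renegotiation. A local echo server stands in
# for the remote model endpoint; this is NOT OpenAI's actual protocol.

import socket
import threading

def echo_server(sock):
    """Accept connections and echo each newline-terminated request back."""
    while True:
        try:
            conn, _ = sock.accept()
        except OSError:
            return  # listening socket closed; shut down
        with conn:
            f = conn.makefile("rwb")
            for line in f:
                f.write(line)
                f.flush()

# Start a throwaway echo server on an OS-assigned port.
server = socket.create_server(("127.0.0.1", 0))
port = server.getsockname()[1]
threading.Thread(target=echo_server, args=(server,), daemon=True).start()

def roundtrip(f, payload):
    """Send one request over the already-open connection, read one reply."""
    f.write(payload + b"\n")
    f.flush()
    return f.readline().strip()

# Persistent style: one connection, many requests, no per-request handshake.
with socket.create_connection(("127.0.0.1", port)) as s:
    f = s.makefile("rwb")
    replies = [roundtrip(f, b"edit-%d" % i) for i in range(3)]

print(replies)
server.close()
```

A WebSocket adds framing and an HTTP upgrade handshake on top of this, but the economics are the same: pay the handshake once, then stream many interruptions and edits over the open channel.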

This forces a strategic decision on developers: prioritize speed for quick prototyping, or rely on the more robust, deliberate intelligence of the standard model for critical work. Good read!

[Read More]

Tags ai programming app-development performance