AI Gateway now uses the AI REST API on api.cloudflare.com . You can call any model — whether from OpenAI, Anthropic, Google, or hosted on Workers AI — through one unified API, using the same endpoints and authentication regardless of provider. Four endpoints are available: POST /ai/run — universal endpoint for all m...
AI Gateway - Call any AI model through AI Gateway's new REST API
Cloudflare Developer Platform Changelog · 2026-05-21
Related items
AI Gateway - Request timeouts and retries with AI Gateway
AI Gateway adds additional ways to handle requests - Request Timeouts and Request Retries , making it easier to keep your applications responsive and reliable. Timeouts and retries can be used on both the Universal Endpoint or directly to a supported provider . Request timeouts A request timeout allows you to trigge...
AI Gateway - AI Gateway launches Realtime WebSockets API
We are excited to announce that AI Gateway now supports real-time AI interactions with the new Realtime WebSockets API . This new capability allows developers to establish persistent, low-latency connections between their applications and AI models, enabling natural, real-time conversational AI experiences, includin...
AI Search - New AI Search REST API endpoints for /search and /chat/completions
AI Search now offers new REST API endpoints for search and chat that use an OpenAI compatible format. This means you can use the familiar messages array structure that works with existing OpenAI SDKs and tools. The messages array also lets you pass previous messages within a session, so the model can maintain contex...
Workers AI - New models in Workers AI
Workers AI is excited to add 4 new models to the catalog, including 2 brand new classes of models with a text-to-speech and reranker model. Introducing: @cf/baai/bge-m3 - a multi-lingual embeddings model that supports over 100 languages. It can also simultaneously perform dense retrieval, multi-vector retrieval, and...
AI Gateway - AI Gateway adds Cerebras, ElevenLabs, and Cartesia as new providers
AI Gateway has added three new providers: Cartesia , Cerebras , and ElevenLabs , giving you more even more options for providers you can use through AI Gateway. Here's a brief overview of each: Cartesia provides text-to-speech models that produce natural-sounding speech with low latency. Cerebras delivers low-latenc...
AI Gateway - AI Gateway Introduces New Worker Binding Methods
We have released new Workers bindings API methods , allowing you to connect Workers applications to AI Gateway directly. These methods simplify how Workers calls AI services behind your AI Gateway configurations, removing the need to use the REST API and manually authenticate. To add an AI binding to your Worker, in...