AI Search now supports four additional Workers AI models across text generation and embedding. Text generation Model Context window (tokens) @cf/zai-org/glm-4.7-flash 131,072 @cf/qwen/qwen3-30b-a3b-fp8 32,000 GLM-4.7-Flash is a lightweight model from Zhipu AI with a 131,072 token context window, suitable for long-do...
AI Search - New Workers AI models for text generation and embedding in AI Search
Cloudflare Developer Platform Changelog · 2026-04-08
Related items
Workers AI - New models in Workers AI
Workers AI is excited to add 4 new models to the catalog, including 2 brand new classes of models with a text-to-speech and reranker model. Introducing: @cf/baai/bge-m3 - a multi-lingual embeddings model that supports over 100 languages. It can also simultaneously perform dense retrieval, multi-vector retrieval, and...
Workers AI - Introducing EmbeddingGemma from Google on Workers AI
We're excited to be a launch partner alongside Google to bring their newest embedding model, EmbeddingGemma , to Workers AI that delivers best-in-class performance for its size, enabling RAG and semantic search use cases. @cf/google/embeddinggemma-300m is a 300M parameter embedding model from Google, built from Gemm...
Workers AI - New conversion options for Markdown Conversion
You can now customize how the Markdown Conversion service processes different file types by passing a conversionOptions object. Available options: Images : Set the language for AI-generated image descriptions HTML : Use CSS selectors to extract specific content, or provide a hostname to resolve relative links PDF :...
Workers AI - Workers AI for Developer Week - faster inference, new models, async batch API, expanded LoRA support
Happy Developer Week 2025! Workers AI is excited to announce a couple of new features and improvements available today. Check out our blog for all the announcement details. Faster inference + New models We’re rolling out some in-place improvements to our models that can help speed up inference by 2-4x! Users of the...
Workers - Support for Node.js DNS, Net, and Timer APIs in Workers
When using a Worker with the nodejscompat compatibility flag enabled, you can now use the following Node.js APIs: node:net node:dns node:timers node:net You can use node:net to create a direct connection to servers via a TCP sockets with net.Socket . JavaScript import net from "node:net" ; const exampleIP = "127.0.0...
Agents, Workers, Workflows - Run AI-generated code on-demand with Code Sandboxes (new)
AI is supercharging app development for everyone, but we need a safe way to run untrusted, LLM-written code. We’re introducing Sandboxes , which let your Worker run actual processes in a secure, container-based environment. import { getSandbox } from "@cloudflare/sandbox" ; export { Sandbox } from "@cloudflare/sandb...