We're excited to be a launch partner alongside Google to bring their newest embedding model, EmbeddingGemma , to Workers AI that delivers best-in-class performance for its size, enabling RAG and semantic search use cases. @cf/google/embeddinggemma-300m is a 300M parameter embedding model from Google, built from Gemm...
Workers AI - Introducing EmbeddingGemma from Google on Workers AI
Cloudflare Developer Platform Changelog · 2025-09-05
Related items
Workers AI - Google Gemma 4 26B A4B now available on Workers AI
We are partnering with Google to bring @cf/google/gemma-4-26b-a4b-it to Workers AI. Gemma 4 26B A4B is a Mixture-of-Experts (MoE) model built from Gemini 3 research, with 26B total parameters and only 4B active per forward pass. By activating a small subset of parameters during inference, the model runs almost as fa...
Stream - Introducing Stream Bindings for Workers
You can now interact with your Stream video library using new bindings for Workers! This allows customers to upload content to Stream, provision direct uploads, manage videos, and generate signed URLs from a Worker without making authenticated API calls. We're excited to bring Stream and Workers closer together to e...
Agents, Workers - Introducing the Agents SDK
We've released the Agents SDK , a package and set of tools that help you build and ship AI Agents. You can get up and running with a chat-based AI Agent (and deploy it to Workers) that uses the Agents SDK, tool calling, and state syncing with a React-based front-end by running the following command: npm create cloud...
AI Search - New Workers AI models for text generation and embedding in AI Search
AI Search now supports four additional Workers AI models across text generation and embedding. Text generation Model Context window (tokens) @cf/zai-org/glm-4.7-flash 131,072 @cf/qwen/qwen3-30b-a3b-fp8 32,000 GLM-4.7-Flash is a lightweight model from Zhipu AI with a 131,072 token context window, suitable for long-do...
Workers, Agents, Workers AI - Introducing GLM-4.7-Flash on Workers AI, @cloudflare/tanstack-ai, and workers-ai-provider v3.1.1
We're excited to announce GLM-4.7-Flash on Workers AI, a fast and efficient text generation model optimized for multilingual dialogue and instruction-following tasks, along with the brand-new @cloudflare/tanstack-ai package and workers-ai-provider v3.1.1 . You can now run AI agents entirely on Cloudflare. With GLM-4...
AI Gateway - AI Gateway Introduces New Worker Binding Methods
We have released new Workers bindings API methods , allowing you to connect Workers applications to AI Gateway directly. These methods simplify how Workers calls AI services behind your AI Gateway configurations, removing the need to use the REST API and manually authenticate. To add an AI binding to your Worker, in...