Original text is not available for public display.
How Synthesia optimizes generative AI video inference on Amazon EC2 G7e instances
AWS Architecture Blog · 2026-05-19
Related items
How Generali Malaysia optimizes operations with Amazon EKS
In this post, we look at how Generali is using Amazon EKS Auto Mode and its integration with other AWS services to enhance performance while reducing operational overhead, optimizing costs, and enhancing security.
Unlock efficient model deployment: Simplified Inference Operator setup on Amazon SageMaker HyperPod
In this post, we walk through the new installation experience, demonstrate three deployment methods (console, CLI, and Terraform), and show how features like multi-instance-type deployment and native node affinity give you fine-grained control over inference scheduling
AI Search - Create AI Search instances programmatically via REST API
You can now create AI Search instances programmatically using the API . For example, use the API to create instances for each customer in a multi-tenant application or manage AI Search alongside your other infrastructure. If you have created an AI Search instance via the dashboard before, you already have a service...
Capacity Efficiency at Meta: How Unified AI Agents Optimize Performance at Hyperscale
We’re sharing insights into Meta’s Capacity Efficiency Program, where we’ve built an AI agent platform that helps automate finding and fixing performance issues throughout our infrastructure. By leveraging encoded domain expertise across a unified, standardized tool interface these agents help save power and free up...
Automate safety monitoring with computer vision and generative AI
This post describes a solution that uses fixed camera networks to monitor operational environments in near real-time, detecting potential safety hazards while capturing object floor projections and their relationships to floor markings. While we illustrate the approach through distribution center deployment examples...
Workflows, Workers - Increased concurrency, creation rate, and queued instance limits for Workflows instances
Workflows limits have been raised to the following: Limit Previous New Concurrent instances (running in parallel) 10,000 50,000 Instance creation rate (per account) 100/second per account 300/second per account, 100/second per workflow Queued instances per Workflow 1 1 million 2 million These increases apply to all...