Digest

AIHugging Face Blog

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

2026-05-27

AIHugging Face Blog

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

2026-05-27

AIarXiv cs.AI

Algorithmic Monocultures in Hiring

Many employers screen job applicants with algorithms built by the same few algorithm vendors. We hypothesize that algorithmic monoculture leads to the same individuals and members of the same racial groups facing rejection. We acquire and analyze a novel dataset of 3 million applicants submitting 4 million applicati...

2026-05-26

AIarXiv cs.AI

MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation

Large language model (LLM) agents rely on reusable skills to solve complex tasks. However, existing skill creation approaches treat skills as isolated and static artifacts, limiting their reusability, reliability, and long-term improvement. We propose MUSE-Autoskill Agent (Memory-Utilizing Skill Evolution), a skill-...

2026-05-26

AIarXiv cs.LG

From Scores to Gibbs Correctors: Accelerating Uniform-Rate Discrete Diffusion Models

Discrete diffusion models have achieved strong empirical performance in text and other symbolic domains, but, especially for uniform-rate models, they often require many steps to generate a single sample. Existing acceleration methods either rely on training additional quantities or suffer from slow mixing. In this...

2026-05-26

AIarXiv cs.LG

Towards Controllable Image Generation through Representation-Conditioned Diffusion Models

Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such as text prompts or semantic maps, which require extensively annotated datasets. In thi...

2026-05-26

AIGoogle AI

Catch up on the Dialogues stage at Google I/O 2026.

Alphabet CEO Sundar Pichai in conversation on the I/O 2026 Dialogues stage

2026-05-22

AIGoogle DeepMind

We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks

2026-05-21

AIGoogle AI

We’re announcing new community investments in Missouri.

We’re helping build the state’s next-generation workforce and investing in energy programs.

2026-05-20

AIGoogle DeepMind

Fast-tracking genetic leads to reverse cellular aging

Biologists use Co-Scientist to find novel factors that successfully rejuvenate human cells.

2026-05-18

AIHugging Face Blog

Reachy Mini goes fully local

2026-05-27

AIarXiv cs.AI

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Vision-language models (VLMs) commonly formulate visual grounding and detection as a coordinate-token generation problem, serializing each 2D box into multiple 1D tokens that are learned and decoded largely independently. This token-by-token decoding mismatches the coupled structure of box geometry and creates a pra...

2026-05-26