The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Hugging Face Blog · 2024-04-19

Chinese Original

Related items

AIHugging Face Blog2024-01-29

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

AIGoogle DeepMind2025-12-09

FACTS Benchmark Suite: Systematically evaluating the factuality of large language models

Systematically evaluating the factuality of large language models with the FACTS Benchmark Suite.

AIHugging Face Blog2026-05-06

Adding Benchmaxxer Repellant to the Open ASR Leaderboard

AIHugging Face Blog2024-05-14

Introducing the Open Arabic LLM Leaderboard

AIHugging Face Blog2024-05-05

Introducing the Open Leaderboard for Hebrew LLMs!

AIHugging Face Blog2024-10-04

Introducing the Open FinLLM Leaderboard

Feedback

TypeMessage