AI | Hugging Face Blog | 2024-01-29 | The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models
AI | Google DeepMind | 2025-12-09 | FACTS Benchmark Suite: Systematically evaluating the factuality of large language models | Systematically evaluating the factuality of large language models with the FACTS Benchmark Suite.