AI | Hugging Face Blog | 2024-05-24 | CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models
AI | Hugging Face Blog | 2021-07-15 | Deep Learning over the Internet: Training Language Models Collaboratively
AI | Hugging Face Blog | 2025-04-16 | Introducing HELMET: Holistically Evaluating Long-context Language Models