📚 3LM: A Benchmark for Arabic LLMs in STEM and Code

Hugging Face Blog · 2025-08-01

Chinese Original

Related items

AIHugging Face Blog2023-05-04

StarCoder: A State-of-the-Art LLM for Code

AIHugging Face Blog2025-02-04

DABStep: Data Agent Benchmark for Multi-step Reasoning

AIHugging Face Blog2024-10-01

🇨🇿 BenCzechMark - Can your LLM Understand Czech?

AIHugging Face Blog2024-12-04

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

AIHugging Face Blog2023-09-26

Llama 2 on Amazon SageMaker a Benchmark

AIHugging Face Blog2024-11-19

Judge Arena: Benchmarking LLMs as Evaluators

Feedback

TypeMessage