📚 3LM: A Benchmark for Arabic LLMs in STEM and Code
Hugging Face Blog · 2025-08-01
Related items
StarCoder: A State-of-the-Art LLM for Code (Hugging Face Blog, 2023-05-04)
DABStep: Data Agent Benchmark for Multi-step Reasoning (Hugging Face Blog, 2025-02-04)
🇨🇿 BenCzechMark - Can your LLM Understand Czech? (Hugging Face Blog, 2024-10-01)
Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard (Hugging Face Blog, 2024-12-04)
Llama 2 on Amazon SageMaker a Benchmark (Hugging Face Blog, 2023-09-26)
Judge Arena: Benchmarking LLMs as Evaluators (Hugging Face Blog, 2024-11-19)