AI | Hugging Face Blog | 2023-03-28 | Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator
AI | Hugging Face Blog | 2023-12-05 | AMD + 🤗: Large Language Models Out-of-the-Box Acceleration with AMD GPU
AI | Hugging Face Blog | 2025-09-29 | Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models