AIHugging Face Blog2022-03-16Accelerate BERT inference with Hugging Face Transformers and AWS Inferentia
AIHugging Face Blog2024-01-15Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive
AIHugging Face Blog2023-03-28Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator