AI | Hugging Face Blog | 2023-03-28
Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator