AI | Hugging Face Blog | 2023-12-05 | Optimum-NVIDIA Unlocking blazingly fast LLM inference in just 1 line of code
AI | Hugging Face Blog | 2024-01-30 | Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding