AI

Preference Tuning LLMs with Direct Preference Optimization Methods

Hugging Face Blog ยท 2024-01-18

Feedback