Related items
AIHugging Face Blog
Smol2Operator: Post-Training GUI Agents for Computer Use
AIHugging Face Blog
Training mRNA Language Models Across 25 Species for $165
AIHugging Face Blog
Training Design for Text-to-Image Models: Lessons from Ablations
AIHugging Face Blog
TRL v1.0: Post-Training Library Built to Move with the Field
AIarXiv cs.AI
Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders
Model internals encode rich information about how a large language model (LLM) processes its training data; however, post-training data engineering largely relies on external signals and ignores rich intrinsic signals lying in model internals. We propose SAERL, a data engineering framework for LLM reinforcement lear...
AIHugging Face Blog