Related items
AIHugging Face Blog
KV Cache from scratch in nanoVLM
AIHugging Face Blog
How to train a new language model from scratch using Transformers and Tokenizers
AIHugging Face Blog
LoRA training scripts of the world, unite!
AIHugging Face Blog
Training Design for Text-to-Image Models: Lessons from Ablations
AIHugging Face Blog
Smol2Operator: Post-Training GUI Agents for Computer Use
AIarXiv cs.AI
Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders
Model internals encode rich information about how a large language model (LLM) processes its training data; however, post-training data engineering largely relies on external signals and ignores rich intrinsic signals lying in model internals. We propose SAERL, a data engineering framework for LLM reinforcement lear...