Igor Slinko, Ilia Zavidnyi, Egor Bogomolov, Yaroslav Zharov

Step Rejection Fine-Tuning: A Practical Distillation Recipe

Igor Slinko, Ilia Zavidnyi, Egor Bogomolov, Yaroslav Zharov / May 12, 2026

arXiv:2605.10674v1 Announce Type: cross
Abstract: Rejection Fine-Tuning (RFT) is a standard method for training LLM agents, where unsuccessful trajectories are discarded from the training set. In the context of SWE-bench tasks, this corresponds to fil…

Author name: Igor Slinko, Ilia Zavidnyi, Egor Bogomolov, Yaroslav Zharov

Step Rejection Fine-Tuning: A Practical Distillation Recipe