CiPO: Counterfactual Unlearning for Large Reasoning Models through Iterative Preference Optimization
arXiv:2604.15847v1 Announce Type: new
Abstract: Machine unlearning has gained increasing attention in recent years as a promising technique for selectively removing unwanted privacy-sensitive or copyrighted information from Large Language Models that are trained …