Learning to summarize with human feedbackBy OpenAI News / September 4, 2020 We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.