Studying Sutton and Barto’s RL book and its connections to RL for LLMs (e.g., tool use, math reasoning, agents, and so on)? [D]
Hi everyone, I graduated from a Master in Math program last summer. In recent months, I have been trying to understand more about ML/DL and LLMs, so I have been reading books and sometimes papers on LLMs and their reasoning capacities (I'm especial…