Papers
Collection
Large Language Model (LLM) and NLP related papers. • 354 items • Updated • 16
The overview covers key aspects of deep reinforcement learning and sequential decision making, including value-based RL, policy-gradient methods, and model-based methods.
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics (including a very brief discussion of RL+LLMs).
Get this paper in your agent:
hf papers read 2412.05265 curl -LsSf https://hf.co/cli/install.sh | bash No model linking this paper
No dataset linking this paper
No Space linking this paper