Your selections:
On the limitations of scalarisation for multi-objective reinforcement learning of Pareto fronts
- Vamplew, Peter, Yearwood, John, Dazeley, Richard, Berry, Adam
Coarse Q-Learning : Addressing the convergence problem when quantizing continuous state variables
- Dazeley, Richard, Vamplew, Peter, Bignold, Adam
Adaptive Dynamic Programming based Control Scheme for Uncertain Two-Wheel Robots
- Van Nguyen, Thien, Le, Hai, Tran, Hoang, Nguyen, Duc, Nguyen, Minh, Nguyen, Linh
Language representations for generalization in reinforcement learning
- Goodger, Nikolaj, Vamplew, Peter, Foale, Cameron, Dazeley, Richard
A brief guide to multi-objective reinforcement learning and planning : JAAMAS track
- Hayes, Conor, Bargiacchi, Eugenio, Källström, Johan, Macfarlane, Matthew, Reymond, Mathieu, Verstraeten, Timothy, Zintgraf, Luisa, Dazeley, Richard, Heintz, Frederik, Howley, Enda, Irissappane, Aathirai, Mannion, Patrick, Nowé, Ann, Ramos, Gabriel, Restelli, Marcello, Vamplew, Peter, Roijers, Diederik
Scalar reward is not enough : JAAMAS Track
- Vamplew, Peter, Smith, Benjamin, Källström, Johan, Ramos, Gabriel, Rădulescu, Roxana, Roijers, Diederik, Hayes, Conor, Heintz, Frederik, Mannion, Patrick, Libin, Pieter, Dazeley, Richard, Foale, Cameron
Elastic step DDPG : multi-step reinforcement learning for improved sample efficiency
- Ly, Adrian, Dazeley, Richard, Vamplew, Peter, Cruz, Francisco, Aryal, Sunil