Your selections:
On the limitations of scalarisation for multi-objective reinforcement learning of Pareto fronts
- Vamplew, Peter, Yearwood, John, Dazeley, Richard, Berry, Adam
Coarse Q-Learning : Addressing the convergence problem when quantizing continuous state variables
- Dazeley, Richard, Vamplew, Peter, Bignold, Adam
Adaptive Dynamic Programming based Control Scheme for Uncertain Two-Wheel Robots
- Van Nguyen, Thien, Le, Hai, Tran, Hoang, Nguyen, Duc, Nguyen, Minh, Nguyen, Linh
Scalar reward is not enough JAAMAS Track
- Vamplew, Peter, Smith, Benjamin, Källström, Johan, Ramos, Gabriel, Rădulescu, Roxana, Roijers, Diederik, Hayes, Conor, Heintz, Frederik, Mannion, Patrick, Libin, Pieter, Dazeley, Richard, Foale, Cameron
Elastic step DDPG : multi-step reinforcement learning for improved sample efficiency
- Ly, Adrian, Dazeley, Richard, Vamplew, Peter, Cruz, Francisco, Aryal, Sunil
Are you sure you would like to clear your session, including search history and login status?