A conceptual framework for externally-influenced agents: an assisted reinforcement learning review
- Bignold, Adam, Cruz, Francisco, Taylor, Matthew, Brys, Tim, Dazeley, Richard, Vamplew, Peter, Foale, Cameron
A NetHack learning environment language wrapper for autonomous agents
- Goodger, Nikolaj, Vamplew, Peter, Foale, Cameron, Dazeley, Richard
An evaluation methodology for interactive reinforcement learning with simulated users
- Bignold, Adam, Cruz, Francisco, Dazeley, Richard, Vamplew, Peter, Foale, Cameron
Discrete-to-deep reinforcement learning methods
- Kurniawan, Budi, Vamplew, Peter, Papasimeon, Michael, Dazeley, Richard, Foale, Cameron
Human engagement providing evaluative and informative advice for interactive reinforcement learning
- Bignold, Adam, Cruz, Francisco, Dazeley, Richard, Vamplew, Peter, Foale, Cameron
Human-aligned artificial intelligence is a multiobjective problem
- Vamplew, Peter, Dazeley, Richard, Foale, Cameron, Firmin, Sally, Mummery, Jane
Language representations for generalization in reinforcement learning
- Goodger, Nikolaj, Vamplew, Peter, Foale, Cameron, Dazeley, Richard
Levels of explainable artificial intelligence for human-aligned conversational explanations
- Dazeley, Richard, Vamplew, Peter, Foale, Cameron, Young, Cameron, Aryal, Sunil, Cruz, Francisco
Modeling neurocognitive reaction time with gamma distribution
- Santhanagopalan, Meena, Chetty, Madhu, Foale, Cameron, Aryal, Sunil, Klein, Britt
Non-functional regression: A new challenge for neural networks
- Vamplew, Peter, Dazeley, Richard, Foale, Cameron, Choudhury, Tanveer
Portal-based sound propagation for first-person computer games
- Foale, Cameron, Vamplew, Peter
Potential-based multiobjective reinforcement learning approaches to low-impact agents for AI safety
- Vamplew, Peter, Foale, Cameron, Dazeley, Richard, Bignold, Adam
Reinforcement learning of Pareto-optimal multiobjective policies using steering
- Vamplew, Peter, Issabekov, Rustam, Dazeley, Richard, Foale, Cameron
Relevance of frequency of heart-rate peaks as indicator of ‘biological’ stress level
- Santhanagopalan, Meena, Chetty, Madhu, Foale, Cameron, Aryal, Sunil, Klein, Britt
Scalar reward is not enough: a response to Silver, Singh, Precup and Sutton (2021)
- Vamplew, Peter, Smith, Benjamin, Källström, Johan, Ramos, Gabriel, Rădulescu, Roxana, Roijers, Diederik, Hayes, Conor, Heintz, Fredrik, Mannion, Patrick, Libin, Pieter, Dazeley, Richard, Foale, Cameron
Scalar reward is not enough JAAMAS Track
- Vamplew, Peter, Smith, Benjamin, Källström, Johan, Ramos, Gabriel, Rădulescu, Roxana, Roijers, Diederik, Hayes, Conor, Heintz, Fredrik, Mannion, Patrick, Libin, Pieter, Dazeley, Richard, Foale, Cameron
Softmax exploration strategies for multiobjective reinforcement learning
- Vamplew, Peter, Dazeley, Richard, Foale, Cameron
Statistical calibration of long-term reanalysis data for Australian fire weather conditions
- Biswas, Soubhik, Chand, Savin, Dowdy, Andrew, Wright, Wendy, Foale, Cameron, Zhao, Xiaohui, Deo, A
Steering approaches to Pareto-optimal multiobjective reinforcement learning
- Vamplew, Peter, Issabekov, Rustam, Dazeley, Richard, Foale, Cameron, Berry, Adam, Moore, Tim, Creighton, Douglas
The impact of environmental stochasticity on value-based multiobjective reinforcement learning
- Vamplew, Peter, Foale, Cameron, Dazeley, Richard