Utility-based reinforcement learning: unifying single-objective and multi-objective reinforcement learning

- Vamplew, Peter; Foale, Cameron; Hayes, Conor; Mannion, Patrick; Howley, Enda; Dazeley, Richard; Johnson, Scott; Källström, Johan; Ramos, Gabriel; Rădulescu, Roxana; Röpke, Willem; Roijers, Diederik