An online scalarization multi-objective reinforcement learning algorithm : TOPSIS Q-learning

- Mirzanejad, Mohammad, Ebrahimi, Morteza, Vamplew, Peter, Veisi, Hadi