Elastic step DDPG : multi-step reinforcement learning for improved sample efficiency
- Ly, Adrian, Dazeley, Richard, Vamplew, Peter, Cruz, Francisco, Aryal, Sunil
Scalar reward is not enough JAAMAS Track
- Vamplew, Peter, Smith, Benjamin, Källström, Johan, Ramos, Gabriel, Rădulescu, Roxana, Roijers, Diederik, Hayes, Conor, Heintz, Frederik, Mannion, Patrick, Libin, Pieter, Dazeley, Richard, Foale, Cameron
An online scalarization multi-objective reinforcement learning algorithm : TOPSIS Q-learning
- Mirzanejad, Mohammad, Ebrahimi, Morteza, Vamplew, Peter, Veisi, Hadi
A prioritized objective actor-critic method for deep reinforcement learning
- Nguyen, Ngoc, Nguyen, Thanh, Vamplew, Peter, Dazeley, Richard, Nahavandi, Saeid
Reanimating historic malware samples
- Black, Paul, Gondal, Iqbal, Vamplew, Peter, Lakhotia, Arun
API based discrimination of ransomware and benign cryptographic programs
- Black, Paul, Sohail, Ammar, Gondal, Iqbal, Kamruzzaman, Joarder, Vamplew, Peter, Watters, Paul
Identifying cross-version function similarity using contextual features
- Black, Paul, Gondal, Iqbal, Vamplew, Peter, Lakhotia, Arun
- Ul Haq, Ikram, Gondal, Iqbal, Vamplew, Peter, Brown, Simon
Evolved similarity techniques in malware analysis
- Black, Paul, Gondal, Iqbal, Vamplew, Peter, Lakhotia, Arun
Integrating biological heuristics and gene expression data for gene regulatory network inference
- Zarnegar, Armita, Jelinek, Herbert, Vamplew, Peter, Stranieri, Andrew
An anomaly intrusion detection system using C5 decision tree classifier
- Khraisat, Ansam, Gondal, Iqbal, Vamplew, Peter
Rapid anomaly detection using integrated prudence analysis (IPA)
- Maruatona, Omaru, Vamplew, Peter, Dazeley, Richard, Watters, Paul
A taxonomy of griefer type by motivation in massively multiplayer online role-playing games
- Achterbosch, Leigh, Miller, Charlynn, Vamplew, Peter
A heuristic gene regulatory networks model for cardiac function and pathology
- Zarnegar, Armita, Vamplew, Peter, Stranieri, Andrew, Jelinek, Herbert
Coarse Q-Learning : Addressing the convergence problem when quantizing continuous state variables
- Dazeley, Richard, Vamplew, Peter, Bignold, Adam
Patient admission prediction using a pruned fuzzy min-max neural network with rule extraction
- Wang, Jin, Lim, Cheepeng, Creighton, Douglas, Khosravi, Abbas, Nahavandi, Saeid, Ugon, Julien, Vamplew, Peter, Stranieri, Andrew, Martin, Laura, Freischmidt, Anton
Reinforcement learning of Pareto-optimal multiobjective policies using steering
- Vamplew, Peter, Issabekov, Rustam, Dazeley, Richard, Foale, Cameron
- Achterbosch, Leigh, Miller, Charlynn, Vamplew, Peter
Applications of machine learning for linguistic analysis of texts
- Torney, Rosemary, Yearwood, John, Vamplew, Peter, Kelarev, Andrei
RM and RDM, a preliminary evaluation of two prudent RDR Techniques
- Maruatona, Omaru, Vamplew, Peter, Dazeley, Richard