Cooperative multi-agent reinforcement learning (MARL) requires agents to...
Centralised training (CT) is the basis for many popular multi-agent
rein...
Many real-world settings involve costs for performing actions; transacti...
Efficient reinforcement learning (RL) involves a trade-off between
"expl...
Satisfying safety constraints almost surely (or with probability one) ca...
Reward shaping (RS) is a powerful method in reinforcement learning (RL) ...
Computing the Nash equilibrium (NE) for N-player non-zerosum stochastic ...
Many real-world systems such as taxi systems, traffic networks and smart...
Although multi-agent reinforcement learning can tackle systems of
strate...