We present a simple linear regression based approach for learning the we...
The ability to learn robust policies while generalizing over large discr...
We present a simple, sample-efficient algorithm for introducing large bu...
A significant challenge in reinforcement learning is quantifying the com...
Modern deep neural network models are known to erroneously classify
out-...
The vehicle routing problem is a well known class of NP-hard combinatori...
Most existing literature on supply chain and inventory management consid...
Exploration versus exploitation dilemma is a significant problem in
rein...
We describe a novel decision-making problem developed in response to the...
Several real-world scenarios, such as remote control and sensing, are
co...
This paper develops an inherently parallelised, fast, approximate
learni...
Pommerman is a hybrid cooperative/adversarial multi-agent environment, w...
We describe our solution approach for Pommerman TeamRadio, a competition...
We propose a Deep Reinforcement Learning (Deep RL) algorithm for solving...
This paper describes the application of reinforcement learning (RL) to
m...
We propose a generic reward shaping approach for improving rate of
conve...
In the context of the ongoing Covid-19 pandemic, several reports and stu...
The Pommerman simulation was recently developed to mimic the classic Jap...
This paper describes a purely data-driven solution to a class of sequent...