Large transformer models trained on diverse datasets have shown a remark...
Coagent networks for reinforcement learning (RL) [Thomas and Barto, 2011...
Representation learning and exploration are among the key challenges for...
In this work, we consider the off-policy policy evaluation problem for
c...
We propose a first-order method for convex optimization, where instead o...
Methods for sequential decision-making are often built upon a foundation...
We study the learning dynamics of self-predictive learning for reinforce...
Shared autonomy refers to approaches for enabling an autonomous agent to...
Many sequential decision making problems are high-stakes and require
off...
When faced with sequential decision-making problems, it is often useful ...
Many sequential decision-making systems leverage data collected using pr...
Many real-world sequential decision-making problems involve critical sys...
Strategic recommendations (SR) refer to the problem where an intelligent...
Performance evaluations are critical for quantifying algorithmic advance...
Most reinforcement learning methods are based upon the key assumption th...
We propose a new objective function for finite-horizon episodic Markov
d...
The Markov decision process (MDP) formulation used to model many real-wo...
In many real-world sequential decision making problems, the number of
av...
Most model-free reinforcement learning methods leverage state representa...
Semi-supervised node classification involves learning to classify unlabe...
Given a graph wherein every node has certain attributes associated with ...