Complex, long-horizon planning and its combinatorial nature pose steep
c...
Reward functions are difficult to design and often hard to align with hu...
Sequential decision making algorithms often struggle to leverage differe...
Modern Deep Reinforcement Learning (RL) algorithms require estimates of ...
While reinforcement learning (RL) has become a more popular approach for...