Kishan Panaganti | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Mohammad Ghavamzadeh
73 publications
Dileep Kalathil
37 publications
Debdeep Pati
35 publications
Paul Mineiro
28 publications
Cheng Tan
15 publications
Akanksha Saran
8 publications
Mark Rucker
7 publications
Bani Mallick
6 publications
Yabo Niu
5 publications
Sutanoy Dasgupta
4 publications
Zaiyan Xu
2 publications

research

∙ 03/05/2023

Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning

We consider the problem of learning a control policy that is robust agai...

0 Zaiyan Xu, et al. ∙

research

∙ 11/28/2022

Personalized Reward Learning with Interaction-Grounded Learning (IGL)

In an era of countless content offerings, recommender systems alleviate ...

0 Jessica Maghakian, et al. ∙

research

∙ 08/10/2022

Robust Reinforcement Learning using Offline Data

The goal of robust reinforcement learning (RL) is to learn a policy that...

0 Kishan Panaganti, et al. ∙

research

∙ 12/18/2021

Off-Policy Evaluation Using Information Borrowing and Context-Based Switching

We consider the off-policy evaluation (OPE) problem in contextual bandit...

9 Sutanoy Dasgupta, et al. ∙

research

∙ 12/02/2021

Sample Complexity of Robust Reinforcement Learning with a Generative Model

The Robust Markov Decision Process (RMDP) framework focuses on designing...

0 Kishan Panaganti, et al. ∙

research

∙ 06/20/2020

Model-Free Robust Reinforcement Learning with Linear Function Approximation

This paper addresses the problem of model-free reinforcement learning fo...

0 Kishan Panaganti, et al. ∙

research

∙ 06/20/2020

Robust Reinforcement Learning using Least Squares Policy Iteration

This paper addresses the problem of model-free reinforcement learning fo...

0 Kishan Panaganti, et al. ∙

research

∙ 03/03/2020

Bounded Regret for Finitely Parameterized Multi-Armed Bandits

We consider the problem of finitely parameterized multi-armed bandits wh...

0 Kishan Panaganti, et al. ∙

Success!

An error occurred