Seyed Mohammad Asghari | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Benjamin Van Roy
63 publications
Zheng Wen
50 publications
Ian Osband
31 publications
Sven Gowal
31 publications
Botao Hao
23 publications
Ashutosh Nayyar
20 publications
Yi Ouyang
18 publications
Navid NaderiAlizadeh
18 publications
Brendan O'Donoghue
18 publications
Geoffrey Irving
18 publications
Xiuyuan Lu
14 publications

research

∙ 02/18/2023

Approximate Thompson Sampling via Epistemic Neural Networks

Thompson sampling (TS) is a popular heuristic for action selection, but ...

0 Ian Osband, et al. ∙

research

∙ 11/03/2022

Fine-Tuning Language Models via Epistemic Neural Networks

Large language models are now part of a powerful new paradigm in machine...

0 Ian Osband, et al. ∙

research

∙ 07/01/2022

Robustness of Epinets against Distributional Shifts

Recent work introduced the epinet as a new approach to uncertainty model...

0 Xiuyuan Lu, et al. ∙

research

∙ 06/08/2022

Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping

In machine learning, an agent needs to estimate uncertainty to efficient...

0 Vikranth Dwaracherla, et al. ∙

research

∙ 02/28/2022

Evaluating High-Order Predictive Distributions in Deep Learning

Most work on supervised learning research has focused on marginal predic...

0 Ian Osband, et al. ∙

research

∙ 10/09/2021

Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?

Posterior predictive distributions quantify uncertainties ignored by poi...

0 Ian Osband, et al. ∙

research

∙ 01/27/2020

Regret Bounds for Decentralized Learning in Cooperative Multi-Agent Dynamical Systems

Regret analysis is challenging in Multi-Agent Reinforcement Learning (MA...

53 Seyed Mohammad Asghari, et al. ∙

research

∙ 12/09/2019

Learning to Code: Coded Caching via Deep Reinforcement Learning

We consider a system comprising a file library and a network with a serv...

0 Navid NaderiAlizadeh, et al. ∙

Success!

An error occurred