Jiajun Fan | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Hao Wang
366 publications
Xing Xie
128 publications
Jianye Hao
82 publications
Yue Huang
46 publications
Chaozhuo Li
25 publications
Jianxun Lian
20 publications
Weiming Liu
16 publications
Yuxin Huang
12 publications
Changnan Xiao
10 publications
Haoxuan Li
8 publications
Wanyue Xu
6 publications

research

∙ 08/05/2023

ConvFormer: Revisiting Transformer for Sequential User Modeling

Sequential user modeling, a critical task in personalized recommender sy...

0 Hao Wang, et al. ∙

research

∙ 10/20/2022

Entire Space Counterfactual Learning: Tuning, Analytical Properties and Industrial Applications

As a basic research problem for building effective recommender systems, ...

0 Hao Wang, et al. ∙

research

∙ 06/07/2022

Generalized Data Distribution Iteration

To obtain higher sample efficiency and superior final performance simult...

0 Jiajun Fan, et al. ∙

research

∙ 12/08/2021

A Review for Deep Reinforcement Learning in Atari:Benchmarks, Challenges, and Solutions

The Arcade Learning Environment (ALE) is proposed as an evaluation platf...

0 Jiajun Fan, et al. ∙

research

∙ 06/11/2021

GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning

Deep Q Network (DQN) firstly kicked the door of deep reinforcement learn...

0 Jiajun Fan, et al. ∙

research

∙ 06/01/2021

An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning

Policy-based reinforcement learning methods suffer from the policy colla...

0 Changnan Xiao, et al. ∙

research

∙ 05/09/2021

CASA-B: A Unified Framework of Model-Free Reinforcement Learning

Building on the breakthrough of reinforcement learning, this paper intro...

0 Changnan Xiao, et al. ∙

research

∙ 11/13/2020

Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning

Constructing agents with planning capabilities has long been one of the ...

0 Jiajun Fan, et al. ∙

Success!

An error occurred