Xiulei Song | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Simon Lucas
15 publications
Greg Slabaugh
7 publications
Yizhao Jin
2 publications

research

∙ 01/26/2023

Partial advantage estimator for proximal policy optimization

Estimation of value in policy gradient methods is a fundamental problem....

0 Xiulei Song, et al. ∙

research

∙ 01/26/2023

Joint action loss for proximal policy optimization

PPO (Proximal Policy Optimization) is a state-of-the-art policy gradient...

0 Xiulei Song, et al. ∙

Success!

An error occurred