b'Shengyi Huang'

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Shuicheng Yan
187 publications
Bo Liu
167 publications
Min Lin
38 publications
Santiago Ontañón
32 publications
Zhongwen Xu
28 publications
Anssi Kanervisto
26 publications
Simon Lucas
15 publications
Weixun Wang
14 publications
Viktor Makoviychuk
13 publications
Antonin Raffin
7 publications
Zichen Liu
5 publications

research

∙ 06/21/2022

EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine

There has been significant progress in developing reinforcement learning...

8 Jiayi Weng, et al. ∙

research

∙ 05/18/2022

A2C is a special case of PPO

Advantage Actor-critic (A2C) and Proximal Policy Optimization (PPO) are ...

11 Shengyi Huang, et al. ∙

research

∙ 11/16/2021

CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms

CleanRL is an open-source library that provides high-quality single-file...

0 Shengyi Huang, et al. ∙

research

∙ 11/12/2020

Griddly: A platform for AI research in games

In recent years, there have been immense breakthroughs in Game AI resear...

0 Chris Bamford, et al. ∙

research

∙ 10/05/2020

Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games

Training agents using Reinforcement Learning in games with sparse reward...

0 Shengyi Huang, et al. ∙

research

∙ 06/25/2020

A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

In recent years, Deep Reinforcement Learning (DRL) algorithms have achie...

8 Shengyi Huang, et al. ∙

research

∙ 10/26/2019

Comparing Observation and Action Representations for Deep Reinforcement Learning in MicroRTS

This paper presents a preliminary study comparing different observation ...

0 Shengyi Huang, et al. ∙

Shengyi Huang

Featured Co-authors

EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine

A2C is a special case of PPO

CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms

Griddly: A platform for AI research in games

Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games

A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Comparing Observation and Action Representations for Deep Reinforcement Learning in MicroRTS

Sign in with Google

Consider DeepAI Pro