b'Shaoduo Gan'

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Ji Liu
135 publications
Ce Zhang
133 publications
Tong Zhang
99 publications
Jieping Ye
93 publications
Jun Ma
90 publications
Yue Liu
77 publications
Yuxiong He
34 publications
Hanlin Tang
29 publications
Shuang Qiu
26 publications
Shasha Li
26 publications
Gustavo Alonso
25 publications

research

∙ 08/17/2022

Few-shot Named Entity Recognition with Entity-level Prototypical Network Enhanced by Dispersedly Distributed Prototypes

Few-shot named entity recognition (NER) enables us to build a NER system...

0 Bin Ji, et al. ∙

research

∙ 06/12/2022

Stochastic Gradient Descent without Full Data Shuffle

Stochastic gradient descent (SGD) is the cornerstone of modern machine l...

39 Lijie Xu, et al. ∙

research

∙ 12/26/2021

FRuDA: Framework for Distributed Adversarial Domain Adaptation

Breakthroughs in unsupervised domain adaptation (uDA) can help in adapti...

0 Shaoduo Gan, et al. ∙

research

∙ 05/17/2021

Towards Demystifying Serverless Machine Learning Training

The appeal of serverless (FaaS) has triggered a growing interest on how ...

17 Jiawei Jiang, et al. ∙

research

∙ 02/04/2021

1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed

Scalable training of large models (like BERT and GPT-3) requires careful...

0 Hanlin Tang, et al. ∙

research

∙ 08/26/2020

APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm

Adam is the important optimization algorithm to guarantee efficiency and...

11 Hanlin Tang, et al. ∙

research

∙ 03/17/2018

Decentralization Meets Quantization

Optimizing distributed learning systems is an art of balancing between c...

0 Hanlin Tang, et al. ∙

Shaoduo Gan

Featured Co-authors

Few-shot Named Entity Recognition with Entity-level Prototypical Network Enhanced by Dispersedly Distributed Prototypes

Stochastic Gradient Descent without Full Data Shuffle

FRuDA: Framework for Distributed Adversarial Domain Adaptation

Towards Demystifying Serverless Machine Learning Training

1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed

APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm

Decentralization Meets Quantization

Sign in with Google

Consider DeepAI Pro