Transfer learning is an essential tool for improving the performance of
...
The Transformer architecture has improved the performance of deep learni...
Data parallelism does a good job in speeding up the training. However, w...
The recent Natural Language Processing techniques have been refreshing t...
We propose Partially Interpretable Estimators (PIE) which attribute a
pr...
Tensors are becoming prevalent in modern applications such as medical im...
Distance weighted discrimination (DWD) is a margin-based classifier with...
Distance weighted discrimination (DWD) was originally proposed to handle...