Modeling customer shopping intentions is a crucial task for e-commerce, ...
Predicting missing facts in a knowledge graph (KG) is crucial as modern ...
We study the problem of query attribute value extraction, which aims to
...
In this paper, we propose an Omni-perception Pre-Trainer (OPT) for
cross...
Nowadays, with many e-commerce platforms conducting global business,
e-c...
Existing methods for skeleton-based action recognition mainly focus on
i...
Autoregressive sequence Generation models have achieved state-of-the-art...
Dynamic skeletal data, represented as the 2D/3D coordinates of human joi...
Most image captioning models are autoregressive, i.e. they generate each...
In this paper, a new perspective is presented for skeleton-based action
...
Self-attention (SA) network has shown profound value in image captioning...
Graph convolutional networks (GCNs), which generalize CNNs to more gener...
Pose-based action recognition has drawn considerable attention recently....
Recent works attempt to improve scene parsing performance by exploring
d...
This document describes our solution for the VATEX Captioning Challenge ...
Image captioning attempts to generate a sentence composed of several
lin...
We present a novel and easy-to-implement training framework for visual
t...
Scene text recognition has received increased attention in the research
...
In this paper, we address the scene segmentation task by capturing rich
...
Traditional deep methods for skeleton-based action recognition usually
s...
Deep neural networks have evolved remarkably over the past few years and...
Deep Neural Networks have achieved remarkable progress during the past f...
Reading text in the wild is a challenging task in the field of computer
...
The region-based Convolutional Neural Network (CNN) detectors such as Fa...