With the exponential growth of video data, there is an urgent need for
a...
Action detection is a challenging video understanding task, requiring
mo...
In this report, we present our champion solutions to five tracks at Ego4...
Capturing the state changes of interacting objects is a key technology f...
We provide the technical report for Ego4D audio-only diarization challen...
The transductive inference is an effective technique in the few-shot lea...
Temporal action detection (TAD) is extensively studied in the video
unde...
Temporal action detection aims to locate the boundaries of action in the...
The existing action recognition methods are mainly based on clip-level
c...