Object detectors are at the heart of many semi- and fully autonomous dec...
Nowadays, face recognition systems surpass human performance on several
...
Recently, face recognition systems have demonstrated remarkable performa...
Smart City applications such as intelligent traffic routing or accident
...
Image resolution, or in general, image quality, plays an essential role ...
State-of-the-art face recognition (FR) approaches have shown remarkable
...
Gait recognition is a promising biometric with unique properties for
ide...
Countless applications depend on accurate predictions with reliable
conf...
Real-world face recognition applications often deal with suboptimal imag...
Face recognition approaches often rely on equal image resolution for
ver...
Photos of faces captured in unconstrained environments, such as large cr...
Successful active speaker detection requires a three-stage pipeline: (i)...
Gait recognition is a promising video-based biometric for identifying
in...
Person Re-Identification aims to retrieve person identities from images
...
Many end-to-end Automatic Speech Recognition (ASR) systems still rely on...
Distracted drivers are more likely to fail to anticipate hazards, which
...
Convolutional Neural Networks with 3D kernels (3D CNNs) currently achiev...
Audio Adversarial Examples (AAE) represent specially created inputs mean...
Recent advances in Automatic Speech Recognition (ASR) demonstrated how
e...
Recent end-to-end Automatic Speech Recognition (ASR) systems demonstrate...
Nowadays, attention models are one of the popular candidates for speech
...
Approaches for kinship verification often rely on cosine distances betwe...
The use of hand gestures provides a natural alternative to cumbersome
in...
For many practical problems and applications, it is not feasible to crea...
Spatiotemporal action localization requires incorporation of two sources...
Keyword Spotting (KWS) enables speech-based user interaction on smart
de...
Understanding actions and gestures in video streams requires temporal
re...
Many road accidents occur due to distracted drivers. Today, driver monit...
Video representation is a key challenge in many computer vision applicat...
The use of hand gestures provides a natural alternative to cumbersome
in...
Recently, convolutional neural networks with 3D kernels (3D CNNs) have b...
Real-time recognition of dynamic hand gestures from video streams is a
c...
A convolutional layer in a Convolutional Neural Network (CNN) consists o...
The task of multiple people tracking in monocular videos is challenging
...
Gait as a biometric property for person identification plays a key role ...
Acquiring spatio-temporal states of an action is the most crucial step f...
In this work, we present a novel background subtraction system that uses...
Transcription of broadcast news is an interesting and challenging applic...
We present a system for identifying humans by their walking sounds. This...