Sound scene geotagging is a new topic of research which has evolved from...
We analyse multi-purpose audio using tools to visualise similarities wit...
In this paper we investigate the importance of the extent of memory in
s...
Polyphonic Sound Event Detection (SED) in real-world recordings is a
cha...
The majority of sound scene analysis work focuses on one of two clearly
...
Acoustic Scene Classification (ASC) and Sound Event Detection (SED) are ...
The absence of the queen in a beehive is a very strong indicator of the ...
Lipreading is a difficult gesture classification task. One problem in
co...
We present a new extensible and divisible taxonomy for open set sound sc...
Language models (LM) are very powerful in lipreading systems. Language m...
Visual lip gestures observed whilst lipreading have a few working
defini...
Visemes are the visual equivalent of phonemes. Although not precisely
de...
There is debate if phoneme or viseme units are the most effective for a
...
For machines to lipread, or understand speech from lip movement, they de...
Recent adoption of deep learning methods to the field of machine lipread...
We are at an exciting time for machine lipreading. Traditional research
...
Machine lipreading (MLR) is speech recognition from visual cues and a ni...
To undertake machine lip-reading, we try to recognise speech from a visu...
In machine lip-reading there is continued debate and research around the...
In machine lip-reading, which is identification of speech from visual-on...
A critical assumption of all current visual speech recognition systems i...
In the quest for greater computer lip-reading performance there are a nu...
Visual-only speech recognition is dependent upon a number of factors tha...