Krishna Somandepalli

research

∙ 09/07/2023

LanSER: Language-Model Supported Speech Emotion Recognition

Speech emotion recognition (SER) models typically rely on costly human-l...

0 Taesik Gong, et al. ∙

research

∙ 08/27/2023

MM-AU: Towards Multimodal Understanding of Advertisement Videos

Advertisement videos (ads) play an integral part in the domain of Intern...

0 Digbalay Bose, et al. ∙

research

∙ 03/13/2023

Contextually-rich human affect perception using multimodal scene information

The process of human affect understanding involves the ability to infer ...

10 Digbalay Bose, et al. ∙

research

∙ 03/05/2023

Heterogeneous Graph Learning for Acoustic Event Classification

Heterogeneous graphs provide a compact, efficient, and scalable way to m...

0 Amir Shirian, et al. ∙

research

∙ 02/14/2023

A dataset for Audio-Visual Sound Event Detection in Movies

Audio event detection is a widely studied audio processing task, with ap...

0 Rajat Hebbar, et al. ∙

research

∙ 10/20/2022

MovieCLIP: Visual Scene Recognition in Movies

Longform media such as movies have complex narrative structures, with ev...

17 Digbalay Bose, et al. ∙

research

∙ 07/16/2022

Visually-aware Acoustic Event Detection using Heterogeneous Graphs

Perception of auditory events is inherently multimodal relying on both a...

1 Amir Shirian, et al. ∙

research

∙ 06/24/2022

Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers

This technical report presents the modeling approaches used in our submi...

0 Josh Belanich, et al. ∙

research

∙ 01/31/2022

Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data

Large scale databases with high-quality manual annotations are scarce in...

4 Amir Shirian, et al. ∙

research

∙ 10/08/2021

Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysis

Societal ideas and trends dictate media narratives and cinematic depicti...

1 Sabyasachee Baruah, et al. ∙

research

∙ 08/25/2020

Multi-Face: Self-supervised Multiview Adaptation for Robust Face Clustering in Videos

Robust face clustering is a key step towards computational understanding...

0 Krishna Somandepalli, et al. ∙

research

∙ 08/19/2020

Victim or Perpetrator? Analysis of Violent Character Portrayals from Movie Scripts

Violent content in the media can influence viewers' perception of the so...

0 Victor R Martinez, et al. ∙

research

∙ 05/12/2020

Generalized Multi-view Shared Subspace Learning using View Bootstrapping

A key objective in multi-view learning is to model the information commo...

3 Krishna Somandepalli, et al. ∙

research

∙ 03/09/2020

Crossmodal learning for audio-visual speech event localization

An objective understanding of media depictions, such as about inclusive ...

4 Rahul Sharma, et al. ∙

research

∙ 02/10/2020

An empirical analysis of information encoded in disentangled neural speaker representations

The primary characteristic of robust speaker representations is that the...

0 Raghuveer Peri, et al. ∙

research

∙ 11/03/2019

Robust speaker recognition using unsupervised adversarial invariance

In this paper, we address the problem of speaker recognition in challeng...

0 Raghuveer Peri, et al. ∙

research

∙ 04/03/2019

Multimodal Representation Learning using Deep Multiset Canonical Correlation

We propose Deep Multiset Canonical Correlation Analysis (dMCCA) as an ex...

0 Krishna Somandepalli, et al. ∙
