Ming Cheng

research

∙ 09/15/2023

AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder

Learning high-quality video representation has shown significant applica...

0 Xingjian Diao, et al. ∙

research

∙ 08/22/2023

Masked Cross-image Encoding for Few-shot Segmentation

Few-shot segmentation (FSS) is a dense prediction task that aims to infe...

0 Wenbo Xu, et al. ∙

research

∙ 08/14/2023

VoxBlink: A Large Scale Speaker Verification Dataset on Camera

In this paper, we introduce a large-scale and high-quality audio-visual ...

0 Yuke Lin, et al. ∙

research

∙ 08/14/2023

Viia-hand: a Reach-and-grasp Restoration System Integrating Voice interaction, Computer vision and Auditory feedback for Blind Amputees

Visual feedback plays a crucial role in the process of amputation patien...

0 Chunhao Peng, et al. ∙

research

∙ 05/08/2023

DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models

The art of communication beyond speech there are gestures. The automatic...

0 Sicheng Yang, et al. ∙

research

∙ 03/04/2023

The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis

This paper further explores our previous wake word spotting system ranke...

0 Haoxu Wang, et al. ∙

research

∙ 10/28/2022

Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction

Target-speaker voice activity detection is currently a promising approac...

0 Ming Cheng, et al. ∙

research

∙ 10/07/2022

GMA3D: Local-Global Attention Learning to Estimate Occluded Motions of Scene Flow

Scene flow is the collection of each point motion information in the 3D ...

0 Zhiyang Lu, et al. ∙

research

∙ 08/04/2022

H2-Stereo: High-Speed, High-Resolution Stereoscopic Video System

High-speed, high-resolution stereoscopic (H2-Stereo) video allows us to ...

6 Ming Cheng, et al. ∙

research

∙ 01/24/2022

Multi-Graph Fusion Networks for Urban Region Embedding

Learning the embeddings for urban regions from human mobility data can r...

42 Shangbin Wu, et al. ∙

research

∙ 06/01/2021

DLA-Net: Learning Dual Local Attention Features for Semantic Segmentation of Large-Scale Building Facade Point Clouds

Semantic segmentation of building facade is significant in various appli...

4 Yanfei Su, et al. ∙

research

∙ 10/01/2020

DASGIL: Domain Adaptation for Semantic and Geometric-aware Image-based Localization

Long-Term visual localization under changing environments is a challengi...

0 Hanjiang Hu, et al. ∙

research

∙ 11/14/2019

RWF-2000: An Open Large Scale Video Database for Violence Detection

In recent years, surveillance cameras are widely deployed in public plac...

0 Ming Cheng, et al. ∙

research

∙ 09/28/2019

A Dual Camera System for High Spatiotemporal Resolution Video Acquisition

This paper presents a dual camera system for high spatiotemporal resolut...

14 Ming Cheng, et al. ∙

research

∙ 06/03/2019

RF-Net: An End-to-End Image Matching Network based on Receptive Field

This paper proposes a new end-to-end trainable matching network based on...

8 Xuelun Shen, et al. ∙

research

∙ 05/03/2019

Learned Quality Enhancement via Multi-Frame Priors for HEVC Compliant Low-Delay Applications

Networked video applications, e.g., video conferencing, often suffer fro...

0 Ming Lu, et al. ∙

research

∙ 04/17/2019

LO-Net: Deep Real-time Lidar Odometry

We present a novel deep convolutional network pipeline, LO-Net, for real...

16 Qing Li, et al. ∙

research

∙ 12/27/2018

Advancing the State of the Art in Open Domain Dialog Systems through the Alexa Prize

Building open domain conversational systems that allow users to have eng...

0 Chandra Khatri, et al. ∙

research

∙ 10/15/2018

Bandit Inspired Beam Searching Scheme for mmWave High-Speed Train Communications

High-speed trains (HSTs) are being widely deployed around the world. To ...

0 Jun-Bo Wang, et al. ∙

research

∙ 01/11/2018

On Evaluating and Comparing Conversational Agents

Conversational agents are exploding in popularity. However, much work re...

0 Anu Venkatesh, et al. ∙

research

∙ 01/11/2018

Conversational AI: The Science Behind the Alexa Prize

Conversational agents are exploding in popularity. However, much work re...

0 Ashwin Ram, et al. ∙

research

∙ 10/17/2016

Partial Procedural Geometric Model Fitting for Point Clouds

Geometric model fitting is a fundamental task in computer graphics and c...

0 Zongliang Zhang, et al. ∙
