Yi Wang

research

∙ 09/19/2023

OccluTrack: Rethinking Awareness of Occlusion for Enhancing Multiple Pedestrian Tracking

Multiple pedestrian tracking faces the challenge of tracking pedestrians...

0 Jianjun Gao, et al. ∙

research

∙ 09/05/2023

Representation Learning for Sequential Volumetric Design Tasks

Volumetric design, also called massing design, is the first and critical...

0 Md Ferdous Alam, et al. ∙

research

∙ 09/04/2023

Direct and Indirect Treatment Effects in the Presence of Semi-Competing Risks

Semi-competing risks refer to the phenomenon that the terminal event (su...

0 Yuhao Deng, et al. ∙

research

∙ 08/22/2023

Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts

Cross-scene generalizable NeRF models, which can directly synthesize nov...

0 Wenyan Cong, et al. ∙

research

∙ 08/04/2023

Deep Semantic Model Fusion for Ancient Agricultural Terrace Detection

Discovering ancient agricultural terraces in desert regions is important...

0 Yi Wang, et al. ∙

research

∙ 07/14/2023

Benchmarks and Custom Package for Electrical Load Forecasting

Load forecasting is of great significance in the power industry as it ca...

0 Zhixian Wang, et al. ∙

research

∙ 07/13/2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

This paper introduces InternVid, a large-scale video-centric multimodal ...

0 Yi Wang, et al. ∙

research

∙ 07/03/2023

JourneyDB: A Benchmark for Generative Image Understanding

While recent advancements in vision-language models have revolutionized ...

0 Junting Pan, et al. ∙

research

∙ 07/01/2023

PersonaGen: A Tool for Generating Personas from User Feedback

Personas are crucial in software development processes, particularly in ...

0 Xishuo Zhang, et al. ∙

research

∙ 06/29/2023

SimPLe: Similarity-Aware Propagation Learning for Weakly-Supervised Breast Cancer Segmentation in DCE-MRI

Breast dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) pl...

0 Yuming Zhong, et al. ∙

research

∙ 06/28/2023

Separable Pathway Effects of Semi-Competing Risks via Multi-State Models

Semi-competing risks refer to the phenomenon where a primary outcome eve...

0 Yuhao Deng, et al. ∙

research

∙ 06/27/2023

Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition

Automatic recognition of disordered and elderly speech remains highly ch...

0 Tianzi Wang, et al. ∙

research

∙ 06/19/2023

Semi-Supervised Learning for hyperspectral images by non parametrically predicting view assignment

Hyperspectral image (HSI) classification is gaining a lot of momentum in...

0 Shivam Pande, et al. ∙

research

∙ 06/15/2023

Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models

Video Question Answering (VideoQA) has been significantly advanced from ...

0 Junting Pan, et al. ∙

research

∙ 06/15/2023

SSL4EO-L: Datasets and Foundation Models for Landsat Imagery

The Landsat program is the longest-running Earth observation program in ...

0 Adam J. Stewart, et al. ∙

research

∙ 06/14/2023

SaDI: A Self-adaptive Decomposed Interpretable Framework for Electric Load Forecasting under Extreme Events

Accurate prediction of electric load is crucial in power grid planning a...

0 Hengbo Liu, et al. ∙

research

∙ 06/09/2023

ModeT: Learning Deformable Image Registration via Motion Decomposition Transformer

The Transformer structures have been widely used in computer vision and ...

0 Haiqiao Wang, et al. ∙

research

∙ 06/08/2023

Towards An Empirical Theory of Ideologies in the Open Source Software Movement

Encompassing a diverse population of developers, non-technical users, or...

0 Yang Yue, et al. ∙

research

∙ 06/02/2023

Multi-Modal Emotion Recognition for Enhanced Requirements Engineering: A Novel Approach

Requirements engineering (RE) plays a crucial role in developing softwar...

0 Ben Cheng, et al. ∙

research

∙ 05/31/2023

DiffLoad: Uncertainty Quantification in Load Forecasting with Diffusion Model

Electrical load forecasting is of great significance for the decision ma...

0 Zhixian Wang, et al. ∙

research

∙ 05/22/2023

VideoLLM: Modeling Video Sequence with Large Language Models

With the exponential growth of video data, there is an urgent need for a...

0 Guo Chen, et al. ∙

research

∙ 05/10/2023

VideoChat: Chat-Centric Video Understanding

In this study, we initiate an exploration into video understanding by in...

0 Kunchang Li, et al. ∙

research

∙ 05/09/2023

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

We present an interactive visual framework named InternGPT, or iGPT for ...

0 Zhaoyang Liu, et al. ∙

research

∙ 05/05/2023

Physics-based network fine-tuning for robust quantitative susceptibility mapping from high-pass filtered phase

Purpose: To improve the generalization ability of convolutional neural n...

1 Jinwei Zhang, et al. ∙

research

∙ 04/28/2023

Optimizing Workflow for Elite Developers: Perspectives on Leveraging SE Bots

Small-scale automation services in Software Engineering, known as SE Bot...

0 Zhendong Wang, et al. ∙

research

∙ 04/24/2023

Label-free timing analysis of modularized nuclear detectors with physics-constrained deep learning

Pulse timing is an important topic in nuclear instrumentation, with far-...

0 Pengcheng Ai, et al. ∙

research

∙ 04/22/2023

SSN: Stockwell Scattering Network for SAR Image Change Detection

Recently, synthetic aperture radar (SAR) image change detection has beco...

0 Gong Chen, et al. ∙

research

∙ 04/14/2023

A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram Embeddings

File fragment classification (FFC) on small chunks of memory is essentia...

8 Wenyang Liu, et al. ∙

research

∙ 04/14/2023

Bitstream-Corrupted JPEG Images are Restorable: Two-stage Compensation and Alignment Framework for Image Restoration

In this paper, we study a real-world JPEG image restoration problem with...

2 Wenyang Liu, et al. ∙

research

∙ 04/07/2023

Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

Efficient automatic segmentation of multi-level (i.e. main and branch) p...

1 Gongning Luo, et al. ∙

research

∙ 04/07/2023

Detecting Chinese Fake News on Twitter during the COVID-19 Pandemic

The outbreak of COVID-19 has led to a global surge of Sinophobia partly ...

0 Yongjun Zhang, et al. ∙

research

∙ 03/29/2023

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Scale is the primary factor for building a powerful foundation model tha...

0 Limin Wang, et al. ∙

research

∙ 03/28/2023

Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Video Foundation Models (VFMs) have received limited exploration due to ...

0 Kunchang Li, et al. ∙

research

∙ 03/17/2023

VPU-EM: An Event-based Modeling Framework to Evaluate NPU Performance and Power Efficiency at Scale

State-of-art NPUs are typically architected as a self-contained sub-syst...

0 Charles Qi, et al. ∙

research

∙ 03/13/2023

NeRFLiX: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-viewpoint MiXer

Neural radiance fields (NeRF) show great success in novel view synthesis...

1 Kun Zhou, et al. ∙

research

∙ 03/12/2023

PointPatchMix: Point Cloud Mixing with Patch Scoring

Data augmentation is an effective regularization strategy for mitigating...

0 Yi Wang, et al. ∙

research

∙ 02/28/2023

Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition

Automatic recognition of disordered and elderly speech remains a highly ...

0 Shujie Hu, et al. ∙

research

∙ 02/02/2023

Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition

The spiking neural network (SNN) using leaky-integrated-and-fire (LIF) n...

0 Minglun Han, et al. ∙

research

∙ 01/25/2023

Rate-Perception Optimized Preprocessing for Video Coding

In the past decades, lots of progress have been done in the video compre...

0 Chengqian Ma, et al. ∙

research

∙ 01/22/2023

Learning Open-vocabulary Semantic Segmentation Models From Natural Language Supervision

In this paper, we consider the problem of open-vocabulary semantic segme...

7 Jilan Xu, et al. ∙

research

∙ 12/26/2022

A Survey of Face Recognition

Recent years witnessed the breakthrough of face recognition with deep co...

5 Xinyi Wang, et al. ∙

research

∙ 12/06/2022

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

The foundation models have recently shown excellent performance on a var...

4 Yi Wang, et al. ∙

research

∙ 12/01/2022

Noisy Label Detection for Speaker Recognition

The success of deep neural networks requires both high annotation qualit...

0 Ruibin Yuan, et al. ∙

research

∙ 11/29/2022

NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views

Virtual reality and augmented reality (XR) bring increasing demand for 3...

0 Dejia Xu, et al. ∙

research

∙ 11/26/2022

CMC v2: Towards More Accurate COVID-19 Detection with Discriminative Video Priors

This paper presents our solution for the 2nd COVID-19 Competition, occur...

0 Junlin Hou, et al. ∙

research

∙ 11/20/2022

Metadata Caching in Presto: Towards Fast Data Processing

Presto is an open-source distributed SQL query engine for OLAP, aiming f...

0 Beinan Wang, et al. ∙

research

∙ 11/19/2022

Adjacent Slice Feature Guided 2.5D Network for Pulmonary Nodule Segmentation

More and more attention has been paid to the segmentation of pulmonary n...

0 Xinwei Xue, et al. ∙

research

∙ 11/17/2022

UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

Learning discriminative spatiotemporal representation is the key problem...

0 Kunchang Li, et al. ∙

research

∙ 11/17/2022

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

In this report, we present our champion solutions to five tracks at Ego4...

0 Guo Chen, et al. ∙

research

∙ 11/13/2022

SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation

Self-supervised pre-training bears potential to generate expressive repr...

0 Yi Wang, et al. ∙

Yi Wang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro