Multiple pedestrian tracking faces the challenge of tracking pedestrians...
Volumetric design, also called massing design, is the first and critical...
Semi-competing risks refer to the phenomenon that the terminal event (su...
Cross-scene generalizable NeRF models, which can directly synthesize nov...
Discovering ancient agricultural terraces in desert regions is important...
Load forecasting is of great significance in the power industry as it ca...
This paper introduces InternVid, a large-scale video-centric multimodal
...
While recent advancements in vision-language models have revolutionized
...
Personas are crucial in software development processes, particularly in ...
Breast dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) pl...
Semi-competing risks refer to the phenomenon where a primary outcome eve...
Automatic recognition of disordered and elderly speech remains highly
ch...
Hyperspectral image (HSI) classification is gaining a lot of momentum in...
Video Question Answering (VideoQA) has been significantly advanced from ...
The Landsat program is the longest-running Earth observation program in
...
Accurate prediction of electric load is crucial in power grid planning a...
The Transformer structures have been widely used in computer vision and ...
Encompassing a diverse population of developers, non-technical users,
or...
Requirements engineering (RE) plays a crucial role in developing softwar...
Electrical load forecasting is of great significance for the decision ma...
With the exponential growth of video data, there is an urgent need for
a...
In this study, we initiate an exploration into video understanding by
in...
We present an interactive visual framework named InternGPT, or iGPT for
...
Purpose: To improve the generalization ability of convolutional neural
n...
Small-scale automation services in Software Engineering, known as SE Bot...
Pulse timing is an important topic in nuclear instrumentation, with
far-...
Recently, synthetic aperture radar (SAR) image change detection has beco...
File fragment classification (FFC) on small chunks of memory is essentia...
In this paper, we study a real-world JPEG image restoration problem with...
Efficient automatic segmentation of multi-level (i.e. main and branch)
p...
The outbreak of COVID-19 has led to a global surge of Sinophobia partly
...
Scale is the primary factor for building a powerful foundation model tha...
Video Foundation Models (VFMs) have received limited exploration due to ...
State-of-art NPUs are typically architected as a self-contained sub-syst...
Neural radiance fields (NeRF) show great success in novel view synthesis...
Data augmentation is an effective regularization strategy for mitigating...
Automatic recognition of disordered and elderly speech remains a highly
...
The spiking neural network (SNN) using leaky-integrated-and-fire (LIF)
n...
In the past decades, lots of progress have been done in the video compre...
In this paper, we consider the problem of open-vocabulary semantic
segme...
Recent years witnessed the breakthrough of face recognition with deep
co...
The foundation models have recently shown excellent performance on a var...
The success of deep neural networks requires both high annotation qualit...
Virtual reality and augmented reality (XR) bring increasing demand for 3...
This paper presents our solution for the 2nd COVID-19 Competition, occur...
Presto is an open-source distributed SQL query engine for OLAP, aiming f...
More and more attention has been paid to the segmentation of pulmonary
n...
Learning discriminative spatiotemporal representation is the key problem...
In this report, we present our champion solutions to five tracks at Ego4...
Self-supervised pre-training bears potential to generate expressive
repr...