Software-defined networking (SDN) and software-defined flash (SDF) have ...
Dense matrix multiply (MM) serves as one of the most heavily used kernel...
Neural Architecture Search (NAS) has become a de facto approach in the r...
Convolutional models have been widely used in multiple domains. However,...
Quantization for CNN has shown significant progress with the intention o...
In recent years, hardware accelerators based on field-programmable gate
...
Quantization for Convolutional Neural Network (CNN) has shown significan...
Performance of object detection models has been growing rapidly on two m...
Most existing neural architecture search (NAS) algorithms are dedicated ...
High-level synthesis (HLS) has been widely adopted as it significantly
i...
The combination of Winograd's algorithm and systolic array architecture ...
FPGAs have become emerging computing infrastructures for accelerating
ap...
Optimizing the quality of result (QoR) and the quality of service (QoS) ...
Artificial intelligence (AI) technologies have dramatically advanced in
...
Graph Convolutional Networks (GCNs) are increasingly adopted in large-sc...
With the increasing adoption of graph neural networks (GNNs) in the mach...
Binary neural networks (BNNs) have 1-bit weights and activations. Such
n...
High quality AI solutions require joint optimization of AI algorithms, s...
Existing FPGA-based DNN accelerators typically fall into two design
para...
Quantization has been proven to be an effective method for reducing the
...
The pervasive adoption of Deep Learning (DL) and Graph Processing (GP) m...
High quality AI solutions require joint optimization of AI algorithms an...
To speedup Deep Neural Networks (DNN) accelerator design and enable effe...
Transformer-based models pre-trained on large-scale corpora achieve
stat...
Recent breakthroughs in Deep Neural Networks (DNNs) have fueled a growin...
The rapidly growing demands for powerful AI algorithms in many applicati...
Developing object detection and tracking on resource-constrained embedde...
Unlike traditional PCIe-based FPGA accelerators, heterogeneous SoC-FPGA
...
Developing artificial intelligence (AI) at the edge is always challengin...
Developing deep learning models for resource-constrained Internet-of-Thi...
While embedded FPGAs are attractive platforms for DNN acceleration on
ed...
Connectionist temporal classification (CTC) training criterion provides ...
Neural network accelerators with low latency and low energy consumption ...
Furui first demonstrated that the identity of both consonant and vowel c...
Deep Convolutional Neural Networks have become a Swiss knife in solving
...
The proliferation of high-throughput sequencing machines ensures rapid
g...
We propose a network for Congested Scene Recognition called CSRNet to pr...
Most mainstream Automatic Speech Recognition (ASR) systems consider all
...
Most mainstream Automatic Speech Recognition (ASR) systems consider all
...