Transformer models are foundational to natural language processing (NLP)...
A fundamental challenge in Bayesian inference is efficient representatio...
Low-rank tensor compression has been proposed as a promising approach to...
Consistency training is a popular method to improve deep learning models...
The deep neural network (DNN) based AI applications on the edge require ...
Various hardware accelerators have been developed for energy-efficient a...
Low-rank tensor decomposition is one of the most effective approaches to...
Tensor decomposition is an effective approach to compress over-parameter...
Streaming tensor factorization is a powerful tool for processing high-vo...