With the fast growth of parameter size, it becomes increasingly challeng...
Text-to-image generation (TTI) refers to the usage of models that could
...
ChatGPT-like models have revolutionized various applications in artifici...
Large language models (LLMs) such as ChatGPT have demonstrated unprecede...
Collaborative filtering (CF) has been proven to be one of the most effec...
Sparse linear algebra kernels play a critical role in numerous applicati...
The safety of an automated vehicle hinges crucially upon the accuracy of...
Training wide and deep neural networks (DNNs) require large amounts of
s...
Bayesian Neural Networks (BNNs) that possess a property of uncertainty
e...
Multi-accelerator servers are increasingly being deployed in shared
mult...
Recent top-k computation efforts explore the possibility of revising
var...
The quest for determinism in machine learning has disproportionately foc...
Python has become a popular programming language because of its excellen...
High Quality Mobile Virtual Reality (VR) is what the incoming graphics
t...
Designing efficient and scalable sparse linear algebra kernels on modern...
Deep neural networks (DNNs) are becoming increasingly deeper, wider, and...
Linear algebra operations have been widely used in big data analytics an...
With the strong computation capability, NUMA-based multi-GPU system is a...
In recent years, the CNNs have achieved great successes in the image
pro...
High performance multi-GPU computing becomes an inevitable trend due to ...
Going deeper and wider in neural architectures improves the accuracy, wh...
Modeling data sharing in GPU programs is a challenging task because of t...