Andrey Kuzmin

Featured Co-authors

Max Welling
167 publications
Victor Lempitsky
53 publications
Tijmen Blankevoort
27 publications
Arash Behboodi
24 publications
Markus Nagel
23 publications
Suraj Srinivas
20 publications
Mart van Baalen
12 publications
Sanghyuk Lee
9 publications
Chirag Patel
6 publications
Joseph Soriaga
5 publications
Andrii Skliar
5 publications

research

∙ 07/06/2023

Pruning vs Quantization: Which is Better?

Neural network pruning and quantization techniques are almost as old as ...

0 Andrey Kuzmin, et al. ∙

research

∙ 03/31/2023

FP8 versus INT8 for efficient deep learning inference

Recently, the idea of using FP8 as a number format for neural network tr...

5 Mart van Baalen, et al. ∙

research

∙ 08/19/2022

FP8 Quantization: The Power of the Exponent

When quantizing neural networks for efficient inference, low-bit integer...

4 Andrey Kuzmin, et al. ∙

research

∙ 07/22/2022

Quantized Sparse Weight Decomposition for Neural Network Compression

In this paper, we introduce a novel method of neural network weight comp...

8 Andrey Kuzmin, et al. ∙

research

∙ 02/02/2022

Cyclical Pruning for Sparse Neural Networks

Current methods for pruning neural network weights iteratively apply mag...

12 Suraj Srinivas, et al. ∙

research

∙ 12/20/2019

Taxonomy and Evaluation of Structured Compression of Convolutional Neural Networks

The success of deep neural networks in many real-world applications is l...

47 Andrey Kuzmin, et al. ∙

research

∙ 11/17/2016

End-to-end Learning of Cost-Volume Aggregation for Real-time Dense Stereo

We present a new deep learning-based approach for dense stereo matching....

0 Andrey Kuzmin, et al. ∙
