Mostafa Mahmoud | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Gennady Pekhimenko
37 publications
Andreas Moshovos
16 publications
Milos Nikolic
14 publications
Patrick Judd
8 publications
Sayeh Sharify
7 publications
Jiahui Wang
6 publications
Zissis Poulos
6 publications
Ali Hadi Zadeh
5 publications
Alberto Delmas Lascorz
4 publications
Alberto Delmas
4 publications
Anand Jayarajan
4 publications

research

∙ 04/28/2022

Schrödinger's FP: Dynamic Adaptation of Floating-Point Containers for Deep Learning Training

We introduce a software-hardware co-design approach to reduce memory tra...

0 Milos Nikolic, et al. ∙

research

∙ 03/23/2022

Mokey: Enabling Narrow Fixed-Point Inference for Out-of-the-Box Floating-Point Transformer Models

Increasingly larger and better Transformer models keep advancing state-o...

0 Ali Hadi Zadeh, et al. ∙

research

∙ 01/21/2022

APack: Off-Chip, Lossless Data Compression for Efficient Deep Learning Inference

Data accesses between on- and off-chip memories account for a large frac...

0 Alberto Delmas Lascorz, et al. ∙

research

∙ 10/15/2020

FPRaker: A Processing Element For Accelerating Neural Network Training

We present FPRaker, a processing element for composing training accelera...

0 Omar Mohamed Awad, et al. ∙

research

∙ 09/01/2020

TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training and Inference

TensorDash is a hardware level technique for enabling data-parallel MAC ...

0 Mostafa Mahmoud, et al. ∙

research

∙ 05/10/2018

Laconic Deep Learning Computing

We motivate a method for transparently identifying ineffectual computati...

0 Sayeh Sharify, et al. ∙

research

∙ 03/09/2018

Bit-Tactical: Exploiting Ineffectual Computations in Convolutional Neural Networks: Which, Why, and How

We show that, during inference with Convolutional Neural Networks (CNNs)...

0 Alberto Delmas, et al. ∙

research

∙ 11/30/2016

Memory Controller Design Under Cloud Workloads

This work studies the behavior of state-of-the-art memory controller des...

0 Mostafa Mahmoud, et al. ∙

Success!

An error occurred