research ∙ 04/04/2023
Blockwise Compression of Transformer-based Models without Retraining
Transformer-based models, represented by GPT-3, ChatGPT, and GPT-4, have...
research ∙ 03/16/2023