Mini-Giants: "Small" Language Models and Open Source Win-Win

07/17/2023
by   Zhengping Zhou, et al.
0

ChatGPT is phenomenal. However, it is prohibitively expensive to train and refine such giant models. Fortunately, small language models are flourishing and becoming more and more competent. We call them "mini-giants". We argue that open source community like Kaggle and mini-giants will win-win in many ways, technically, ethically and socially. In this article, we present a brief yet rich background, discuss how to attain small language models, present a comparative study of small language models and a brief discussion of evaluation methods, discuss the application scenarios where small language models are most needed in the real world, and conclude with discussion and outlook.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset