HeBERT HebEMO: a Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition

02/03/2021
by   Avihay Chriqui, et al.
0

The use of Bidirectional Encoder Representations from Transformers (BERT) models for different natural language processing (NLP) tasks, and for sentiment analysis in particular, has become very popular in recent years and not in vain. The use of social media is being constantly on the rise. Its impact on all areas of our lives is almost inconceivable. Researches show that social media nowadays serves as one of the main tools where people freely express their ideas, opinions, and emotions. During the current Covid-19 pandemic, the role of social media as a tool to resonate opinions and emotions, became even more prominent. This paper introduces HeBERT and HebEMO. HeBERT is a transformer-based model for modern Hebrew text. Hebrew is considered a Morphological Rich Language (MRL), with unique characteristics that pose a great challenge in developing appropriate Hebrew NLP models. Analyzing multiple specifications of the BERT architecture, we come up with a language model that outperforms all existing Hebrew alternatives on multiple language tasks. HebEMO is a tool that uses HeBERT to detect polarity and extract emotions from Hebrew user-generated content (UGC), which was trained on a unique Covid-19 related dataset that we collected and annotated for this study. Data collection and annotation followed an innovative iterative semi-supervised process that aimed to maximize predictability. HebEMO yielded a high performance of weighted average F1-score = 0.96 for polarity classification. Emotion detection reached an F1-score of 0.78-0.97, with the exception of surprise, which the model failed to capture (F1 = 0.41). These results are better than the best-reported performance, even when compared to the English language.

READ FULL TEXT

page 8

page 16

research
11/21/2019

Emotion Recognition for Vietnamese Social Media Text

Emotion recognition or emotion prediction is a higher approach or a spec...
research
02/19/2021

Towards Emotion Recognition in Hindi-English Code-Mixed Data: A Transformer Based Approach

In the last few years, emotion detection in social-media text has become...
research
08/20/2023

cantnlp@LT-EDI-2023: Homophobia/Transphobia Detection in Social Media Comments using Spatio-Temporally Retrained Language Models

This paper describes our multiclass classification system developed as p...
research
08/21/2019

Predict Emoji Combination with Retrieval Strategy

As emojis are widely used in social media, people not only use an emoji ...
research
12/16/2022

Utilizing distilBert transformer model for sentiment classification of COVID-19's Persian open-text responses

The COVID-19 pandemic has caused drastic alternations in human life in a...
research
06/01/2020

BERT-based Ensembles for Modeling Disclosure and Support in Conversational Social Media Text

There is a growing interest in understanding how humans initiate and hol...
research
08/14/2020

A Hybrid BERT and LightGBM based Model for Predicting Emotion GIF Categories on Twitter

The animated Graphical Interchange Format (GIF) images have been widely ...

Please sign up or login with your details

Forgot password? Click here to reset