Training large language models to follow instructions makes them perform...
Without proper safeguards, large language models will readily follow
mal...
Many NLP tasks exhibit human label variation, where different annotators...
Large language models (LLMs) are used to generate content for a wide ran...
Online sexism is a widespread and harmful phenomenon. Automated tools ca...
Hate speech is a global phenomenon, but most hate speech datasets so far...
Hate speech detection models are typically evaluated on held-out test se...
Labelled data is the foundation of most natural language processing task...
Detecting online hate is a complex task, and low-performing models have
...
Language use differs between domains and even within a domain, language ...
Detecting online hate is a difficult task that even state-of-the-art mod...