Evaluating the factuality of long-form text generated by large language ...
To detect the deployment of large language models for malicious use case...
A key component of generating text from modern language models (LMs) is t...
While human evaluation remains best practice for accurately judging the ...
Literary translation is a culturally significant task, but it is bottlen...
To understand what kinds of linguistic knowledge are encoded by pretrain...
Large-scale, high-quality corpora are critical for advancing research in...
Given an input sequence (or prefix), modern language models often assign...
Humanities scholars commonly provide evidence for claims that they make ...
Style transfer is the task of rewriting an input sentence into a target ...
Language models are generally trained on short, truncated input sequence...
The task of long-form question answering (LFQA) involves retrieving docu...
Recent studies on Question Answering (QA) and Conversational QA (ConvQA)...
Abstractive summarization is the task of compressing a long document int...
In the practice of sequential decision making, agents are often designed...
Modern NLP defines the task of style transfer as modifying the style of ...
We study the problem of model extraction in natural language processing,...
Standard decoders for neural machine translation autoregressively genera...
The process of knowledge acquisition can be viewed as a question-answer ...
An approach to make text visually appealing and memorable is semantic re...
We analyze the performance of different sentiment classification models ...
Previous work has shown that neural encoder-decoder speech recognition c...
Connectionist temporal classification (CTC) is a popular sequence predic...