Sampling is a common strategy for generating text from probabilistic mod...
A fundamental result in psycholinguistics is that less predictable words...
While natural languages differ widely in both canonical word order and w...
Few-shot fine-tuning and in-context learning are two alternative strateg...
Language modeling, a central task in natural language processing, involv...
After just a few hundred training updates, a standard probabilistic mode...
Over the past two decades, numerous studies have demonstrated how less
p...
In this paper, we seek to measure how much information a component in a
...
The Bar-Hillel construction is a classic result in formal language theor...
Modern recommender systems aim to improve user experience. As reinforcem...
While probabilistic language generators have improved dramatically over ...
Probing has become a go-to methodology for interpreting and analyzing de...
The Universal Morphology (UniMorph) project is a collaborative effort
pr...
A central quest of probing is to uncover how pre-trained models encode a...
When generating natural language from neural probabilistic models, high
...
Numerous analyses of reading time (RT) data have been implemented – all ...
Despite achieving incredibly low perplexities on myriad natural language...
While there exist scores of natural languages, each with its unique feat...
Homophony's widespread presence in natural languages is a controversial
...
The uniform information density (UID) hypothesis posits a preference amo...
Pimentel et al. (2020) recently analysed probing from an
information-the...
The unigram distribution is the non-contextual probability of finding a
...
Probes are models devised to investigate the encoding of knowledge – e.g...
The mapping of lexical meanings to wordforms is a major feature of natur...
In common law, the outcome of a new case is determined mostly by precede...
While the prevalence of large pre-trained language models has led to
sig...
This work presents an information-theoretic operationalisation of
cross-...
Psycholinguistic studies of human word processing and lexical access pro...
The question of how to probe contextual word representations in a way th...
Lexical ambiguity is widespread in language, allowing for the reuse of
e...
A broad goal in natural language processing (NLP) is to develop a system...
A major hurdle in data-driven research on typology is having sufficient ...
We present methods for calculating a measure of phonotactic complexity—b...
Measuring what linguistic information is encoded in neural models of lan...
The noun lexica of many natural languages are divided into several decle...
The interest in complex deep neural networks for computer vision applica...
The success of neural networks on a diverse set of NLP tasks has led
res...
A longstanding debate in semiotics centers on the relationship between
l...
This work formalizes the new framework for anomaly detection, called act...
Bimanual gestures are of the utmost importance for the study of motor
co...