Emiel van Miltenburg

research

∙ 03/29/2023

Evaluating NLG systems: A brief introduction

This year the International Conference on Natural Language Generation (I...

0 Emiel van Miltenburg, et al. ∙

research

∙ 12/08/2022

Implicit causality in GPT-2: a case study

This case study investigates the extent to which a language model (GPT-2...

0 Hien Huynh, et al. ∙

research

∙ 08/02/2021

Underreporting of errors in NLG output, and what to do about it

We observe a severe under-reporting of the different kinds of errors tha...

0 Emiel van Miltenburg, et al. ∙

research

∙ 06/16/2021

Automatic Construction of Evaluation Suites for Natural Language Generation Datasets

Machine learning approaches applied to NLP are often evaluated by summar...

0 Simon Mille, et al. ∙

research

∙ 03/11/2021

Preregistering NLP Research

Preregistration refers to the practice of specifying what you are going ...

0 Emiel van Miltenburg, et al. ∙

research

∙ 02/02/2021

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

We introduce GEM, a living benchmark for natural language Generation (NL...

5 Sebastian Gehrmann, et al. ∙

research

∙ 06/15/2020

On the use of human reference data for evaluating automatic image descriptions

Automatic image description systems are commonly trained and evaluated u...

0 Emiel van Miltenburg, et al. ∙

research

∙ 08/23/2019

Neural data-to-text generation: A comparison between pipeline and end-to-end architectures

Traditionally, most data-to-text applications have been designed using a...

0 Thiago Castro Ferreira, et al. ∙

research

∙ 07/06/2017

Cross-linguistic differences and similarities in image descriptions

Automatic image description systems are commonly trained and evaluated o...

0 Emiel van Miltenburg, et al. ∙

research

∙ 04/13/2017

Room for improvement in automatic image description: an error analysis

In recent years we have seen rapid and significant progress in automatic...

0 Emiel van Miltenburg, et al. ∙

research

∙ 06/20/2016

Pragmatic factors in image description: the case of negations

We provide a qualitative analysis of the descriptions containing negatio...

0 Emiel van Miltenburg, et al. ∙

research

∙ 05/19/2016

Stereotyping and Bias in the Flickr30K Dataset

An untested assumption behind the crowdsourced descriptions of the image...

0 Emiel van Miltenburg, et al. ∙

research

∙ 04/30/2015

Detecting and ordering adjectival scalemates

This paper presents a pattern-based method that can be used to infer adj...

0 Emiel van Miltenburg, et al. ∙
