Stereotyping and Bias in the Flickr30K Dataset

05/19/2016
by   Emiel van Miltenburg, et al.
0

An untested assumption behind the crowdsourced descriptions of the images in the Flickr30K dataset (Young et al., 2014) is that they "focus only on the information that can be obtained from the image alone" (Hodosh et al., 2013, p. 859). This paper presents some evidence against this assumption, and provides a list of biases and unwarranted inferences that can be found in the Flickr30K dataset. Finally, it considers methods to find examples of these, and discusses how we should deal with stereotype-driven descriptions in future applications.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset