On the role of effective and referring questions in GuessWhat?!

Mauricio Mazuecos, Alberto Testoni, Raffaella Bernardi, Luciana Benotti


Abstract
Task success is the standard metric used to evaluate referential visual dialogue systems. In this paper we propose two new metrics that evaluate how each question contributes to the goal. First, we measure how effective each question is by evaluating whether the question discards objects that are not the referent. Second, we define referring questions as those that univocally identify one object in the image. We report the new metrics for human dialogues and for state of the art publicly available models on GuessWhat?!. Regarding our first metric, we find that successful dialogues do not have a higher percentage of effective questions for most models. With respect to the second metric, humans make questions at the end of the dialogue that are referring, confirming their guess before guessing. Human dialogues that use this strategy have a higher task success but models do not seem to learn it.
Anthology ID:
2020.alvr-1.4
Volume:
Proceedings of the First Workshop on Advances in Language and Vision Research
Month:
July
Year:
2020
Address:
Online
Editors:
Xin Wang, Jesse Thomason, Ronghang Hu, Xinlei Chen, Peter Anderson, Qi Wu, Asli Celikyilmaz, Jason Baldridge, William Yang Wang
Venue:
ALVR
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
19–25
Language:
URL:
https://aclanthology.org/2020.alvr-1.4
DOI:
10.18653/v1/2020.alvr-1.4
Bibkey:
Cite (ACL):
Mauricio Mazuecos, Alberto Testoni, Raffaella Bernardi, and Luciana Benotti. 2020. On the role of effective and referring questions in GuessWhat?!. In Proceedings of the First Workshop on Advances in Language and Vision Research, pages 19–25, Online. Association for Computational Linguistics.
Cite (Informal):
On the role of effective and referring questions in GuessWhat?! (Mazuecos et al., ALVR 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.alvr-1.4.pdf