GuessWhat?!- Visual Object Discovery through dialogue

01 Oct 2021

Who am I? is a popular guessing game where the players use yes/no questions to guess the identity of person/animal/place. Visual object discovery mimics the game with two players (oracle and questioner) through dialog. In this blog post, lets dive into the introductory paper where GuessWhat?! dataset was introduced.

Contributions:

Game Play:

A sample dialog from the dataset is shared below
sample_data

Now that we got a high level understanding of the problem, how the data looks, lets check how the network is designed.

Modelling:

Performance:

Results of the proposed guesser model is given below
sample_data

Authors Observations:

The baseline code is available here

Future Research Questions:

These are some of the questions that are worth pondering from the paper. Many researchers worked on the variations of the following, will discuss some of those papers in coming posts.

#visual #dialog #guesswhat #oracle #questioner #guesser #task-oriented