Goal-Oriented Visual Question Generation via Intermediate Rewards

Junjie Zhang; Qi Wu; Chunhua Shen; Jian Zhang; Jianfeng Lu; Anton van den Hengel

doi:10.1007/978-3-030-01228-1_12

Goal-Oriented Visual Question Generation via Intermediate Rewards

Junjie Zhang, Qi Wu^*, Chunhua Shen, Jian Zhang, Jianfeng Lu, Anton van den Hengel

^*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

13 Citations (Scopus)

Abstract

Despite significant progress in a variety of vision-and-language problems, developing a method capable of asking intelligent, goal-oriented questions about images is proven to be an inscrutable challenge. Towards this end, we propose a Deep Reinforcement Learning framework based on three new intermediate rewards, namely goal-achieved, progressive and informativeness that encourage the generation of succinct questions, which in turn uncover valuable information towards the overall goal. By directly optimizing for questions that work quickly towards fulfilling the overall goal, we avoid the tendency of existing methods to generate long series of inane queries that add little value. We evaluate our model on the GuessWhat?! dataset and show that the resulting questions can help a standard ‘Guesser’ identify a specific object in an image at a much higher success rate.

Original language	English
Title of host publication	Computer Vision – ECCV 2018 - 15th European Conference, 2018, Proceedings
Editors	Vittorio Ferrari, Cristian Sminchisescu, Martial Hebert, Yair Weiss
Publisher	Springer Verlag
Pages	189-204
Number of pages	16
ISBN (Print)	9783030012274
DOIs	https://doi.org/10.1007/978-3-030-01228-1_12
Publication status	Published - 2018
Externally published	Yes
Event	15th European Conference on Computer Vision, ECCV 2018 - Munich, Germany Duration: 8 Sept 2018 → 14 Sept 2018

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	11209 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	15th European Conference on Computer Vision, ECCV 2018
Country/Territory	Germany
City	Munich
Period	8/09/18 → 14/09/18

Keywords

Goal-oriented
Intermediate rewards
VQG

Access to Document

10.1007/978-3-030-01228-1_12

Cite this

Zhang, J., Wu, Q., Shen, C., Zhang, J., Lu, J., & van den Hengel, A. (2018). Goal-Oriented Visual Question Generation via Intermediate Rewards. In V. Ferrari, C. Sminchisescu, M. Hebert, & Y. Weiss (Eds.), Computer Vision – ECCV 2018 - 15th European Conference, 2018, Proceedings (pp. 189-204). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11209 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-030-01228-1_12

Zhang, Junjie ; Wu, Qi ; Shen, Chunhua et al. / Goal-Oriented Visual Question Generation via Intermediate Rewards. Computer Vision – ECCV 2018 - 15th European Conference, 2018, Proceedings. editor / Vittorio Ferrari ; Cristian Sminchisescu ; Martial Hebert ; Yair Weiss. Springer Verlag, 2018. pp. 189-204 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{96b3e70db4ed488799c6fba83ddfa7ca,

title = "Goal-Oriented Visual Question Generation via Intermediate Rewards",

abstract = "Despite significant progress in a variety of vision-and-language problems, developing a method capable of asking intelligent, goal-oriented questions about images is proven to be an inscrutable challenge. Towards this end, we propose a Deep Reinforcement Learning framework based on three new intermediate rewards, namely goal-achieved, progressive and informativeness that encourage the generation of succinct questions, which in turn uncover valuable information towards the overall goal. By directly optimizing for questions that work quickly towards fulfilling the overall goal, we avoid the tendency of existing methods to generate long series of inane queries that add little value. We evaluate our model on the GuessWhat?! dataset and show that the resulting questions can help a standard {\textquoteleft}Guesser{\textquoteright} identify a specific object in an image at a much higher success rate.",

keywords = "Goal-oriented, Intermediate rewards, VQG",

author = "Junjie Zhang and Qi Wu and Chunhua Shen and Jian Zhang and Jianfeng Lu and {van den Hengel}, Anton",

note = "Publisher Copyright: {\textcopyright} 2018, Springer Nature Switzerland AG.; 15th European Conference on Computer Vision, ECCV 2018 ; Conference date: 08-09-2018 Through 14-09-2018",

year = "2018",

doi = "10.1007/978-3-030-01228-1_12",

language = "English",

isbn = "9783030012274",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "189--204",

editor = "Vittorio Ferrari and Cristian Sminchisescu and Martial Hebert and Yair Weiss",

booktitle = "Computer Vision – ECCV 2018 - 15th European Conference, 2018, Proceedings",

}

Zhang, J, Wu, Q, Shen, C, Zhang, J, Lu, J & van den Hengel, A 2018, Goal-Oriented Visual Question Generation via Intermediate Rewards. in V Ferrari, C Sminchisescu, M Hebert & Y Weiss (eds), Computer Vision – ECCV 2018 - 15th European Conference, 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11209 LNCS, Springer Verlag, pp. 189-204, 15th European Conference on Computer Vision, ECCV 2018, Munich, Germany, 8/09/18. https://doi.org/10.1007/978-3-030-01228-1_12

Goal-Oriented Visual Question Generation via Intermediate Rewards. / Zhang, Junjie; Wu, Qi; Shen, Chunhua et al.
Computer Vision – ECCV 2018 - 15th European Conference, 2018, Proceedings. ed. / Vittorio Ferrari; Cristian Sminchisescu; Martial Hebert; Yair Weiss. Springer Verlag, 2018. p. 189-204 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11209 LNCS).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Goal-Oriented Visual Question Generation via Intermediate Rewards

AU - Zhang, Junjie

AU - Wu, Qi

AU - Shen, Chunhua

AU - Zhang, Jian

AU - Lu, Jianfeng

AU - van den Hengel, Anton

PY - 2018

Y1 - 2018

N2 - Despite significant progress in a variety of vision-and-language problems, developing a method capable of asking intelligent, goal-oriented questions about images is proven to be an inscrutable challenge. Towards this end, we propose a Deep Reinforcement Learning framework based on three new intermediate rewards, namely goal-achieved, progressive and informativeness that encourage the generation of succinct questions, which in turn uncover valuable information towards the overall goal. By directly optimizing for questions that work quickly towards fulfilling the overall goal, we avoid the tendency of existing methods to generate long series of inane queries that add little value. We evaluate our model on the GuessWhat?! dataset and show that the resulting questions can help a standard ‘Guesser’ identify a specific object in an image at a much higher success rate.

AB - Despite significant progress in a variety of vision-and-language problems, developing a method capable of asking intelligent, goal-oriented questions about images is proven to be an inscrutable challenge. Towards this end, we propose a Deep Reinforcement Learning framework based on three new intermediate rewards, namely goal-achieved, progressive and informativeness that encourage the generation of succinct questions, which in turn uncover valuable information towards the overall goal. By directly optimizing for questions that work quickly towards fulfilling the overall goal, we avoid the tendency of existing methods to generate long series of inane queries that add little value. We evaluate our model on the GuessWhat?! dataset and show that the resulting questions can help a standard ‘Guesser’ identify a specific object in an image at a much higher success rate.

KW - Goal-oriented

KW - Intermediate rewards

KW - VQG

UR - http://www.scopus.com/inward/record.url?scp=85055083095&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-01228-1_12

DO - 10.1007/978-3-030-01228-1_12

M3 - Conference Proceeding

AN - SCOPUS:85055083095

SN - 9783030012274

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 189

EP - 204

BT - Computer Vision – ECCV 2018 - 15th European Conference, 2018, Proceedings

A2 - Ferrari, Vittorio

A2 - Sminchisescu, Cristian

A2 - Hebert, Martial

A2 - Weiss, Yair

PB - Springer Verlag

T2 - 15th European Conference on Computer Vision, ECCV 2018

Y2 - 8 September 2018 through 14 September 2018

ER -

Zhang J, Wu Q, Shen C, Zhang J, Lu J, van den Hengel A. Goal-Oriented Visual Question Generation via Intermediate Rewards. In Ferrari V, Sminchisescu C, Hebert M, Weiss Y, editors, Computer Vision – ECCV 2018 - 15th European Conference, 2018, Proceedings. Springer Verlag. 2018. p. 189-204. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-01228-1_12

Goal-Oriented Visual Question Generation via Intermediate Rewards

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this