Deep reinforcement learning-based patch selection for illuminant estimation

Bolei Xu; Jingxin Liu; Xianxu Hou; Bozhi Liu; Guoping Qiu

doi:10.1016/j.imavis.2019.08.002

Deep reinforcement learning-based patch selection for illuminant estimation

Bolei Xu, Jingxin Liu, Xianxu Hou, Bozhi Liu, Guoping Qiu^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Previous deep learning based approaches to illuminant estimation either resized the raw image to lower resolution or randomly cropped image patches for the deep learning model. However, such practices would inevitably lead to information loss or the selection of noisy patches that would affect estimation accuracy. In this paper, we regard patch selection in neural network based illuminant estimation as a controlling problem of selecting image patches that could help remove noisy patches and improve estimation accuracy. To achieve this, we construct a selection network (SeNet) to learn a patch selection policy. Based on data statistics and the learning progression state of the deep illuminant estimation network (DeNet), the SeNet decides which training patches should be input to the DeNet, which in turn gives feedback to the SeNet for it to update its selection policy. To achieve such interactive and intelligent learning, we utilize a reinforcement learning approach termed policy gradient to optimize the SeNet. We show that the proposed learning strategy can enhance the illuminant estimation accuracy, speed up the convergence and improve the stability of the training process of DeNet. We evaluate our method on two public datasets and demonstrate our method outperforms state-of-the-art approaches.

Original language	English
Article number	103798
Journal	Image and Vision Computing
Volume	91
DOIs	https://doi.org/10.1016/j.imavis.2019.08.002
Publication status	Published - Nov 2019
Externally published	Yes

Keywords

Color constancy
Patch selection
Reinforcement learning

Access to Document

10.1016/j.imavis.2019.08.002

Cite this

@article{b8fb0884ea5147658b1547d44b7f6f5e,

title = "Deep reinforcement learning-based patch selection for illuminant estimation",

abstract = "Previous deep learning based approaches to illuminant estimation either resized the raw image to lower resolution or randomly cropped image patches for the deep learning model. However, such practices would inevitably lead to information loss or the selection of noisy patches that would affect estimation accuracy. In this paper, we regard patch selection in neural network based illuminant estimation as a controlling problem of selecting image patches that could help remove noisy patches and improve estimation accuracy. To achieve this, we construct a selection network (SeNet) to learn a patch selection policy. Based on data statistics and the learning progression state of the deep illuminant estimation network (DeNet), the SeNet decides which training patches should be input to the DeNet, which in turn gives feedback to the SeNet for it to update its selection policy. To achieve such interactive and intelligent learning, we utilize a reinforcement learning approach termed policy gradient to optimize the SeNet. We show that the proposed learning strategy can enhance the illuminant estimation accuracy, speed up the convergence and improve the stability of the training process of DeNet. We evaluate our method on two public datasets and demonstrate our method outperforms state-of-the-art approaches.",

keywords = "Color constancy, Patch selection, Reinforcement learning",

author = "Bolei Xu and Jingxin Liu and Xianxu Hou and Bozhi Liu and Guoping Qiu",

note = "Publisher Copyright: {\textcopyright} 2019 Elsevier B.V.",

year = "2019",

month = nov,

doi = "10.1016/j.imavis.2019.08.002",

language = "English",

volume = "91",

journal = "Image and Vision Computing",

issn = "0262-8856",

}

TY - JOUR

T1 - Deep reinforcement learning-based patch selection for illuminant estimation

AU - Xu, Bolei

AU - Liu, Jingxin

AU - Hou, Xianxu

AU - Liu, Bozhi

AU - Qiu, Guoping

PY - 2019/11

Y1 - 2019/11

N2 - Previous deep learning based approaches to illuminant estimation either resized the raw image to lower resolution or randomly cropped image patches for the deep learning model. However, such practices would inevitably lead to information loss or the selection of noisy patches that would affect estimation accuracy. In this paper, we regard patch selection in neural network based illuminant estimation as a controlling problem of selecting image patches that could help remove noisy patches and improve estimation accuracy. To achieve this, we construct a selection network (SeNet) to learn a patch selection policy. Based on data statistics and the learning progression state of the deep illuminant estimation network (DeNet), the SeNet decides which training patches should be input to the DeNet, which in turn gives feedback to the SeNet for it to update its selection policy. To achieve such interactive and intelligent learning, we utilize a reinforcement learning approach termed policy gradient to optimize the SeNet. We show that the proposed learning strategy can enhance the illuminant estimation accuracy, speed up the convergence and improve the stability of the training process of DeNet. We evaluate our method on two public datasets and demonstrate our method outperforms state-of-the-art approaches.

AB - Previous deep learning based approaches to illuminant estimation either resized the raw image to lower resolution or randomly cropped image patches for the deep learning model. However, such practices would inevitably lead to information loss or the selection of noisy patches that would affect estimation accuracy. In this paper, we regard patch selection in neural network based illuminant estimation as a controlling problem of selecting image patches that could help remove noisy patches and improve estimation accuracy. To achieve this, we construct a selection network (SeNet) to learn a patch selection policy. Based on data statistics and the learning progression state of the deep illuminant estimation network (DeNet), the SeNet decides which training patches should be input to the DeNet, which in turn gives feedback to the SeNet for it to update its selection policy. To achieve such interactive and intelligent learning, we utilize a reinforcement learning approach termed policy gradient to optimize the SeNet. We show that the proposed learning strategy can enhance the illuminant estimation accuracy, speed up the convergence and improve the stability of the training process of DeNet. We evaluate our method on two public datasets and demonstrate our method outperforms state-of-the-art approaches.

KW - Color constancy

KW - Patch selection

KW - Reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85073930676&partnerID=8YFLogxK

U2 - 10.1016/j.imavis.2019.08.002

DO - 10.1016/j.imavis.2019.08.002

M3 - Article

AN - SCOPUS:85073930676

SN - 0262-8856

VL - 91

JO - Image and Vision Computing

JF - Image and Vision Computing

M1 - 103798

ER -

Deep reinforcement learning-based patch selection for illuminant estimation

Abstract

Keywords

Access to Document

Other files and links

Cite this