Automatic Design of Deep Networks with Neural Blocks

Guoqiang Zhong*, Wencong Jiao, Wei Gao, Kaizhu Huang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

21 Citations (Scopus)

Abstract

In recent years, deep neural networks (DNNs) have achieved great successes in many areas, such as cognitive computation, pattern recognition, and computer vision. Although many hand-crafted deep networks have been proposed in the literature, designing a well-behaved neural network for a specific application requires high-level expertise yet. Hence, the automatic architecture design of DNNs has become a challenging and important problem. In this paper, we propose a new reinforcement learning method, whose action policy is to select neural blocks and construct deep networks. We define the action search space with three types of neural blocks, i.e., dense block, residual block, and inception-like block. Additionally, we have also designed several variants for the residual and inception-like blocks. The optimal network is automatically learned by a Q-learning agent, which is iteratively trained to generate well-performed deep networks. To evaluate the proposed method, we have conducted experiments on three datasets, MNIST, SVHN, and CIFAR-10, for image classification applications. Compared with existing hand-crafted and auto-generated neural networks, our auto-designed neural network delivers promising results. Moreover, the proposed reinforcement learning algorithm for deep networks design only runs on one GPU, demonstrating much higher efficiency than most of the previous deep network search approaches.

Original languageEnglish
Pages (from-to)1-12
Number of pages12
JournalCognitive Computation
Volume12
Issue number1
DOIs
Publication statusPublished - 1 Jan 2020

Keywords

  • Automatic deep networks design
  • Deep convolutional neural networks
  • Image classification
  • Neural blocks
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Automatic Design of Deep Networks with Neural Blocks'. Together they form a unique fingerprint.

Cite this