TY - GEN
T1 - Device Placement Optimization for Deep Neural Networks via One-shot Model and Reinforcement Learning
AU - Ding, Zixiang
AU - Chen, Yaran
AU - Li, Nannan
AU - Zhao, Dongbin
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/12/1
Y1 - 2020/12/1
N2 - With the development of deep learning, which employs deep neural networks (DNNs) as a powerful tool, computational requirements grow rapidly together with the increasing size (e.g., depth and number of parameters) of DNNs. Currently, model and data parallelism are employed to accelerate the training and inference of DNNs. However, these techniques make device placement decisions for DNNs based on the heuristics and intuitions of machine learning experts. In this paper, we propose a novel approach for designing device placements of DNNs automatically. For a DNN, we employ a sequence-to-sequence model as a controller to sample device placements from a one-shot model, which contains all possible device placements with respect to a specific hardware environment (e.g., CPU and GPU). Then, reinforcement learning treats the execution time of a sampled device placement as the reward to guide the sequence-to-sequence model toward finding better ones. The proposed approach is employed to optimize the device placement for both model and data parallelism of Inception-V3 on ImageNet. Experimental results show that the optimal placements discovered by our method outperform hand-crafted ones.
AB - With the development of deep learning, which employs deep neural networks (DNNs) as a powerful tool, computational requirements grow rapidly together with the increasing size (e.g., depth and number of parameters) of DNNs. Currently, model and data parallelism are employed to accelerate the training and inference of DNNs. However, these techniques make device placement decisions for DNNs based on the heuristics and intuitions of machine learning experts. In this paper, we propose a novel approach for designing device placements of DNNs automatically. For a DNN, we employ a sequence-to-sequence model as a controller to sample device placements from a one-shot model, which contains all possible device placements with respect to a specific hardware environment (e.g., CPU and GPU). Then, reinforcement learning treats the execution time of a sampled device placement as the reward to guide the sequence-to-sequence model toward finding better ones. The proposed approach is employed to optimize the device placement for both model and data parallelism of Inception-V3 on ImageNet. Experimental results show that the optimal placements discovered by our method outperform hand-crafted ones.
KW - controller
KW - deep neural networks
KW - device placement
KW - one-shot model
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85099704921&partnerID=8YFLogxK
U2 - 10.1109/SSCI47803.2020.9308141
DO - 10.1109/SSCI47803.2020.9308141
M3 - Conference Proceeding
AN - SCOPUS:85099704921
T3 - 2020 IEEE Symposium Series on Computational Intelligence, SSCI 2020
SP - 1478
EP - 1484
BT - 2020 IEEE Symposium Series on Computational Intelligence, SSCI 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE Symposium Series on Computational Intelligence, SSCI 2020
Y2 - 1 December 2020 through 4 December 2020
ER -