BNAS-v2: memory-efficient and performance-collapse-prevented broad neural architecture search

Zixiang Ding, Yaran Chen, Nannan Li, Dongbin Zhao*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

19 Citations (Scopus)

Abstract

In this article, we propose BNAS-v2 to further improve the efficiency of broad neural architecture search (BNAS), which employs a broad convolutional neural network (BCNN) as the search space. In BNAS, the single-path sampling-updating strategy over an overparameterized BCNN leads to a severe unfair training issue, which limits further efficiency gains. To mitigate this issue, we employ a continuous relaxation strategy to optimize all paths of the overparameterized BCNN simultaneously. However, continuous relaxation introduces a performance collapse issue that degrades the performance of the learned BCNN. To address it, we propose the confident learning rate (CLR) and introduce the combination of partial channel connections and edge normalization. Experimental results show that: 1) BNAS-v2 delivers state-of-the-art search efficiency on both CIFAR-10 (0.05 GPU days, 4× faster than BNAS) and ImageNet (0.19 GPU days) with better or competitive performance; 2) the above two solutions effectively alleviate the performance collapse issue; and 3) BNAS-v2 achieves strong generalization on multiple transfer tasks, e.g., MNIST, FashionMNIST, NORB, and SVHN. The code is available at https://github.com/zixiangding/BNASv2.
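The abstract names two known ingredients: continuous relaxation (as in DARTS, where all candidate operations on an edge are mixed by a softmax over architecture weights) and partial channel connections (as in PC-DARTS, where only a fraction of channels pass through the mixed operation to save memory). Below is a minimal PyTorch sketch of how these combine on a single edge of an overparameterized network. The candidate operation set, the 1/K channel ratio, and the class name are illustrative assumptions, not the exact BNAS-v2 configuration; the paper's CLR and edge normalization are not shown.

```python
# Minimal sketch: continuous relaxation + partial channel connections on one
# edge. Assumptions (not from the paper): the candidate op set, K = 4.
import torch
import torch.nn as nn
import torch.nn.functional as F

CANDIDATE_OPS = {
    "skip": lambda c: nn.Identity(),
    "conv3x3": lambda c: nn.Conv2d(c, c, 3, padding=1, bias=False),
    "maxpool3x3": lambda c: nn.MaxPool2d(3, stride=1, padding=1),
}

class PartialMixedOp(nn.Module):
    """Mixed edge: softmax over architecture weights, applied to 1/K channels."""
    def __init__(self, channels: int, k: int = 4):
        super().__init__()
        self.k = k
        c_part = channels // k  # only this fraction enters the mixed op
        self.ops = nn.ModuleList(build(c_part) for build in CANDIDATE_OPS.values())
        # One learnable architecture parameter (alpha) per candidate operation.
        self.alpha = nn.Parameter(1e-3 * torch.randn(len(self.ops)))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        c = x.size(1) // self.k
        x_part, x_rest = x[:, :c], x[:, c:]
        weights = F.softmax(self.alpha, dim=0)  # continuous relaxation
        # All candidate paths are evaluated and mixed in one pass, so every
        # operation receives a gradient update simultaneously, rather than
        # one sampled path at a time.
        mixed = sum(w * op(x_part) for w, op in zip(weights, self.ops))
        return torch.cat([mixed, x_rest], dim=1)  # remaining channels bypass

if __name__ == "__main__":
    edge = PartialMixedOp(channels=16)
    out = edge(torch.randn(2, 16, 32, 32))
    print(out.shape)  # torch.Size([2, 16, 32, 32]), same shape as the input
```

Because only channels/K feature maps flow through the candidate operations, the memory cost of evaluating all paths simultaneously drops by roughly a factor of K, which is what makes whole-supernet optimization feasible on a single GPU.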

Original language: English
Pages (from-to): 6259-6272
Number of pages: 14
Journal: IEEE Transactions on Systems, Man, and Cybernetics: Systems
Volume: 52
Issue number: 10
Publication status: Published - 1 Oct 2022

Keywords

  • Broad neural architecture search (BNAS)
  • confident learning rate (CLR)
  • continuous relaxation
  • image classification
  • partial channel connections (PC)

