TY - JOUR
T1 - A new multi-scale image semantic understanding method based on deep learning
AU - Jiang, Ying Feng
AU - Zhang, Hua
AU - Xue, Yan Bing
AU - Zhou, Mian
AU - Xu, Guang Ping
AU - Gao, Zan
N1 - Publisher Copyright:
© 2016, Science Press in China. All right reserved.
PY - 2016/2/15
Y1 - 2016/2/15
N2 - How to fuse multi-scale information of image in deep learning is a key problem to be solved. To solve these problems, this paper proposed a deep learning method based on multi-scale iterative training for image semantic understanding. The algorithm uses the convolution neural network (CNN) to extract dense feature vectors from raw pixel for encoding regions centered on each pixel. The multi-scale iterative training captures different scales of textures, colors, edges and other important information. A new method combined with superpixel segmentation is proposed, to estimate the leading category of superpixel block and to correct the pixel classification error. It can depict the outline of the target area boundary and complete the final semantic understanding. The experiments on Stanford Background Dataset-8 verify the effectiveness of the proposed method, and the accuracy rate is 77.4%.
AB - How to fuse multi-scale information of image in deep learning is a key problem to be solved. To solve these problems, this paper proposed a deep learning method based on multi-scale iterative training for image semantic understanding. The algorithm uses the convolution neural network (CNN) to extract dense feature vectors from raw pixel for encoding regions centered on each pixel. The multi-scale iterative training captures different scales of textures, colors, edges and other important information. A new method combined with superpixel segmentation is proposed, to estimate the leading category of superpixel block and to correct the pixel classification error. It can depict the outline of the target area boundary and complete the final semantic understanding. The experiments on Stanford Background Dataset-8 verify the effectiveness of the proposed method, and the accuracy rate is 77.4%.
KW - Convolutional neural network (CNN)
KW - Deep learning
KW - Image semantic understanding
KW - Superpixel segmentation
UR - http://www.scopus.com/inward/record.url?scp=85010046866&partnerID=8YFLogxK
U2 - 10.16136/j.joel.2016.02.0652
DO - 10.16136/j.joel.2016.02.0652
M3 - Article
AN - SCOPUS:85010046866
SN - 1005-0086
VL - 27
SP - 224
EP - 230
JO - Guangdianzi Jiguang/Journal of Optoelectronics Laser
JF - Guangdianzi Jiguang/Journal of Optoelectronics Laser
IS - 2
ER -