TY - JOUR
T1 - Learning location constrained pixel classifiers for image parsing
AU - Dang, Kang
AU - Yuan, Junsong
N1 - Publisher Copyright:
© 2017 Elsevier Inc.
PY - 2017/11
Y1 - 2017/11
N2 - When parsing images with regular spatial layout, the location of a pixel (x,y) can provide important prior for its semantic label. This paper proposes a technique to leverage both location and appearance information for pixel labeling. The proposed method utilizes the spatial layout of the image by building local pixel classifiers that are location constrained, i.e., trained with pixels from a local neighborhood region only. Our proposed local learning works well in different challenging image parsing problems, such as pedestrian parsing, street-view scene parsing and object segmentation, and outperforms existing results that rely on one unified pixel classifier. To better understand the behavior of our local classifier, we perform bias-variance analysis, and demonstrate that the proposed local classifier essentially performs spatial smoothness over the target estimator that uses appearance information and location, which explains why the local classifier is more discriminative but can still handle mis-alignment. Meanwhile, our theoretical and experimental studies suggest the importance of selecting an appropriate neighborhood size to perform location constrained learning, which can significantly influence the parsing results.
AB - When parsing images with regular spatial layout, the location of a pixel (x,y) can provide important prior for its semantic label. This paper proposes a technique to leverage both location and appearance information for pixel labeling. The proposed method utilizes the spatial layout of the image by building local pixel classifiers that are location constrained, i.e., trained with pixels from a local neighborhood region only. Our proposed local learning works well in different challenging image parsing problems, such as pedestrian parsing, street-view scene parsing and object segmentation, and outperforms existing results that rely on one unified pixel classifier. To better understand the behavior of our local classifier, we perform bias-variance analysis, and demonstrate that the proposed local classifier essentially performs spatial smoothness over the target estimator that uses appearance information and location, which explains why the local classifier is more discriminative but can still handle mis-alignment. Meanwhile, our theoretical and experimental studies suggest the importance of selecting an appropriate neighborhood size to perform location constrained learning, which can significantly influence the parsing results.
KW - Local learning
KW - Pedestrian parsing
KW - Spatial layout
KW - Street-view scene parsing
UR - http://www.scopus.com/inward/record.url?scp=85026651038&partnerID=8YFLogxK
U2 - 10.1016/j.jvcir.2017.07.001
DO - 10.1016/j.jvcir.2017.07.001
M3 - Article
AN - SCOPUS:85026651038
SN - 1047-3203
VL - 49
SP - 1
EP - 13
JO - Journal of Visual Communication and Image Representation
JF - Journal of Visual Communication and Image Representation
ER -