Abstract
Semantic image segmentation is a task to predict a category label for every image pixel. The key challenge of it is to design a strong feature representation. In this paper, we fuse the hierarchical convolutional neural network (CNN) features and the region-based features as the feature representation. The hierarchical features contain more global information, while the region-based features contain more local information. The combination of these two kinds of features significantly enhances the feature representation. Then the fused features are used to train a softmax classifier to produce per-pixel label assignment probability. And a fully connected conditional random field (CRF) is used as a post-processing method to improve the labeling consistency. We conduct experiments on SIFT flow dataset. The pixel accuracy and class accuracy are 84.4% and 34.86%, respectively.
| Original language | English |
|---|---|
| Pages (from-to) | 381-385 |
| Number of pages | 5 |
| Journal | Optoelectronics Letters |
| Volume | 13 |
| Issue number | 5 |
| DOIs | |
| Publication status | Published - 1 Sept 2017 |
| Externally published | Yes |