TY - JOUR
T1 - Historic Chinese architectures image retrieval by SVM and pyramid histogram of oriented gradients features
AU - Zhang, Bailing
AU - Song, Yonghua
AU - Guan, Sheng uei
AU - Zhang, Yanchun
PY - 2010
Y1 - 2010
N2 - Content-Based Image Retrieval (CBIR) of historic Chinese architecture images is an important area of research bridging society, culture and information technology. Most of the image features used in previous content-based image retrieval systems such as colour, texture and some simple shape descriptors are not effective in describing building images due to high variability in the heterogeneous architectural image collections. This study investigates content-based architectural image retrieval mainly by shape features. The recently proposed shape descriptor, Pyramid Histogram of Oriented Gradients (PHOG) features, counts occurrences of gradient orientation in localized portions of an image and has been proved as an efficient tool for providing spatial distribution of edges. Many existing image retrieval systems attempt to compare the query image with every target image in the database to find the top matching images, resulting in an essentially linear search which is prohibitive when the database is large. To solve the problem, it propose to introduce classification as the first stage in the retrieval system. Based on the PHOG features, it apply the Support Vector Machine (SVM) to automatically classify the ancient Chinese architecture images. Cross-validation test results indicate that the generalization performance of the SVM was over 60% compared to neural network's accuracy of 30% and kNN's accuracy 50%.
AB - Content-Based Image Retrieval (CBIR) of historic Chinese architecture images is an important area of research bridging society, culture and information technology. Most of the image features used in previous content-based image retrieval systems such as colour, texture and some simple shape descriptors are not effective in describing building images due to high variability in the heterogeneous architectural image collections. This study investigates content-based architectural image retrieval mainly by shape features. The recently proposed shape descriptor, Pyramid Histogram of Oriented Gradients (PHOG) features, counts occurrences of gradient orientation in localized portions of an image and has been proved as an efficient tool for providing spatial distribution of edges. Many existing image retrieval systems attempt to compare the query image with every target image in the database to find the top matching images, resulting in an essentially linear search which is prohibitive when the database is large. To solve the problem, it propose to introduce classification as the first stage in the retrieval system. Based on the PHOG features, it apply the Support Vector Machine (SVM) to automatically classify the ancient Chinese architecture images. Cross-validation test results indicate that the generalization performance of the SVM was over 60% compared to neural network's accuracy of 30% and kNN's accuracy 50%.
KW - Australia
KW - China
KW - Chinese historical architectures
KW - Content-based image retrieval
KW - Cross validation
KW - Pyramid histogram of oriented gradient
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=77955251961&partnerID=8YFLogxK
U2 - 10.3923/ijscomp.2010.19.28
DO - 10.3923/ijscomp.2010.19.28
M3 - Article
AN - SCOPUS:77955251961
SN - 1816-9503
VL - 5
SP - 19
EP - 28
JO - International Journal of Soft Computing
JF - International Journal of Soft Computing
IS - 2
ER -