TY - GEN
T1 - Keyword combination extraction in text categorization based on ant colony optimization
AU - Yu, Zi Jun
AU - Wu, Wei Gang
AU - Xiao, Jing
AU - Zhang, Jun
AU - Huang, Rui Zhang
AU - Liu, Ou
PY - 2009
Y1 - 2009
N2 - Due to the increasing number of documents in digital form, the automated text categorization (TC) has become more and more promising in the last ten years. A TC system can automatically assign a document with the most suitable category, but the reason for such an assignment is usually unknown by users. To make the TC system be interpretable, it is necessary to select a group of keywords, or termed a keyword combination, to describe each text category. In this paper, we propose a novel algorithm, keyword combination extraction based on ant colony optimization (KCEACO), to search the optimal keyword combination of a target category. By extending the traditional feature selection techniques, an evaluation function is designed for evaluating a keyword combination. This function takes into account the relationships among different keywords. Experimental results show that KCEACO can efficiently find the optimal keyword combination from a large number of candidate combinations.
AB - Due to the increasing number of documents in digital form, the automated text categorization (TC) has become more and more promising in the last ten years. A TC system can automatically assign a document with the most suitable category, but the reason for such an assignment is usually unknown by users. To make the TC system be interpretable, it is necessary to select a group of keywords, or termed a keyword combination, to describe each text category. In this paper, we propose a novel algorithm, keyword combination extraction based on ant colony optimization (KCEACO), to search the optimal keyword combination of a target category. By extending the traditional feature selection techniques, an evaluation function is designed for evaluating a keyword combination. This function takes into account the relationships among different keywords. Experimental results show that KCEACO can efficiently find the optimal keyword combination from a large number of candidate combinations.
KW - Ant colony optimization
KW - Concept learning
KW - Feature selection
KW - Keyword combination extraction
KW - Text categorization
UR - http://www.scopus.com/inward/record.url?scp=77649319918&partnerID=8YFLogxK
U2 - 10.1109/SoCPaR.2009.90
DO - 10.1109/SoCPaR.2009.90
M3 - Conference Proceeding
AN - SCOPUS:77649319918
SN - 9780769538792
T3 - SoCPaR 2009 - Soft Computing and Pattern Recognition
SP - 430
EP - 435
BT - SoCPaR 2009 - Soft Computing and Pattern Recognition
T2 - International Conference on Soft Computing and Pattern Recognition, SoCPaR 2009
Y2 - 4 December 2009 through 7 December 2009
ER -