Keyword combination extraction in text categorization based on ant colony optimization

Zi Jun Yu, Wei Gang Wu, Jing Xiao, Jun Zhang*, Rui Zhang Huang, Ou Liu

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding > Conference Proceeding > peer-review

4 Citations (Scopus)

Abstract

Due to the increasing number of documents in digital form, automated text categorization (TC) has become increasingly promising over the last ten years. A TC system can automatically assign a document to the most suitable category, but the reason for such an assignment is usually unknown to users. To make a TC system interpretable, it is necessary to select a group of keywords, termed a keyword combination, to describe each text category. In this paper, we propose a novel algorithm, keyword combination extraction based on ant colony optimization (KCEACO), to search for the optimal keyword combination of a target category. By extending traditional feature selection techniques, an evaluation function is designed for evaluating a keyword combination. This function takes into account the relationships among different keywords. Experimental results show that KCEACO can efficiently find the optimal keyword combination from a large number of candidate combinations.
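
The abstract does not give the details of KCEACO, so the following is only a minimal sketch of the general idea: ants sample keyword combinations in proportion to pheromone, a combination-level evaluation function scores each one, and the best combination found so far is reinforced. The function names, the toy documents, the parameters (`k`, `ants`, `iterations`, `evaporation`), and in particular the `score` function are all assumptions made for illustration and are not taken from the paper.

```python
import random


def score(combo, docs, target):
    """Toy evaluation function (an assumption, not the paper's function).

    A document is matched if it contains at least one keyword of the
    combination. Matched target documents are rewarded and matched documents
    of other categories are penalized, so redundant keywords add nothing.
    """
    hits = sum(1 for words, label in docs if label == target and combo & words)
    false_hits = sum(1 for words, label in docs if label != target and combo & words)
    return hits - false_hits


def aco_keyword_combination(docs, target, k=2, ants=10, iterations=30,
                            evaporation=0.1, seed=0):
    """Search for a k-keyword combination with a simple ant colony loop."""
    rng = random.Random(seed)
    # Candidate keywords: every word seen in a document of the target category.
    candidates = sorted({w for words, label in docs if label == target for w in words})
    pheromone = {w: 1.0 for w in candidates}
    best_combo, best_score = None, float("-inf")

    for _ in range(iterations):
        for _ in range(ants):
            # Each ant picks k distinct keywords, weighted by pheromone.
            pool = list(candidates)
            combo = set()
            while len(combo) < k and pool:
                weights = [pheromone[w] for w in pool]
                choice = rng.choices(pool, weights=weights, k=1)[0]
                combo.add(choice)
                pool.remove(choice)
            s = score(combo, docs, target)
            if s > best_score:
                best_combo, best_score = frozenset(combo), s
        # Evaporate all trails, then reinforce the best combination so far.
        for w in pheromone:
            pheromone[w] *= 1.0 - evaporation
        for w in best_combo:
            pheromone[w] += 1.0

    return best_combo, best_score


# Hypothetical toy corpus: (set of words, category label).
docs = [
    ({"goal", "match", "team"}, "sports"),
    ({"team", "coach", "season"}, "sports"),
    ({"goal", "striker"}, "sports"),
    ({"market", "team", "profit"}, "finance"),
    ({"market", "goal", "stock"}, "finance"),
]

combo, value = aco_keyword_combination(docs, "sports", k=2)
print(sorted(combo), value)
```

Because the toy score is computed on the whole combination rather than on each keyword separately, two keywords that match the same documents score no better than one of them alone, which is one simple way a function can account for relationships among keywords.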

Original language: English
Title of host publication: SoCPaR 2009 - Soft Computing and Pattern Recognition
Pages: 430-435
Number of pages: 6
Publication status: Published - 2009
Externally published: Yes
Event: International Conference on Soft Computing and Pattern Recognition, SoCPaR 2009 - Malacca, Malaysia
Duration: 4 Dec 2009 to 7 Dec 2009

Publication series

Name: SoCPaR 2009 - Soft Computing and Pattern Recognition

Conference

Conference: International Conference on Soft Computing and Pattern Recognition, SoCPaR 2009
Country/Territory: Malaysia
City: Malacca
Period: 4/12/09 to 7/12/09

Keywords

  • Ant colony optimization
  • Concept learning
  • Feature selection
  • Keyword combination extraction
  • Text categorization
