Enhancing recursive supervised learning using clustering and combinatorial optimization (RSL-CC)

Kiruthika Ramanathan; Sheng Uei Guan

doi:10.1007/978-3-540-75396-4_6

Enhancing recursive supervised learning using clustering and combinatorial optimization (RSL-CC)

Kiruthika Ramanathan^*, Sheng Uei Guan

^*Corresponding author for this work

National University of Singapore

Research output: Chapter in Book or Report/Conference proceeding › Chapter › peer-review

Abstract

The use of a team of weak learners to learn a dataset has been shown better than the use of one single strong learner. In fact, the idea is so successful that boosting, an algorithm combining several weak learners for supervised learning, has been considered to be one of the best off-the-shelf classifiers. However, some problems still remain, including determining the optimal number of weak learners and the overfitting of data. In an earlier work, we developed the RPHP algorithm which solves both these problems by using a combination of genetic algorithm, weak learner and pattern distributor. In this paper, we revise the global search component by replacing it with a cluster-based combinatorial optimization. Patterns are clustered according to the output space of the problem, i.e., natural clusters are formed based on patterns belonging to each class. A combinatorial optimization problem is therefore formed, which is solved using evolutionary algorithms. The evolutionary algorithms identify the "easy" and the "difficult" clusters in the system. The removal of the easy patterns then gives way to the focused learning of the more complicated patterns. The problem therefore becomes recursively simpler. Overfitting is overcome by using a set of validation patterns along with a pattern distributor. An algorithm is also proposed to use the pattern distributor to determine the optimal number of recursions and hence the optimal number of weak learners for the problem. Empirical studies show generally good performance when compared to other state-of-the-art methods.

Original language	English
Title of host publication	Engineering Evolutionary Intelligent Systems
Editors	Ajith Abraham, Crina Grosan, Witold Pedrycz
Pages	157-176
Number of pages	20
DOIs	https://doi.org/10.1007/978-3-540-75396-4_6
Publication status	Published - 2008
Externally published	Yes

Publication series

Name	Studies in Computational Intelligence
Volume	82
ISSN (Print)	1860-949X

Access to Document

10.1007/978-3-540-75396-4_6

Cite this

@inbook{43348b8f1f844dfe8ed192ebb5e33025,

title = "Enhancing recursive supervised learning using clustering and combinatorial optimization (RSL-CC)",

abstract = "The use of a team of weak learners to learn a dataset has been shown better than the use of one single strong learner. In fact, the idea is so successful that boosting, an algorithm combining several weak learners for supervised learning, has been considered to be one of the best off-the-shelf classifiers. However, some problems still remain, including determining the optimal number of weak learners and the overfitting of data. In an earlier work, we developed the RPHP algorithm which solves both these problems by using a combination of genetic algorithm, weak learner and pattern distributor. In this paper, we revise the global search component by replacing it with a cluster-based combinatorial optimization. Patterns are clustered according to the output space of the problem, i.e., natural clusters are formed based on patterns belonging to each class. A combinatorial optimization problem is therefore formed, which is solved using evolutionary algorithms. The evolutionary algorithms identify the {"}easy{"} and the {"}difficult{"} clusters in the system. The removal of the easy patterns then gives way to the focused learning of the more complicated patterns. The problem therefore becomes recursively simpler. Overfitting is overcome by using a set of validation patterns along with a pattern distributor. An algorithm is also proposed to use the pattern distributor to determine the optimal number of recursions and hence the optimal number of weak learners for the problem. Empirical studies show generally good performance when compared to other state-of-the-art methods.",

author = "Kiruthika Ramanathan and Guan, {Sheng Uei}",

year = "2008",

doi = "10.1007/978-3-540-75396-4_6",

language = "English",

isbn = "9783540753957",

series = "Studies in Computational Intelligence",

pages = "157--176",

editor = "Ajith Abraham and Crina Grosan and Witold Pedrycz",

booktitle = "Engineering Evolutionary Intelligent Systems",

}

Enhancing recursive supervised learning using clustering and combinatorial optimization (RSL-CC). / Ramanathan, Kiruthika; Guan, Sheng Uei.
Engineering Evolutionary Intelligent Systems. ed. / Ajith Abraham; Crina Grosan; Witold Pedrycz. 2008. p. 157-176 (Studies in Computational Intelligence; Vol. 82).

Research output: Chapter in Book or Report/Conference proceeding › Chapter › peer-review

TY - CHAP

T1 - Enhancing recursive supervised learning using clustering and combinatorial optimization (RSL-CC)

AU - Ramanathan, Kiruthika

AU - Guan, Sheng Uei

PY - 2008

Y1 - 2008

N2 - The use of a team of weak learners to learn a dataset has been shown better than the use of one single strong learner. In fact, the idea is so successful that boosting, an algorithm combining several weak learners for supervised learning, has been considered to be one of the best off-the-shelf classifiers. However, some problems still remain, including determining the optimal number of weak learners and the overfitting of data. In an earlier work, we developed the RPHP algorithm which solves both these problems by using a combination of genetic algorithm, weak learner and pattern distributor. In this paper, we revise the global search component by replacing it with a cluster-based combinatorial optimization. Patterns are clustered according to the output space of the problem, i.e., natural clusters are formed based on patterns belonging to each class. A combinatorial optimization problem is therefore formed, which is solved using evolutionary algorithms. The evolutionary algorithms identify the "easy" and the "difficult" clusters in the system. The removal of the easy patterns then gives way to the focused learning of the more complicated patterns. The problem therefore becomes recursively simpler. Overfitting is overcome by using a set of validation patterns along with a pattern distributor. An algorithm is also proposed to use the pattern distributor to determine the optimal number of recursions and hence the optimal number of weak learners for the problem. Empirical studies show generally good performance when compared to other state-of-the-art methods.

AB - The use of a team of weak learners to learn a dataset has been shown better than the use of one single strong learner. In fact, the idea is so successful that boosting, an algorithm combining several weak learners for supervised learning, has been considered to be one of the best off-the-shelf classifiers. However, some problems still remain, including determining the optimal number of weak learners and the overfitting of data. In an earlier work, we developed the RPHP algorithm which solves both these problems by using a combination of genetic algorithm, weak learner and pattern distributor. In this paper, we revise the global search component by replacing it with a cluster-based combinatorial optimization. Patterns are clustered according to the output space of the problem, i.e., natural clusters are formed based on patterns belonging to each class. A combinatorial optimization problem is therefore formed, which is solved using evolutionary algorithms. The evolutionary algorithms identify the "easy" and the "difficult" clusters in the system. The removal of the easy patterns then gives way to the focused learning of the more complicated patterns. The problem therefore becomes recursively simpler. Overfitting is overcome by using a set of validation patterns along with a pattern distributor. An algorithm is also proposed to use the pattern distributor to determine the optimal number of recursions and hence the optimal number of weak learners for the problem. Empirical studies show generally good performance when compared to other state-of-the-art methods.

UR - http://www.scopus.com/inward/record.url?scp=38049007534&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-75396-4_6

DO - 10.1007/978-3-540-75396-4_6

M3 - Chapter

AN - SCOPUS:38049007534

SN - 9783540753957

T3 - Studies in Computational Intelligence

SP - 157

EP - 176

BT - Engineering Evolutionary Intelligent Systems

A2 - Abraham, Ajith

A2 - Grosan, Crina

A2 - Pedrycz, Witold

ER -

Enhancing recursive supervised learning using clustering and combinatorial optimization (RSL-CC)

Abstract

Publication series

Access to Document

Other files and links

Cite this