TY - JOUR
T1 - Learning from Few Samples with Memory Network
AU - Zhang, Shufei
AU - Huang, Kaizhu
AU - Zhang, Rui
AU - Hussain, Amir
N1 - Publisher Copyright:
© 2017, Springer Science+Business Media, LLC.
PY - 2018/2/1
Y1 - 2018/2/1
AB - Neural networks (NN) have achieved great success in pattern recognition and machine learning. However, the success of an NN usually relies on the provision of a sufficiently large number of data samples as training data. When fed with a limited data set, an NN’s performance may be degraded significantly. In this paper, a novel NN structure, called a memory network, is proposed. It is inspired by the cognitive mechanism of human beings, which can learn effectively even from limited data. Taking advantage of the memory from previous samples, the new model achieves a remarkable improvement in performance when trained using limited data. The memory network is demonstrated here using the multi-layer perceptron (MLP) as a base model. However, it would be straightforward to extend the idea to other neural networks, e.g., convolutional neural networks (CNN). In this paper, the memory network structure is detailed, the training algorithm is presented, and a series of experiments is conducted to validate the proposed framework. Experimental results show that the proposed model outperforms traditional MLP-based models as well as other competitive algorithms on two real benchmark data sets.
KW - Memory
KW - Multi-layer perceptron
KW - Neural network
KW - Prior knowledge
KW - Recognition
UR - http://www.scopus.com/inward/record.url?scp=85032029937&partnerID=8YFLogxK
DO - 10.1007/s12559-017-9507-z
M3 - Article
AN - SCOPUS:85032029937
SN - 1866-9956
VL - 10
SP - 15
EP - 22
JO - Cognitive Computation
JF - Cognitive Computation
IS - 1
ER -