Memory-augmented Neural Machine Translation

Yang Feng; Shiyue Zhang; Andi Zhang; Dong Wang; Andrew Abel

doi:10.18653/v1/d17-1146

Memory-augmented Neural Machine Translation

Yang Feng, Shiyue Zhang, Andi Zhang, Dong Wang^*, Andrew Abel

^*Corresponding author for this work

Department of Computing

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

41 Citations (Scopus)

Abstract

Neural machine translation (NMT) has achieved notable success in recent times, however it is also widely recognized that this approach has limitations with handling infrequent words and word pairs. This paper presents a novel memory-augmented NMT (M-NMT) architecture, which stores knowledge about how words (usually infrequently encountered ones) should be translated in a memory and then utilizes them to assist the neural model. We use this memory mechanism to combine the knowledge learned from a conventional statistical machine translation system and the rules learned by an NMT system, and also propose a solution for out-of-vocabulary (OOV) words based on this framework. Our experiments on two Chinese-English translation tasks demonstrated that the M-NMT architecture outperformed the NMT baseline by 9.0 and 2.7 BLEU points on the two tasks, respectively. Additionally, we found this architecture resulted in a much more effective OOV treatment compared to competitive methods.

Original language	English
Title of host publication	EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings
Publisher	Association for Computational Linguistics (ACL)
Pages	1390-1399
Number of pages	10
ISBN (Electronic)	9781945626838
DOIs	https://doi.org/10.18653/v1/d17-1146
Publication status	Published - 2017
Event	2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017 - Copenhagen, Denmark Duration: 9 Sept 2017 → 11 Sept 2017

Publication series

Name	EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings

Conference

Conference	2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017
Country/Territory	Denmark
City	Copenhagen
Period	9/09/17 → 11/09/17

Access to Document

10.18653/v1/d17-1146

Cite this

Feng, Y., Zhang, S., Zhang, A., Wang, D., & Abel, A. (2017). Memory-augmented Neural Machine Translation. In EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 1390-1399). (EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d17-1146

@inproceedings{fd66507b1cf143b39bcdde8967864da3,

title = "Memory-augmented Neural Machine Translation",

abstract = "Neural machine translation (NMT) has achieved notable success in recent times, however it is also widely recognized that this approach has limitations with handling infrequent words and word pairs. This paper presents a novel memory-augmented NMT (M-NMT) architecture, which stores knowledge about how words (usually infrequently encountered ones) should be translated in a memory and then utilizes them to assist the neural model. We use this memory mechanism to combine the knowledge learned from a conventional statistical machine translation system and the rules learned by an NMT system, and also propose a solution for out-of-vocabulary (OOV) words based on this framework. Our experiments on two Chinese-English translation tasks demonstrated that the M-NMT architecture outperformed the NMT baseline by 9.0 and 2.7 BLEU points on the two tasks, respectively. Additionally, we found this architecture resulted in a much more effective OOV treatment compared to competitive methods.",

author = "Yang Feng and Shiyue Zhang and Andi Zhang and Dong Wang and Andrew Abel",

note = "Publisher Copyright: {\textcopyright} 2017 Association for Computational Linguistics.; 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017 ; Conference date: 09-09-2017 Through 11-09-2017",

year = "2017",

doi = "10.18653/v1/d17-1146",

language = "English",

series = "EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings",

publisher = "Association for Computational Linguistics (ACL)",

pages = "1390--1399",

booktitle = "EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings",

}

Feng, Y, Zhang, S, Zhang, A, Wang, D & Abel, A 2017, Memory-augmented Neural Machine Translation. in EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings. EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings, Association for Computational Linguistics (ACL), pp. 1390-1399, 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, Copenhagen, Denmark, 9/09/17. https://doi.org/10.18653/v1/d17-1146

Memory-augmented Neural Machine Translation. / Feng, Yang; Zhang, Shiyue; Zhang, Andi et al.
EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings. Association for Computational Linguistics (ACL), 2017. p. 1390-1399 (EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Memory-augmented Neural Machine Translation

AU - Feng, Yang

AU - Zhang, Shiyue

AU - Zhang, Andi

AU - Wang, Dong

AU - Abel, Andrew

PY - 2017

Y1 - 2017

N2 - Neural machine translation (NMT) has achieved notable success in recent times, however it is also widely recognized that this approach has limitations with handling infrequent words and word pairs. This paper presents a novel memory-augmented NMT (M-NMT) architecture, which stores knowledge about how words (usually infrequently encountered ones) should be translated in a memory and then utilizes them to assist the neural model. We use this memory mechanism to combine the knowledge learned from a conventional statistical machine translation system and the rules learned by an NMT system, and also propose a solution for out-of-vocabulary (OOV) words based on this framework. Our experiments on two Chinese-English translation tasks demonstrated that the M-NMT architecture outperformed the NMT baseline by 9.0 and 2.7 BLEU points on the two tasks, respectively. Additionally, we found this architecture resulted in a much more effective OOV treatment compared to competitive methods.

AB - Neural machine translation (NMT) has achieved notable success in recent times, however it is also widely recognized that this approach has limitations with handling infrequent words and word pairs. This paper presents a novel memory-augmented NMT (M-NMT) architecture, which stores knowledge about how words (usually infrequently encountered ones) should be translated in a memory and then utilizes them to assist the neural model. We use this memory mechanism to combine the knowledge learned from a conventional statistical machine translation system and the rules learned by an NMT system, and also propose a solution for out-of-vocabulary (OOV) words based on this framework. Our experiments on two Chinese-English translation tasks demonstrated that the M-NMT architecture outperformed the NMT baseline by 9.0 and 2.7 BLEU points on the two tasks, respectively. Additionally, we found this architecture resulted in a much more effective OOV treatment compared to competitive methods.

UR - http://www.scopus.com/inward/record.url?scp=85073170444&partnerID=8YFLogxK

U2 - 10.18653/v1/d17-1146

DO - 10.18653/v1/d17-1146

M3 - Conference Proceeding

AN - SCOPUS:85073170444

T3 - EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings

SP - 1390

EP - 1399

BT - EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings

PB - Association for Computational Linguistics (ACL)

T2 - 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017

Y2 - 9 September 2017 through 11 September 2017

ER -

Memory-augmented Neural Machine Translation

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this