Joint multi-label attention networks for social text annotation

Hang Dong; Wei Wang; Kaizhu Huang; Frans Coenen

Joint multi-label attention networks for social text annotation

Hang Dong, Wei Wang, Kaizhu Huang, Frans Coenen

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

11 Citations (Scopus)

Abstract

We propose a novel attention network for document annotation with user-generated tags. The network is designed according to the human reading and annotation behaviour. Usually, users try to digest the title and obtain a rough idea about the topic first, and then read the content of the document. Present research shows that the title metadata could largely affect the social annotation. To better utilise this information, we design a framework that separates the title from the content of a document and apply a title-guided attention mechanism over each sentence in the content. We also propose two semantic-based loss regularisers that enforce the output of the network to conform to label semantics, i.e. similarity and subsumption. We analyse each part of the proposed system with two real-world open datasets on publication and question annotation. The integrated approach, Joint Multi-label Attention Network (JMAN), significantly outperformed the Bidirectional Gated Recurrent Unit (Bi-GRU) by around 13%-26% and the Hierarchical Attention Network (HAN) by around 4%-12% on both datasets, with around 10%-30% reduction of training time.

Original language	English
Title of host publication	Long and Short Papers
Publisher	Association for Computational Linguistics (ACL)
Pages	1348-1354
Number of pages	7
ISBN (Electronic)	9781950737130
Publication status	Published - 2019
Event	2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2019 - Minneapolis, United States Duration: 2 Jun 2019 → 7 Jun 2019

Publication series

Name	NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference
Volume	1

Conference

Conference	2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2019
Country/Territory	United States
City	Minneapolis
Period	2/06/19 → 7/06/19

Cite this

Dong, H., Wang, W., Huang, K., & Coenen, F. (2019). Joint multi-label attention networks for social text annotation. In Long and Short Papers (pp. 1348-1354). (NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference; Vol. 1). Association for Computational Linguistics (ACL).

Dong, Hang ; Wang, Wei ; Huang, Kaizhu et al. / Joint multi-label attention networks for social text annotation. Long and Short Papers. Association for Computational Linguistics (ACL), 2019. pp. 1348-1354 (NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference).

@inproceedings{80655004568b42628a50dcd1e2530ecb,

title = "Joint multi-label attention networks for social text annotation",

abstract = "We propose a novel attention network for document annotation with user-generated tags. The network is designed according to the human reading and annotation behaviour. Usually, users try to digest the title and obtain a rough idea about the topic first, and then read the content of the document. Present research shows that the title metadata could largely affect the social annotation. To better utilise this information, we design a framework that separates the title from the content of a document and apply a title-guided attention mechanism over each sentence in the content. We also propose two semantic-based loss regularisers that enforce the output of the network to conform to label semantics, i.e. similarity and subsumption. We analyse each part of the proposed system with two real-world open datasets on publication and question annotation. The integrated approach, Joint Multi-label Attention Network (JMAN), significantly outperformed the Bidirectional Gated Recurrent Unit (Bi-GRU) by around 13%-26% and the Hierarchical Attention Network (HAN) by around 4%-12% on both datasets, with around 10%-30% reduction of training time.",

author = "Hang Dong and Wei Wang and Kaizhu Huang and Frans Coenen",

note = "Publisher Copyright: {\textcopyright} 2019 Association for Computational Linguistics; 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2019 ; Conference date: 02-06-2019 Through 07-06-2019",

year = "2019",

language = "English",

series = "NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference",

publisher = "Association for Computational Linguistics (ACL)",

pages = "1348--1354",

booktitle = "Long and Short Papers",

}

Dong, H, Wang, W, Huang, K & Coenen, F 2019, Joint multi-label attention networks for social text annotation. in Long and Short Papers. NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, vol. 1, Association for Computational Linguistics (ACL), pp. 1348-1354, 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2019, Minneapolis, United States, 2/06/19.

Joint multi-label attention networks for social text annotation. / Dong, Hang; Wang, Wei; Huang, Kaizhu et al.
Long and Short Papers. Association for Computational Linguistics (ACL), 2019. p. 1348-1354 (NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference; Vol. 1).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Joint multi-label attention networks for social text annotation

AU - Dong, Hang

AU - Wang, Wei

AU - Huang, Kaizhu

AU - Coenen, Frans

PY - 2019

Y1 - 2019

N2 - We propose a novel attention network for document annotation with user-generated tags. The network is designed according to the human reading and annotation behaviour. Usually, users try to digest the title and obtain a rough idea about the topic first, and then read the content of the document. Present research shows that the title metadata could largely affect the social annotation. To better utilise this information, we design a framework that separates the title from the content of a document and apply a title-guided attention mechanism over each sentence in the content. We also propose two semantic-based loss regularisers that enforce the output of the network to conform to label semantics, i.e. similarity and subsumption. We analyse each part of the proposed system with two real-world open datasets on publication and question annotation. The integrated approach, Joint Multi-label Attention Network (JMAN), significantly outperformed the Bidirectional Gated Recurrent Unit (Bi-GRU) by around 13%-26% and the Hierarchical Attention Network (HAN) by around 4%-12% on both datasets, with around 10%-30% reduction of training time.

AB - We propose a novel attention network for document annotation with user-generated tags. The network is designed according to the human reading and annotation behaviour. Usually, users try to digest the title and obtain a rough idea about the topic first, and then read the content of the document. Present research shows that the title metadata could largely affect the social annotation. To better utilise this information, we design a framework that separates the title from the content of a document and apply a title-guided attention mechanism over each sentence in the content. We also propose two semantic-based loss regularisers that enforce the output of the network to conform to label semantics, i.e. similarity and subsumption. We analyse each part of the proposed system with two real-world open datasets on publication and question annotation. The integrated approach, Joint Multi-label Attention Network (JMAN), significantly outperformed the Bidirectional Gated Recurrent Unit (Bi-GRU) by around 13%-26% and the Hierarchical Attention Network (HAN) by around 4%-12% on both datasets, with around 10%-30% reduction of training time.

UR - http://www.scopus.com/inward/record.url?scp=85074639029&partnerID=8YFLogxK

M3 - Conference Proceeding

AN - SCOPUS:85074639029

T3 - NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference

SP - 1348

EP - 1354

BT - Long and Short Papers

PB - Association for Computational Linguistics (ACL)

T2 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2019

Y2 - 2 June 2019 through 7 June 2019

ER -

Dong H, Wang W, Huang K, Coenen F. Joint multi-label attention networks for social text annotation. In Long and Short Papers. Association for Computational Linguistics (ACL). 2019. p. 1348-1354. (NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference).

Joint multi-label attention networks for social text annotation

Abstract

Publication series

Conference

Other files and links

Cite this