Fedmed: A federated learning framework for language modeling

Xing Wu; Zhaowang Liang; Jianjia Wang

doi:10.3390/s20144048

Fedmed: A federated learning framework for language modeling

Xing Wu^*, Zhaowang Liang, Jianjia Wang

^*Corresponding author for this work

Shanghai University

Research output: Contribution to journal › Article › peer-review

50 Citations (Scopus)

Abstract

Federated learning (FL) is a privacy-preserving technique for training a vast amount of decentralized data and making inferences on mobile devices. As a typical language modeling problem, mobile keyboard prediction aims at suggesting a probable next word or phrase and facilitating the human-machine interaction in a virtual keyboard of the smartphone or laptop. Mobile keyboard prediction with FL hopes to satisfy the growing demand that high-level data privacy be preserved in artificial intelligence applications even with the distributed models training. However, there are two major problems in the federated optimization for the prediction: (1) aggregating model parameters on the server-side and (2) reducing communication costs caused by model weights collection. To address the above issues, traditional FL methods simply use averaging aggregation or ignore communication costs. We propose a novel Federated Mediation (FedMed) framework with the adaptive aggregation, mediation incentive scheme, and topK strategy to address the model aggregation and communication costs. The performance is evaluated in terms of perplexity and communication rounds. Experiments are conducted on three datasets (i.e., Penn Treebank, WikiText-2, and Yelp) and the results demonstrate that our FedMed framework achieves robust performance and outperforms baseline approaches.

Original language	English
Article number	4048
Pages (from-to)	1-17
Number of pages	17
Journal	Sensors (Switzerland)
Volume	20
Issue number	14
DOIs	https://doi.org/10.3390/s20144048
Publication status	Published - 2 Jul 2020
Externally published	Yes

Keywords

Communication efficiency
Federated learning
Language modeling
TopK ranking

Access to Document

10.3390/s20144048

Cite this

@article{42b15c758190467d80714a392cd45412,

title = "Fedmed: A federated learning framework for language modeling",

abstract = "Federated learning (FL) is a privacy-preserving technique for training a vast amount of decentralized data and making inferences on mobile devices. As a typical language modeling problem, mobile keyboard prediction aims at suggesting a probable next word or phrase and facilitating the human-machine interaction in a virtual keyboard of the smartphone or laptop. Mobile keyboard prediction with FL hopes to satisfy the growing demand that high-level data privacy be preserved in artificial intelligence applications even with the distributed models training. However, there are two major problems in the federated optimization for the prediction: (1) aggregating model parameters on the server-side and (2) reducing communication costs caused by model weights collection. To address the above issues, traditional FL methods simply use averaging aggregation or ignore communication costs. We propose a novel Federated Mediation (FedMed) framework with the adaptive aggregation, mediation incentive scheme, and topK strategy to address the model aggregation and communication costs. The performance is evaluated in terms of perplexity and communication rounds. Experiments are conducted on three datasets (i.e., Penn Treebank, WikiText-2, and Yelp) and the results demonstrate that our FedMed framework achieves robust performance and outperforms baseline approaches.",

keywords = "Communication efficiency, Federated learning, Language modeling, TopK ranking",

author = "Xing Wu and Zhaowang Liang and Jianjia Wang",

note = "Publisher Copyright: {\textcopyright} 2020 by the authors. Licensee MDPI, Basel, Switzerland.",

year = "2020",

month = jul,

day = "2",

doi = "10.3390/s20144048",

language = "English",

volume = "20",

pages = "1--17",

journal = "Sensors (Switzerland)",

issn = "1424-8220",

publisher = "MDPI (Basel, Switzerland) ",

number = "14",

}

TY - JOUR

T1 - Fedmed

T2 - A federated learning framework for language modeling

AU - Wu, Xing

AU - Liang, Zhaowang

AU - Wang, Jianjia

PY - 2020/7/2

Y1 - 2020/7/2

N2 - Federated learning (FL) is a privacy-preserving technique for training a vast amount of decentralized data and making inferences on mobile devices. As a typical language modeling problem, mobile keyboard prediction aims at suggesting a probable next word or phrase and facilitating the human-machine interaction in a virtual keyboard of the smartphone or laptop. Mobile keyboard prediction with FL hopes to satisfy the growing demand that high-level data privacy be preserved in artificial intelligence applications even with the distributed models training. However, there are two major problems in the federated optimization for the prediction: (1) aggregating model parameters on the server-side and (2) reducing communication costs caused by model weights collection. To address the above issues, traditional FL methods simply use averaging aggregation or ignore communication costs. We propose a novel Federated Mediation (FedMed) framework with the adaptive aggregation, mediation incentive scheme, and topK strategy to address the model aggregation and communication costs. The performance is evaluated in terms of perplexity and communication rounds. Experiments are conducted on three datasets (i.e., Penn Treebank, WikiText-2, and Yelp) and the results demonstrate that our FedMed framework achieves robust performance and outperforms baseline approaches.

AB - Federated learning (FL) is a privacy-preserving technique for training a vast amount of decentralized data and making inferences on mobile devices. As a typical language modeling problem, mobile keyboard prediction aims at suggesting a probable next word or phrase and facilitating the human-machine interaction in a virtual keyboard of the smartphone or laptop. Mobile keyboard prediction with FL hopes to satisfy the growing demand that high-level data privacy be preserved in artificial intelligence applications even with the distributed models training. However, there are two major problems in the federated optimization for the prediction: (1) aggregating model parameters on the server-side and (2) reducing communication costs caused by model weights collection. To address the above issues, traditional FL methods simply use averaging aggregation or ignore communication costs. We propose a novel Federated Mediation (FedMed) framework with the adaptive aggregation, mediation incentive scheme, and topK strategy to address the model aggregation and communication costs. The performance is evaluated in terms of perplexity and communication rounds. Experiments are conducted on three datasets (i.e., Penn Treebank, WikiText-2, and Yelp) and the results demonstrate that our FedMed framework achieves robust performance and outperforms baseline approaches.

KW - Communication efficiency

KW - Federated learning

KW - Language modeling

KW - TopK ranking

UR - http://www.scopus.com/inward/record.url?scp=85088243565&partnerID=8YFLogxK

U2 - 10.3390/s20144048

DO - 10.3390/s20144048

M3 - Article

C2 - 32708152

AN - SCOPUS:85088243565

SN - 1424-8220

VL - 20

SP - 1

EP - 17

JO - Sensors (Switzerland)

JF - Sensors (Switzerland)

IS - 14

M1 - 4048

ER -

Fedmed: A federated learning framework for language modeling

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this