Skip to main navigation Skip to search Skip to main content

Text-speech collaboration LLM embedding low-rank adaptation, activation-aware weight quantization and knowledge distillation

  • Xi'an Jiaotong-Liverpool University
  • Apon Medical

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

Large Language Model, or LLM, are enabling more and more real world applications. To fuse a rich variety of information sources into LLM provides powerful solutions. This paper designs a text and speech fusion framework on LLM. It embeds LoRA fine-tuning, AWQ quantization, and knowledge distillation to improve the output models' performance. The proposed framework is evaluated on emotional applications and medical applications and compared with comprehensive large language models.

Original languageEnglish
Title of host publicationProceedings - 2025 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages231-239
Number of pages9
ISBN (Electronic)9798331559762
DOIs
Publication statusPublished - 2025
Event17th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2025 - Taiyuan, China
Duration: 18 Oct 202519 Oct 2025

Conference

Conference17th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2025
Country/TerritoryChina
CityTaiyuan
Period18/10/2519/10/25

Cite this