Abstract
Large Language Model, or LLM, are enabling more and more real world applications. To fuse a rich variety of information sources into LLM provides powerful solutions. This paper designs a text and speech fusion framework on LLM. It embeds LoRA fine-tuning, AWQ quantization, and knowledge distillation to improve the output models' performance. The proposed framework is evaluated on emotional applications and medical applications and compared with comprehensive large language models.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 231-239 |
| Number of pages | 9 |
| ISBN (Electronic) | 9798331559762 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 17th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2025 - Taiyuan, China Duration: 18 Oct 2025 → 19 Oct 2025 |
Conference
| Conference | 17th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2025 |
|---|---|
| Country/Territory | China |
| City | Taiyuan |
| Period | 18/10/25 → 19/10/25 |
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver