Abstract
utilize multimodal data to predict the intensity of three emotional categories. In our work, we discovered that integrating multiple dimensions, modalities, and levels enhances the effectiveness of emotional judgment. In terms of feature extraction, we utilize over a dozen types of medium backbone networks, including W2V-MSP, GLM, and FAU, which are representative of audio, text, and video modalities, respectively. Additionally, we utilize the LoRA framework and employ various domain adaptation methods to effectively adapt to the task at hand. Regarding model design, apart from the RNN model in the baseline, we have extensively incorporated our transformer variant and multi-modal fusion model. Finally, we propose a Hyper-parameter Search Strategy (HPSS) for late fusion to further enhance the effectiveness of the fusion model. For the MuSe-MIMIC, our method achieves Pearson's Correlation Coefficient of 0.7753, 0.7647, and 0.6653 for Approval, Disappointment, and Uncertainty, respectively, outperforming the baseline system by a large margin (i.e., 0.5536, 0.5139, and 0.3395) on the test set. The final mean pearson is 0.7351, surpassing all other participants and ranking Top 1.
Original language | English |
---|---|
Title of host publication | MuSe 2023 - Proceedings of the 4th Multimodal Sentiment Analysis Challenge and Workshop |
Subtitle of host publication | Mimicked Emotions, Humour and Personalisation, Co-located with: MM 2023 |
Publisher | Association for Computing Machinery, Inc |
Pages | 19-26 |
Number of pages | 8 |
ISBN (Electronic) | 9798400702709 |
DOIs | |
Publication status | Published - 2 Nov 2023 |
Externally published | Yes |
Event | 4th Multimodal Sentiment Analysis Challenge and Workshop, MuSe 2023, In conjunction with 31st ACM Multimedia 2023 - Ottawa, Canada Duration: 29 Oct 2023 → … |
Conference
Conference | 4th Multimodal Sentiment Analysis Challenge and Workshop, MuSe 2023, In conjunction with 31st ACM Multimedia 2023 |
---|---|
Country/Territory | Canada |
City | Ottawa |
Period | 29/10/23 → … |
Keywords
- hyper-parameter search strategy
- multimodal fusion
- multimodal sentiment analysis