Audio Captioning Based on Transformer and Pre-Trained CNN

  • Kun Chen
  • Yusong Wu
  • Ziyue Wang
  • Xuan Zhang
  • Fudong Nian
  • Shengchen Li
  • Xi Shao
  • Beijing University of Posts and Telecommunications
  • Nanjing University of Posts and Telecommunications
  • Anhui University
  • Tencent

Research output: Chapter in Book or Report/Conference proceeding; Conference Proceeding; peer-reviewed

Original language: English
Title of host publication: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop (DCASE2020)
Pages: 21-25
Publication status: Published - 11 Feb 2020
