Software Review: Empowering Language Education with D-ID Creative Reality Studio’s Multimodal Capabilities

Chenghao Wang*, Xueyun Li

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

D-ID Creative Reality Studio (D-ID) is a platform for creating Artificial Intelligence (AI) presenter (digital human) videos, translating videos, and designing conversational agents. D-ID seamlessly integrates deep-learning face animation technology, large language models (LLMs), natural language processing (NLP), and speech synthesis and recognition (SSR), offering new possibilities for immersive language teaching and learning. As highlighted in the slogan, “Amaze your audience with your art” (D-ID, 2024), both language learners and teachers can utilise D-ID to create personalised multimodal digital learning resources. Aligning with the interaction hypothesis (Long, 1996), a conversational agent can serve as a language partner, offering accurate linguistic input and facilitating conversational practice anytime and anywhere. Additionally, the incorporation of a digital human is consistent with the Cognitive Theory of Multimedia Learning (Mayer & Moreno, 2003), which asserts that dual-channel input—integrating visual and auditory modalities—enhances memory retention and overall learning effectiveness. This software review centres on the web-based version of D-ID and provides a detailed analysis of the main features that can contribute to second language acquisition.
Original languageEnglish
JournalInternational Journal of Computer-Assisted Language Learning and Teaching
Volume15
Issue number1
DOIs
Publication statusPublished - Jan 2025

Fingerprint

Dive into the research topics of 'Software Review: Empowering Language Education with D-ID Creative Reality Studio’s Multimodal Capabilities'. Together they form a unique fingerprint.

Cite this