TY - GEN
T1 - Enhanced Human-Robot Interaction for Robotic Container Unloading Systems Utilizing Digital Cousin and LLM
AU - Fang, Shihui
AU - Yue, Linfeng
AU - Lin, Yuqian
AU - Xu, Ziqi
AU - Lu, Jiawei
AU - Zhang, Quan
AU - Chen, Min
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Existing robotic container unloading systems lack the adaptability and semantic reasoning required for robust performance due to dynamic environments and unstructured cargo arrangements. This paper proposes a novel LLM and Digital Cousins-driven robotic automation system to address these limitations. By leveraging the DeepSeek-V3LLM (large language model), the system integrates contextual understanding, real-time decision-making, and adaptive grasping strategies to handle irregular or occluded objects. A hybrid architecture synergizes LLM-based high-level planning with low-level robotic control, while a synchronized digital cousin framework ensures real-time physical-virtual synchronization. The digital cousin technique constructs semantically consistent virtual scenes from single RGB images using a 3D asset library, enabling cost-effective, automated synthesis while preserving core geometric and functional attributes. Experimental results demonstrate significant improvements in operational efficiency and robustness compared to conventional methods. This work bridges AI-driven cognitive capabilities with industrial robotics, offering a scalable and intelligent solution for autonomous logistics.
AB - Existing robotic container unloading systems lack the adaptability and semantic reasoning required for robust performance due to dynamic environments and unstructured cargo arrangements. This paper proposes a novel LLM and Digital Cousins-driven robotic automation system to address these limitations. By leveraging the DeepSeek-V3LLM (large language model), the system integrates contextual understanding, real-time decision-making, and adaptive grasping strategies to handle irregular or occluded objects. A hybrid architecture synergizes LLM-based high-level planning with low-level robotic control, while a synchronized digital cousin framework ensures real-time physical-virtual synchronization. The digital cousin technique constructs semantically consistent virtual scenes from single RGB images using a 3D asset library, enabling cost-effective, automated synthesis while preserving core geometric and functional attributes. Experimental results demonstrate significant improvements in operational efficiency and robustness compared to conventional methods. This work bridges AI-driven cognitive capabilities with industrial robotics, offering a scalable and intelligent solution for autonomous logistics.
KW - Container unloading
KW - Digital cousin
KW - Human-robot interaction
KW - Large language models
KW - Robotic automation
UR - https://www.scopus.com/pages/publications/105015982160
U2 - 10.1109/CACRE66141.2025.11119567
DO - 10.1109/CACRE66141.2025.11119567
M3 - Conference Proceeding
AN - SCOPUS:105015982160
T3 - Proceedings - 2025 10th International Conference on Automation, Control and Robotics Engineering, CACRE 2025
SP - 91
EP - 95
BT - Proceedings - 2025 10th International Conference on Automation, Control and Robotics Engineering, CACRE 2025
A2 - Zhang, Fuming
A2 - Chai, Li
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 10th International Conference on Automation, Control and Robotics Engineering, CACRE 2025
Y2 - 16 July 2025 through 19 July 2025
ER -