TY - GEN
T1 - Fast Depth Estimation of Object via Neural Network Perspective Projection
AU - Han, Yu
AU - Chen, Yaran
AU - Li, Haoran
AU - Ma, Mingjun
AU - Zhao, Dongbin
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - In autonomous driving and mobile robotic systems, obtaining the depths of objects in real time is crucial. Current network-based methods usually design complex networks to achieve 3D object detection or monocular depth estimation for the whole image, making them too slow to be applied to mobile robots. The perspective projection-based method can run in real time, calculating object depth from the camera parameters and the object sizes in world coordinates and image coordinates. However, it relies heavily on the accuracy of the object size in image coordinates, which is usually obtained with errors through a detector network. Combining perspective projection-based and network-based methods, we propose a fast object depth estimation method, called Fast-Depth-NPP, that designs a neural network to learn perspective projection: 1) instead of considering the whole image, we only estimate local depth; 2) using local image patches as network inputs avoids measurement errors of object size from the detector; 3) the use of global information is enhanced by incorporating position encoding. Our method is validated on the public mobile-robot Neurons Perception dataset, achieving excellent results and meeting real-time requirements.
AB - In autonomous driving and mobile robotic systems, obtaining the depths of objects in real time is crucial. Current network-based methods usually design complex networks to achieve 3D object detection or monocular depth estimation for the whole image, making them too slow to be applied to mobile robots. The perspective projection-based method can run in real time, calculating object depth from the camera parameters and the object sizes in world coordinates and image coordinates. However, it relies heavily on the accuracy of the object size in image coordinates, which is usually obtained with errors through a detector network. Combining perspective projection-based and network-based methods, we propose a fast object depth estimation method, called Fast-Depth-NPP, that designs a neural network to learn perspective projection: 1) instead of considering the whole image, we only estimate local depth; 2) using local image patches as network inputs avoids measurement errors of object size from the detector; 3) the use of global information is enhanced by incorporating position encoding. Our method is validated on the public mobile-robot Neurons Perception dataset, achieving excellent results and meeting real-time requirements.
KW - Convolutional Neural Network
KW - Depth Estimation
KW - Object Detection
UR - http://www.scopus.com/inward/record.url?scp=85137750834&partnerID=8YFLogxK
U2 - 10.1109/DDCLS55054.2022.9858358
DO - 10.1109/DDCLS55054.2022.9858358
M3 - Conference Proceeding
AN - SCOPUS:85137750834
T3 - Proceedings of 2022 IEEE 11th Data Driven Control and Learning Systems Conference, DDCLS 2022
SP - 788
EP - 794
BT - Proceedings of 2022 IEEE 11th Data Driven Control and Learning Systems Conference, DDCLS 2022
A2 - Sun, Mingxuan
A2 - Chen, Zengqiang
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th IEEE Data Driven Control and Learning Systems Conference, DDCLS 2022
Y2 - 3 August 2022 through 5 August 2022
ER -