TY - GEN
T1 - Reinforcement Learning based Underwater Structural Pole Inspection
AU - Tan, Chee Sheng
AU - Mohd-Mokhtar, Rosmiwati
AU - Arshad, Mohd Rizal
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - The most challenging problem in inspection planning is the structural coverage in an environment with obstacles. This paper presents a coverage path planning framework based on reinforcement learning using an autonomous underwater vehicle (AUV). This approach exploits the knowledge from the model and generates an optimal path to move from the initial position to the nearest area of interest (AOI). Then, it starts to perform a sweep of the exterior boundary of a three-dimensional (3D) structure in the workspace, including concerning the complete coverage of the given AOI and avoiding obstacles. In this model, a non-linear action selection strategy is used to provide a meaningful exploration, contributing to more stability in the learning process. A reward function is designed by taking into consideration multiple objectives to satisfy the sub-goal requirements. The simulation result indicates the effectiveness of the approach in planning the inspection path. The AUV behaves as a boustrophedon motion when covering the AOI and can achieve maximum cumulative reward while reaching the learning goal.
AB - The most challenging problem in inspection planning is the structural coverage in an environment with obstacles. This paper presents a coverage path planning framework based on reinforcement learning using an autonomous underwater vehicle (AUV). This approach exploits the knowledge from the model and generates an optimal path to move from the initial position to the nearest area of interest (AOI). Then, it starts to perform a sweep of the exterior boundary of a three-dimensional (3D) structure in the workspace, including concerning the complete coverage of the given AOI and avoiding obstacles. In this model, a non-linear action selection strategy is used to provide a meaningful exploration, contributing to more stability in the learning process. A reward function is designed by taking into consideration multiple objectives to satisfy the sub-goal requirements. The simulation result indicates the effectiveness of the approach in planning the inspection path. The AUV behaves as a boustrophedon motion when covering the AOI and can achieve maximum cumulative reward while reaching the learning goal.
KW - coverage path planning
KW - reinforcement learning
KW - underwater inspection
UR - http://www.scopus.com/inward/record.url?scp=85151931123&partnerID=8YFLogxK
U2 - 10.1109/USYS56283.2022.10072827
DO - 10.1109/USYS56283.2022.10072827
M3 - Conference Proceeding
AN - SCOPUS:85151931123
T3 - 2022 IEEE 9th International Conference on Underwater System Technology: Theory and Applications, USYS 2022
BT - 2022 IEEE 9th International Conference on Underwater System Technology
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th IEEE International Conference on Underwater System Technology: Theory and Applications, USYS 2022
Y2 - 5 December 2022 through 6 December 2022
ER -