Abstract
Human-Object Interaction (HOI) detection is a critical topic in the visual understanding field. With the development of deep learning models, the research of HOI detection has been profoundly reshaped. Deep convolutional neural networks increased the object recognition accuracy of static images and induced a detection-based HOI detection stream. The detection-based models resolve the HOI detection problem from a classification perspective. Another stream of HOI detection methods seeks a deeper understanding of the information shown in images, and they are named HOI understanding methods in this survey paper. HOI understanding methods usually acquire external linguistic data to enable the deep models to learn more about the images. Additionally, some of the HOI understanding methods exploit graph neural networks (GNN) to increase the inference accuracy of the model.
Original language | English |
---|---|
Article number | 126243 |
Journal | Neurocomputing |
Volume | 543 |
DOIs | |
Publication status | Published - 28 Jul 2023 |
Keywords
- Computer vision
- Convolutional neural networks
- Human-object interaction
- Object detection
- Representation learning
- Transformers