Food Detection and Recognition with Deep Learning: A Comparative Study

Siao Wah Tan; Chin Poo Lee; Kian Ming Lim; Jit Yan Lim

doi:10.1109/ICoICT58202.2023.10262523

Food Detection and Recognition with Deep Learning: A Comparative Study

Siao Wah Tan^*, Chin Poo Lee, Kian Ming Lim, Jit Yan Lim

^*Corresponding author for this work

Multimedia University

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

4 Citations (Scopus)

Abstract

Food detection and recognition involves the use of computer vision and machine learning techniques to identify and classify food items in images or videos. It has numerous applications, such as dietary tracking, nutrition analysis, and inventory management. This research paper presents a comparative study of six deep learning models (SSD (VGG-16), Faster-RCNN (Resnet-50), Faster-RCNN (Mobilenet-V3), Faster-RCNN (Mobilenet-V3-320), RetinaNet (Resnet-50), and YOLOv5) for food detection and recognition. The models' performance is evaluated using three publicly available datasets: School Lunch Dataset, UEC FOOD 100, and UEC FOOD 256. Notably, Faster R-CNN (Mobilenet-V3) achieved mAP of 0.931 in the School Lunch Dataset, while YOLOv5 achieved 0.774 and 0.701 mAP in the UEC FOOD 100 and UEC FOOD 256 Datasets, respectively. YOLOv5 demonstrates comparable results to Faster R-CNN but with a smaller input image size and a larger batch size in food detection.

Original language	English
Title of host publication	2023 11th International Conference on Information and Communication Technology, ICoICT 2023
Pages	283-288
Number of pages	6
ISBN (Electronic)	9798350321982
DOIs	https://doi.org/10.1109/ICoICT58202.2023.10262523
Publication status	Published - 2023
Externally published	Yes
Event	11th International Conference on Information and Communication Technology, ICoICT 2023 - Melaka, Malaysia Duration: 23 Aug 2023 → 24 Aug 2023

Publication series

Name	2023 11th International Conference on Information and Communication Technology, ICoICT 2023
Volume	2023-August

Conference

Conference	11th International Conference on Information and Communication Technology, ICoICT 2023
Country/Territory	Malaysia
City	Melaka
Period	23/08/23 → 24/08/23

Keywords

Faster Region-Based Convolutional Neural Networks (Faster R-CNN)
Food detection
Object detection
YOLOv5

Access to Document

10.1109/ICoICT58202.2023.10262523

Cite this

Tan, S. W., Lee, C. P., Lim, K. M., & Lim, J. Y. (2023). Food Detection and Recognition with Deep Learning: A Comparative Study. In 2023 11th International Conference on Information and Communication Technology, ICoICT 2023 (pp. 283-288). (2023 11th International Conference on Information and Communication Technology, ICoICT 2023; Vol. 2023-August). https://doi.org/10.1109/ICoICT58202.2023.10262523

@inproceedings{d5488d033d0b492ebbb41d20c12cf84e,

title = "Food Detection and Recognition with Deep Learning: A Comparative Study",

abstract = "Food detection and recognition involves the use of computer vision and machine learning techniques to identify and classify food items in images or videos. It has numerous applications, such as dietary tracking, nutrition analysis, and inventory management. This research paper presents a comparative study of six deep learning models (SSD (VGG-16), Faster-RCNN (Resnet-50), Faster-RCNN (Mobilenet-V3), Faster-RCNN (Mobilenet-V3-320), RetinaNet (Resnet-50), and YOLOv5) for food detection and recognition. The models' performance is evaluated using three publicly available datasets: School Lunch Dataset, UEC FOOD 100, and UEC FOOD 256. Notably, Faster R-CNN (Mobilenet-V3) achieved mAP of 0.931 in the School Lunch Dataset, while YOLOv5 achieved 0.774 and 0.701 mAP in the UEC FOOD 100 and UEC FOOD 256 Datasets, respectively. YOLOv5 demonstrates comparable results to Faster R-CNN but with a smaller input image size and a larger batch size in food detection.",

keywords = "Faster Region-Based Convolutional Neural Networks (Faster R-CNN), Food detection, Object detection, YOLOv5",

author = "Tan, {Siao Wah} and Lee, {Chin Poo} and Lim, {Kian Ming} and Lim, {Jit Yan}",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 11th International Conference on Information and Communication Technology, ICoICT 2023 ; Conference date: 23-08-2023 Through 24-08-2023",

year = "2023",

doi = "10.1109/ICoICT58202.2023.10262523",

language = "English",

series = "2023 11th International Conference on Information and Communication Technology, ICoICT 2023",

pages = "283--288",

booktitle = "2023 11th International Conference on Information and Communication Technology, ICoICT 2023",

}

Tan, SW, Lee, CP, Lim, KM & Lim, JY 2023, Food Detection and Recognition with Deep Learning: A Comparative Study. in 2023 11th International Conference on Information and Communication Technology, ICoICT 2023. 2023 11th International Conference on Information and Communication Technology, ICoICT 2023, vol. 2023-August, pp. 283-288, 11th International Conference on Information and Communication Technology, ICoICT 2023, Melaka, Malaysia, 23/08/23. https://doi.org/10.1109/ICoICT58202.2023.10262523

Food Detection and Recognition with Deep Learning: A Comparative Study. / Tan, Siao Wah; Lee, Chin Poo; Lim, Kian Ming et al.
2023 11th International Conference on Information and Communication Technology, ICoICT 2023. 2023. p. 283-288 (2023 11th International Conference on Information and Communication Technology, ICoICT 2023; Vol. 2023-August).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Food Detection and Recognition with Deep Learning: A Comparative Study

AU - Tan, Siao Wah

AU - Lee, Chin Poo

AU - Lim, Kian Ming

AU - Lim, Jit Yan

PY - 2023

Y1 - 2023

N2 - Food detection and recognition involves the use of computer vision and machine learning techniques to identify and classify food items in images or videos. It has numerous applications, such as dietary tracking, nutrition analysis, and inventory management. This research paper presents a comparative study of six deep learning models (SSD (VGG-16), Faster-RCNN (Resnet-50), Faster-RCNN (Mobilenet-V3), Faster-RCNN (Mobilenet-V3-320), RetinaNet (Resnet-50), and YOLOv5) for food detection and recognition. The models' performance is evaluated using three publicly available datasets: School Lunch Dataset, UEC FOOD 100, and UEC FOOD 256. Notably, Faster R-CNN (Mobilenet-V3) achieved mAP of 0.931 in the School Lunch Dataset, while YOLOv5 achieved 0.774 and 0.701 mAP in the UEC FOOD 100 and UEC FOOD 256 Datasets, respectively. YOLOv5 demonstrates comparable results to Faster R-CNN but with a smaller input image size and a larger batch size in food detection.

AB - Food detection and recognition involves the use of computer vision and machine learning techniques to identify and classify food items in images or videos. It has numerous applications, such as dietary tracking, nutrition analysis, and inventory management. This research paper presents a comparative study of six deep learning models (SSD (VGG-16), Faster-RCNN (Resnet-50), Faster-RCNN (Mobilenet-V3), Faster-RCNN (Mobilenet-V3-320), RetinaNet (Resnet-50), and YOLOv5) for food detection and recognition. The models' performance is evaluated using three publicly available datasets: School Lunch Dataset, UEC FOOD 100, and UEC FOOD 256. Notably, Faster R-CNN (Mobilenet-V3) achieved mAP of 0.931 in the School Lunch Dataset, while YOLOv5 achieved 0.774 and 0.701 mAP in the UEC FOOD 100 and UEC FOOD 256 Datasets, respectively. YOLOv5 demonstrates comparable results to Faster R-CNN but with a smaller input image size and a larger batch size in food detection.

KW - Faster Region-Based Convolutional Neural Networks (Faster R-CNN)

KW - Food detection

KW - Object detection

KW - YOLOv5

UR - http://www.scopus.com/inward/record.url?scp=85174394864&partnerID=8YFLogxK

U2 - 10.1109/ICoICT58202.2023.10262523

DO - 10.1109/ICoICT58202.2023.10262523

M3 - Conference Proceeding

AN - SCOPUS:85174394864

T3 - 2023 11th International Conference on Information and Communication Technology, ICoICT 2023

SP - 283

EP - 288

BT - 2023 11th International Conference on Information and Communication Technology, ICoICT 2023

T2 - 11th International Conference on Information and Communication Technology, ICoICT 2023

Y2 - 23 August 2023 through 24 August 2023

ER -

Food Detection and Recognition with Deep Learning: A Comparative Study

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this