APSIPA Transactions on Signal and Information Processing > Vol 12 > Issue 1

Novel Personal Protective Equipment Detection Technique with Attention-based YOLOv7 and Human Pose Estimation

Krishadawut Olde Monnikhof, Department of Electronics and Telecommunication Engineering, King Mongkut’s University of Technology Thonburi, Thailand, Punyapat Areerob, Department of Electronics and Telecommunication Engineering, King Mongkut’s University of Technology Thonburi, Thailand, Zheng Wu, Panasonic R&D Center, Singapore, Takrit Tanasnitikul, Panasonic R&D Center, Singapore, Wuttipong Kumwilaisak, Department of Electronics and Telecommunication Engineering, King Mongkut’s University of Technology Thonburi, Thailand, wuttipong.kum@kmutt.ac.th
 
Suggested Citation
Krishadawut Olde Monnikhof, Punyapat Areerob, Zheng Wu, Takrit Tanasnitikul and Wuttipong Kumwilaisak (2023), "Novel Personal Protective Equipment Detection Technique with Attention-based YOLOv7 and Human Pose Estimation", APSIPA Transactions on Signal and Information Processing: Vol. 12: No. 1, e29. http://dx.doi.org/10.1561/116.00000119

Publication Date: 26 Jun 2023
© 2023 K. O. Monnikhof, P. Areerob, Z. Wu, T. Tanasnitikul and W. Kumwilaisak
 
Subjects
 
Keywords
PPE detectionattention-based YOLOv7human pose estimationfalse positive removal
 

Share

Open Access

This is published under the terms of CC BY-NC.

Downloaded: 959 times

In this article:
Introduction 
Personal Protective Equipment (PPE) Wearing Detection Method with YOLOv7 and Attention Mechanism 
Experimental Result 
Conclusions 
References 

Abstract

This paper proposes a new PPE detection method based on the attention-based YOLOv7 and human pose estimation. The proposed attention module consists of the concatenation of CBAM (Convolutional Block Attention Module) and the SE (Squeeze-and-Excitation) block. This attention module is placed immediately before the detection layer of the YOLOv7 architecture. CBAM derives spatial and channel attention of extracted features from a YOLOv7 backbone. The attention weights prioritize the relevant features in both the spatial and channel domains to be utilized for PPE detection. The SE block refines the attention weights obtained from CBAM before feeding weighted features to the detection layer. Human pose estimation based on YOLO-pose is employed to remove some false positives of PPE detection. The proposed method detects human body parts and assigns key points to human body parts. The essential reference points are computed from the derived vital points. The detection targets far from the reference points will be regarded as false positives and removed. From experimental results, our proposed PPE detection can increase mAP by up to 8.5% at threshold 0.5, 8.8% at threshold 0.5 to 0.95, and reduce false positive detection by 22% on deployment when compared to the original YOLOv7 model.

DOI:10.1561/116.00000119