|Table of Contents|

Detection of taxi driver's mask wearing based on improved YOLOv3 algorithm(PDF)

长安大学学报(自然科学版)[ISSN:1006-6977/CN:61-1281/TN]

Issue:
2023年6期
Page:
106-115
Research Field:
交通工程
Publishing date:

Info

Title:
Detection of taxi driver's mask wearing based on improved YOLOv3 algorithm
Author(s):
SUN Yong1 WEI Ze-fa1 CUI Hua2 SONG Huan-sheng3
(1. Educational Technology and Network Center, Chang'an University, Xi'an 710064, Shaanxi, China; 2. College of Future Transportation, Chang'an University, Xi'an 710064, Shaanxi, China; 3. School of Information Engineering, Chang'an University, Xi'an 710064, Shaanxi, China)
Keywords:
traffic engineering image processing depth separable convolution multi scale prediction mask detection
PACS:
U491.3
DOI:
10.19721/j.cnki.1671-8879.2023.06.010
Abstract:
In order to monitor the mask wearing of taxi drivers in working state more timely and accurately, a mask wearing detection method for taxi drivers based on YOLOv3 improved algorithm was proposed. Before the beginning of the experiment, using the video data collected from the previous work, a dataset containing 2 478 images of taxi drivers working inside was created. According to the characteristics of the dataset, three improvement strategies were adopted. First of all, the ordinary convolution in the main network was replaced by the deep separable convolution, the model compression and the increase of network structure depth were realized while reducing the number of model parameters. Then, in order to ensure the fusion effect of mask edge information in the multi-scale prediction process, the original three feature images was reduced to two in the multi-scale fusion. Finally, in order to maintain the number of anchor frames in the feature image fusion process, K-means algorithm was used to recalculate 8 initial anchor frame values, and four were allocated on each fusion feature image, through the above improvements, the algorithm can better adapt to self-built datasets. The results show that the accuracy of the improved YOLOV3 algorithm can be improved to 96.2%, and the model is compressed to 32 M. The processing speed is 43 frames per second in NVIDIA 1080Ti environment, which meets the real-time requirements. Therefore, it can be seen that the improved algorithm performs better and can be effectively used for mask wearing detection of taxi drivers.2 tabs, 10 figs, 23 refs.

References:

[1] LIU Y,GAYLE A A,WILDER-SMITH A,et al.The reproductive number of COVID-19 is higher compared to SARS coronavirus[J].Journal of Travel Medicine,2020,27(2):1-4.
[2]刘 全,翟建伟,章宗长,等.深度强化学习综述[J].计算机学报,2018,41(1)1-27.
LIU Quan,ZHAI Jian-wei,ZHANG Zong-chang,et al.A survey on deep reinforcement learning[J].Chinese Journal of Computers,2018,41(1)1-27.
[3]CHOKKADI S,INDIA N.A study on various state of the art of the art face recognition system using deep learning techniques[J].International Journal of Advanced Trends in Computer Science and Engineering,2019:1590-1600.
[4]李明熹,林正奎,曲 毅.计算机视觉下的车辆目标检测算法综述[J].计算机工程与应用,2019,55(24)20-28.
LI Ming-xi,LIN Zheng-kui,QU Yi.Survey of vehicle object detection algorithm in computer vision[J].Computer Engineering and Applications,2019,55(24)20-28.
[5]SUN Y,WANG X,TANG X.Deep convolutional network cascade for facial point detection[C]//IEEE.2013 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).New York:IEEE,2013:3476-3483.
[6]ZHANG K,ZHANG Z,LI Z,et al.Joint face detection and alignment using multitask cascaded convolutional networks[J].IEEE Signal Processing Letters,2016,23(10):1499-1503.
[7]WANG C Y,LIAO H Y,WU Y H,et al.CSPNet:A new backbone that can enhance learning capability of CNN[C]//IEEE.Proceedings of 2020 IEEE Conference on Computer Vision and Pattern Recognition Workshops(CVPRW).New York:IEEE,2020:1571-1580.
[8]LIU S,QI L,QIN H F,et al.Path aggregation network for instance segmentation[C]//IEEE.Proceedings of 2018 IEEE Conference on Computer Vision and Pattern Recognition.New York:IEEE,2018:8759-8768.
[9]WANG Y H,DING H W,LI B,et al.Mask wearing detection algorithm based on improved YOLOv3 in complex scenes[J].Computer Engineering,2020,46(11):12-22.
[10]牛作东,覃 涛,李捍东,等.改进RetinaFace的自然场景口罩佩戴检测算法[J].计算机工程与应用,2020,56(12):1-7.
NIU Zuo-dong,QIN Tao,LI Han-dong,et al.Improved algorithm of RetinaFace for natural scene mask wear detection[J].Computer Engineering and Applications,2020,56(12):1-7.
[11]曹城硕,袁 杰.基于YOLO-Mask算法的口罩佩戴检测方法[J].激光与光电子学进展,2021,58(8):203-210.
CAO Cheng-shuo,YUAN Jie.Mask-wearing detection method based on YOLO-Mask[J].Laser & Optoelectronics Progress,2021,58(8):203-210.
[12]邹佰翰,秦亚亭,苑晓兵,等.基于轻量级CNN的口罩人脸检测方法现状研究[J].软件,2020,41(8):186-188.
ZOU Bai-han,QIN Ya-ting,YUAN Xiao-bing,et al.Research on mask face detection based on lightweight CNN[J].Computer Engineering & Software,2020,41(8):186-188.
[13]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//IEEE.Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).New York:IEEE,2014:580-587.
[14]GIRSHICK R.Fast R-CNN[C]//IEEE.Proceedings of 2015 IEEE International Conference on Computer Vision(ICCV).New York:IEEE,2016:1440-1448.
[15]REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:Towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[16]REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:Unified,real-time object detection[C]//IEEE.Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).New York:IEEE,2016:779-788.
[17]REDMON J,FARHADI A.YOLO9000:Better,faster,stronger[C]//IEEE.Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).New York:IEEE,2017:6517-6525.
[18]IOFFE S,SZEGEDY C.Batch normalization:Accelerating deep network training by reducing internal covariate shift[C]//ACM.Proceedings of the 32nd International Conference on International Conference on Machine Learning.New York:ACM,2015:448-456.
[19]REDMON J,FARHADI A.YOLOv3:An incremental improvement[J].ArXiv:1804.02767.
[20]HOWARD A G,ZHU M,CHEN B,et al.MobileNets:Efficient convolutional neural networks for mobile vision applications[J].ArXiv:1704.04861.
[21]EVERINGHAM M,VAN GOOL L,WILLIAMS C K I,et al.The pascal visual object classes(VOC)challenge[J].International Journal of Computer Vision,2010,88(2):303-338.
[22]魏泽发.基于深度学习的出租车司机违规行为检测[D].西安:长安大学,2019.
WEI Ze-fa.Detection of taxi drivers' illegal behavior based on deep learning[D].Xi'an:Chang'an University,2019.
[23]JIA Y Q,SHELHAMER E,DONAHUE J,et al.Caffe:Convolutional architecture for fast feature embedding[C]//ACM.Proceedings of the 22nd ACM international conference on Multimedia.New York:ACM,2014:675-678.

Memo

Memo:
-
Last Update: 2023-10-30