Open Access Open Access  Restricted Access Subscription or Fee Access

Object Detection Using Deep Learning Algorithm

Kilari Veera Swamy, Jayanth Sai Marisetty, Srujan Vare


Object detection is a computer technology related to computer vision and image processing that deals with detecting images of semantic objects of a certain class (for example cars, buildings, etc.) in digital images and videos. To detect objects, there are many algorithms available, in this paper YOLO algorithm is explored for detection of objects on real-time data. You Only Look Once (YOLO) is a state-of-the-art real time object detection system whose architecture is based on Convolutional Neural Networks (CNN) and anchor boxes. In this work, YOLO V3 is considered. When it comes to the implementation, the input image is being divided into grids in which for each cell classification and localization is applied. After the localization, the algorithm draws bounding boxes for the detected objects. As in real time images there might be a probability of having objects present in more than one grid cell hence, we also use Non max suppression to detect the object more accurately. We also import OpenCV, NumPy libraries in PyCharm Environment for the better optimization of the output. In this paper, to detect the objects in an image using this YOLO algorithm is also tested apart from videos.

Full Text:



B. Hou, J. Li, X. Zhang, S. Wang and L. Jiao, “Object Detection and Tracking Based on Convolutional Neural Networks for High-Resolution Optical Remote Sensing Video,” IGARSS 2019-2019 IEEE International Geoscience and Remote Sensing Symposium, 2019, pp. 5433-5436, doi: 10.1109/IGARSS.2019.8898173.

C. Li, Y. Zhang and Y. Qu, “Object detection based on deep learning of small samples,” 2018 Tenth International Conference on Advanced Computational Intelligence (ICACI), 2018, pp. 449-454, doi: 10.1109/ICACI.2018.8377501.

H. Nakahara, H. Yonekawa and S. Sato, “An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA,” 2017 International Conference on Field Programmable Technology (ICFPT), 2017, pp. 168-175, doi: 10.1109/FPT.2017.8280135.

H. Qian, J. Xu and J. Zhou, “Object Detection Using Deep Convolutional Neural Networks,” 2018 Chinese Automation Congress (CAC), 2018, pp. 1151-1156, doi: 10.1109/CAC.2018.8623808.

J. Lee, J. Bang and S. Yang, “Object detection with sliding window in images including multiple similar objects,” 2017 International Conference on Information and Communication Technology Convergence (ICTC), 2017, pp. 803-806, doi: 10.1109/ICTC.2017.8190786.

J. Redmon, S. Divvala, R. Girshick and A. Farhadi, “You Only Look Once: Unified, Real-Time Object Detection,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 779-788, doi: 10.1109/CVPR.2016.91.

K. L. Masita, A. N. Hasan and T. Shongwe, “Deep Learning in Object Detection: a Review,” 2020 International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems (icABCD), 2020, pp. 1-11, doi: 10.1109/icABCD49160.2020.9183866.

S. Chen, J. Hong, T. Zhang, J. Li and Y. Guan, “Object Detection Using Deep Learning: Single Shot Detector with a Refined Feature-fusion Structure,” 2019 IEEE International Conference on Real-time Computing and Robotics (RCAR), 2019, pp. 219-224, doi: 10.1109/RCAR47638.2019. 9044027.

S. Hsu, C. Huang and C. Chuang, “Vehicle detection using simplified fast R-CNN,” 2018 International Workshop on Advanced Image Technology (IWAIT), 2018, pp. 1-3, doi: 10.1109/IWAIT.2018.8369767.

X. Zhou, W. Gong, W. Fu and F. Du, “Application of deep learning in object detection,” 2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS), 2017, pp. 631-634, doi: 10.1109/ICIS.2017.7960069.

Y. Lee, C. Lee, H. Lee and J. Kim, “Fast Detection of Objects Using a YOLOv3 Network for a Vending Machine,” 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS), 2019, pp. 132-136, doi: 10.1109/AICAS.2019.8771517.

Y. Qian, H. Zheng, D. He, Z. Zhang and Z. Zhang, “R-CNN Object Detection Inference With Deep Learning Accelerator,” 2018 IEEE/CIC International Conference on Communications in China (ICCC Workshops), 2018, pp. 297-302, doi: 10.1109/ICCChinaW.2018.8674519.


  • There are currently no refbacks.

Copyright (c) 2021 Journal of Control & Instrumentation

eISSN: 2229-6972