IEEE ACCESS, vol. 10, pp. 14076-14119, 2022 (SCI-Expanded)
Nowadays, Convolutional Neural Networks (CNNs) are employed across a wide range of industrial technologies in sectors such as medicine, automotive, aviation, agriculture, and space. This paper reviews the state of the art in CNNs for image classification and object detection together with Autonomous Driving Systems (ADSs) in a synergetic way. Layer-based details of CNNs, along with calculations of parameter counts and floating-point operation (FLOP) counts, are outlined. Following an evolutionary approach, most of the outstanding image-classification CNNs published in the open literature are introduced, with a focus on their accuracy, parameter count, model size, and inference speed, highlighting the progressive developments in convolutional operations.

Results of a novel investigation of the convolution types and operations commonly used in CNNs are presented, including a timing analysis that assesses their impact on CNN performance. This extensive experimental study provides new insight into the behaviour of each convolution type in terms of training time, inference time, and layer-level decomposition. Building blocks of CNN-based object detection, such as backbone networks and baseline types, are also discussed, and representative state-of-the-art designs are outlined, with experimental results from the literature summarised for each reviewed model.

This is followed by an overview of recent ADS-related work and current industry activities, aiming to bridge academic research and industry practice on CNNs and ADSs. Design approaches targeted at the problems automakers face in achieving real-time implementations are also proposed, based on a discussion of design constraints, human-vs-machine evaluations, and a trade-off analysis of accuracy vs. model size.
Current technologies, promising directions, and expectations from the ADS literature are introduced, including a comprehensive trade-off analysis from a human-machine perspective.
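The layer-based parameter and FLOP calculations mentioned in the abstract can be illustrated with a minimal sketch. The functions below are hypothetical helpers, not code from the paper; they count the parameters and multiply-accumulate operations of a standard 2-D convolution, and contrast it with a depthwise-separable factorisation of the kind used in efficient CNN designs (e.g. MobileNet-style architectures):

```python
def conv2d_params(c_in, c_out, k, bias=True):
    """Parameter count of a standard k x k convolution: one k x k x c_in
    kernel (plus optional bias) per output channel."""
    return c_out * (c_in * k * k + (1 if bias else 0))

def conv2d_macs(c_in, c_out, k, h_out, w_out):
    """Multiply-accumulate count of a standard convolution over an
    h_out x w_out output map (often doubled when reported as FLOPs)."""
    return c_out * h_out * w_out * c_in * k * k

def depthwise_separable_params(c_in, c_out, k, bias=True):
    """Depthwise k x k convolution (one kernel per input channel)
    followed by a pointwise 1 x 1 convolution."""
    depthwise = c_in * (k * k + (1 if bias else 0))
    pointwise = c_out * (c_in + (1 if bias else 0))
    return depthwise + pointwise

# Example: 3 x 3 convolution, 64 -> 128 channels, 56 x 56 output map.
print(conv2d_params(64, 128, 3))              # 73856
print(depthwise_separable_params(64, 128, 3)) # 8960
print(conv2d_macs(64, 128, 3, 56, 56))        # 231211008
```

Under these assumptions, the separable factorisation needs roughly one eighth of the parameters of the standard convolution for this layer, which is the kind of accuracy-vs-size trade-off the survey examines.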