Comparative Analysis and Testing of Deep Learning Neural Network Models for Road Sign Recognition

Olga A. Kovaleva; Ковалева Ольга Александровна; Mikhail I. Deev; Деев Михаил Игоревич; Sergey V. Kovalev; Ковалев Сергей Владимирович; Ilya A. Zabrodsky; Забродский Илья Алексеевич

doi:10.25686/2306-2819.2024.1.60

Comparative Analysis and Testing of Deep Learning Neural Network Models for Road Sign Recognition

Authors: Kovaleva O.A.¹, Deev M.I.¹, Kovalev S.V.¹, Zabrodsky I.A.¹
Affiliations:
1. Tambov State University named after G.R. Derzhavin
Issue: No 1 (2024)
Pages: 60-69
Section: Computer engineering and informatics
URL: https://journal-vniispk.ru/2306-2819/article/view/276000
DOI: https://doi.org/10.25686/2306-2819.2024.1.60
EDN: https://elibrary.ru/OWLGIP
ID: 276000

Cite item

Abstract

Introduction. The main problem of neural network models in the form of embedded systems is the limited computing resources, which does not allow obtaining the output result in the shortest possible time. Minimizing the time spent on model training also remains a priority. Therefore, when comparing the accuracy of output data and the learning speed of neural network models, the main attention should be paid to different image sizes. The goal of the work is to identify a model that gives the most accurate results after training, has high performance and allows you to produce the most extensive output data with minimal input. The novelty of the study lies in the fact that it was carried out using a comparative analysis of five neural network models on one framework with input data sizes of 32 × 32,48 × 48 and 64 × 64, the combination of which has not been studied.

Description of research objects. Five of the most common models were selected as objects of comparative analysis: DenseNet, GoogLeNet, LeNet, MobileNet and ResNet50. The study uses the available TSRD dataset, which includes 6164 images of road signs containing 58 categories. The images were combined into two sub-databases, one of which is training, and the second is test. There are 4170 image files in the training database, and 1994 in the testing database. Each image contains an annotation with information about the four coordinates of the sign and its category. Their size varies from 26 × 28 to 491 × 402 pixels. Tensor-Flow framework was also used to run all neural network models. The advantage of the models is the accuracy of the output data with a relatively small number of inputs.

Conclusion. According to the results of the study, the following leaders in output accuracy can be identified: ResNet50 and MobileNet. In terms of learning speed, MobileNet takes first place with an average of 169 seconds on the GPU, ResNet50 takes second place with a difference of 465 seconds in terms of learning speed on GPU. MobileNet data accuracy showed high results of 94.5% on CPU and 94.5% on GPU. In terms of accuracy and training time, the LeNet and GoogleNet models showed similar performance and accuracy. The DenseNet model showed the most negative results in its accuracy and performance. Changing the size of the input data had little effect on the output accuracy and scalability of the training environment. Due to the fact that the data sizes may not affect the accuracy of the result, when using the TSRD data set, it is proposed to use 32 × 32 dimensions as input data sizes that allow you to find road signs. This solution is more economical and does not affect the accuracy of the output data. The experiments conducted during the study showed that the accuracy of ResNet50 is similar to the accuracy of MobilNet. This proves that for the definition of road signs, deeper models with a longer learning time are not in every case preferable to small ones.

Keywords

neural network models, deep learning, autopilot, framework, artificial intelligence

Full Text

About the authors

Olga A. Kovaleva

Tambov State University named after G.R. Derzhavin

Email: deevmih3@yandex.ru
ORCID iD: 0000-0003-0735-6205
SPIN-code: 2984-0570

Doctor of Engineering Sciences, Professor of the Department of Mathematical Modeling and Information Technologies

Russian Federation, 33, Internatsionalnaya st., Tambov, 392000

Mikhail I. Deev

Tambov State University named after G.R. Derzhavin

Author for correspondence.
Email: deevmih3@yandex.ru
ORCID iD: 0009-0008-1170-7251

PhD student

Russian Federation, 33, Internatsionalnaya st., Tambov, 392000

Sergey V. Kovalev

Tambov State University named after G.R. Derzhavin

Email: deevmih3@yandex.ru
ORCID iD: 0000-0002-5961-7561
SPIN-code: 6685-7573

Doctor of Engineering Sciences, Professor of the Department of Mathematical Modeling and Information Technologies

Russian Federation, 33, Internatsionalnaya st., Tambov, 392000

Ilya A. Zabrodsky

Tambov State University named after G.R. Derzhavin

Email: deevmih3@yandex.ru
ORCID iD: 0009-0000-5538-1400
SPIN-code: 2896-8971

Senior Lecturer of the Department of Mathematical Modeling and Information Technologies

Russian Federation, 33, Internatsionalnaya st., Tambov, 392000

References

Tikhonov AA. Big data and deep machine learning in artificial neural networks. Science and Education Today. 2018;(6):35–38. (In Russ.).
Petrov SP. Convolutional neural network for car license plate character recognition. System Analysis in Science and Education. 2013;(3):66–73. (In Russ.).
Sozykin AV. Review of methods for training deep neural networks. Bulletin Of SUSU. Series: Computational Mathematics and Computer Science. 2017;6(3):28–59. (In Russ.).
Arjun S. Creating DenseNet 121 with Tensor-Flow. Towards Data Science. Jule 2020. Available from::https://towardsdatascience.com/creating-densenet-121-with-tensorflow-edbc08a956d8 (accessed: 13.06.2023).
Suvaditya M. The Annotated ResNet-50. Towards Data Science. August 2022. Available from: https://towardsdatascience.com/the-annotated-resnet-50-a6c536034758 (accessed: 23.06.2023).
Richmond A. Deep Learning: GoogLeNet Ex-plained. Towards Data Science. December 2020. Available from: https://towardsdatascience.com/deep-learning-googlenet-explained-de8861c82765 (accessed: 17.06.2023).
Kozlov S. Evolution of neural networks for image recognition in Google: GoogLe NeT. Available from: https://habr.com/ru/articles/301084. 2016 (accessed: 26.06.2023).
Rizwan M. LeNet-5 - A Classic CNN Architecture. Data Science Central. October 2018. Available from: https://www.datasciencecentral.com/lenet-5-a-classic-cnn-architecture/ (accessed: 22.06.2023).
Howard AG et al. Mobilenets: Efficient convolutional neural networks for mobile vision applications. 2017. Available from: https://arxiv.org/pdf/1704.04861.pdf. (accessed: 26.06.2023).
Panchenko D. MobileNet: smaller, faster, more accurate. Available from: https://habr.com/-ru/articles/352804/ (accessed: 26.06.2023).

Supplementary files

Supplementary Files

Action

1. JATS XML

Download

2. Fig. 1. One dense Densinet block with 5 layers and a growth rate of k = 4. Each subsequent layer accepts all feature maps from the previous one as input data [4]

Download (76KB)

Indexing metadata

3. Fig. 2. An example of the Densinet model with three dense blocks for object recognition in the image [4]

Download (63KB)

Indexing metadata

4. Fig. 3. The architecture of the ResNet50 model is presented in the form of blocks sorted by levels [5]

Download (88KB)

Indexing metadata

5. Fig. 4. The composite block “Inception module” of the GoogLeNet model in a detailed presentation [7]

Download (92KB)

Indexing metadata

6. Fig. 5. LeNet model architecture [8]

Download (87KB)

Indexing metadata

7. Fig. 6. On the left is a block of a conventional convolutional network, and on the right is a basic MobileNet block [10]

Download (8KB)

Indexing metadata

8. Fig. 7. Graph of the time spent on training for each neural network model at a size of 32x32,48x48,64x64

Download (105KB)

Indexing metadata

9. Fig. 8. Graph of the time spent on training for each neural network model at a size of 32x32,48x48,64x64

Download (104KB)

Indexing metadata

10. Fig. 9. Graph of accuracy of output data of neural network models at a size of 32x32,48x48,64x64

Download (148KB)

Indexing metadata

11. Fig. 10. Graph of accuracy of output data of neural network models at a size of 32x32,48x48,64x64

Download (131KB)

Indexing metadata

Username
Password
Remember me

Forgot password?	Register

Username
Password
Remember me

Forgot password?	Register

No 1(65) (2025)

No 1(65) (2025)

Comparative Analysis and Testing of Deep Learning Neural Network Models for Road Sign Recognition

Full Text

Abstract

Keywords

Full Text

About the authors

Olga A. Kovaleva

Mikhail I. Deev

Sergey V. Kovalev

Ilya A. Zabrodsky

References

Supplementary files