Image classification based on deep learning with automatic relevance determination and structured Bayesian pruning
- 作者: Pham C.T.1, Phan M.N.1, Tran T.T.2
-
隶属关系:
- The University of Danang
- University of Economics, The University of Danang
- 期: 卷 16, 编号 4 (2024)
- 页面: 927-938
- 栏目: MODELS IN PHYSICS AND TECHNOLOGY
- URL: https://journal-vniispk.ru/2076-7633/article/view/306594
- DOI: https://doi.org/10.20537/2076-7633-2024-16-4-927-938
- ID: 306594
如何引用文章
全文:
详细
Deep learning’s power stems from complex architectures; however, these can lead to overfitting, where models memorize training data and fail to generalize to unseen examples. This paper proposes a novel probabilistic approach to mitigate this issue. We introduce two key elements: Truncated Log-Uniform Prior and Truncated Log-Normal Variational Approximation, and Automatic Relevance Determination (ARD) with Bayesian Deep Neural Networks (BDNNs). Within the probabilistic framework, we employ a specially designed truncated log-uniform prior for noise. This prior acts as a regularizer, guiding the learning process towards simpler solutions and reducing overfitting. Additionally, a truncated log-normal variational approximation is used for efficient handling of the complex probability distributions inherent in deep learning models. ARD automatically identifies and removes irrelevant features or weights within a model. By integrating ARD with BDNNs, where weights have a probability distribution, we achieve a variational bound similar to the popular variational dropout technique. Dropout randomly drops neurons during training, encouraging the model not to rely heavily on any single feature. Our approach with ARD achieves similar benefits without the randomness of dropout, potentially leading to more stable training.To evaluate our approach, we have tested the model on two datasets: the Canadian Institute For Advanced Research (CIFAR-10) for image classification and a dataset of Macroscopic Images of Wood, which is compiled from multiple macroscopic images of wood datasets. Our method is applied to established architectures like Visual Geometry Group (VGG) and Residual Network (ResNet). The results demonstrate significant improvements. The model reduced overfitting while maintaining, or even improving, the accuracy of the network’s predictions on classification tasks. This validates the effectiveness of our approach in enhancing the performance and generalization capabilities of deep learning models.
作者简介
Cong Pham
The University of Danang
Email: pcthang@dut.udn.vn
Minh Phan
The University of Danang
Email: pcthang@dut.udn.vn
Thi Tran
University of Economics, The University of Danang
编辑信件的主要联系方式.
Email: pcthang@dut.udn.vn
参考
- J. Beckers, B. V. Erp, Z. Zhao, K. Kondrashov, B. D. Vries, “Principled pruning of Bayesian neural networks through variational free energy minimization”, IEEE Open Journal of Signal Processing, 5 (2023), 195–203.
- L. G. Esteban, F. G. Fernandez, et al., “Artificial neural networks in wood identification: the case of two Juniperus species from the Canary Islands”, IAWA Journal, 30 (2009), 87–94.
- M. Figurnov, A. Ibraimova, D. Vetrov, P. Kohli, “Perforated CNNs: acceleration through elimination of redundant convolutions”, International Conference on Neural Information Processing Systems, 2016, 955–963.
- P. L. P. Filho, L. S. Oliveira, S. Nisgoski, A. S. Britto, “Forest species recognition using macroscopic images”, Machine Vision and Applications, 25 (2014), 1019–1031.
- Y. Gal, Uncertainty in Deep Learning, University of Cambridge, 2016, 174 pp.
- Y. Gal, Z. Ghahramani, “Dropout as a Bayesian approximation: representing model uncertainty in deep learning”, International Conference on Machine Learning, ICML, 48 (2016), 1050–1059.
- T. He, S. Mu, H. Zhou, J. Hu, “Wood species identification based on an ensemble of deep convolution neural networks”, Wood Research, 66 (2021), 1–14.
- K. He, X. Zhang, S. Ren, J. Sun, “Deep residual learning for image recognition”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, 770–778.
- G. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, R. Salakhutdinov, “Improving neural networks by preventing co-adaptation of feature detectors”, CoRR, 2012, 1–18.
- J. Hron, A. Matthews, Z. Ghahramani, “Variational Bayesian dropout: pitfalls and fixes”, International Conference on Machine Learning, 2018, 2019–2028.
- A. Jungo, et al., “Towards uncertainty-assisted brain tumor segmentation and survival prediction”, International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI (BrainLes 2017), 2018, 474–485.
- V. Kharitonov, D. Molchanov, D. Vetrov, Variational Dropout via Empirical Bayes, 2018, 5 pp., arXiv:1811.00596.
- D. P. Kingma, T. Salimans, M. Welling, “Variational dropout and the local reparameterization trick”, International Conference on Neural Information Processing Systems, 2015, 2575–2583.
- B. Lakshminarayanan, A. Pritzel, C. Blundell, “Simple and scalable predictive uncertainty estimation using deep ensembles”, International Conference on Neural Information Processing Systems, 2017, 6405–6416.
- V. Lebedev, V. Lempitsky, “Fast convnets using group-wise brain damage”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, 2554–2564.
- F. Lens, C. Liang, Y. Guo, et al., “Computer-assisted timber identification based on features extracted from microscopic wood sections”, IAWA Journal, 41 (2020), 660–680.
- Z. Li, B. Gong, T. Yang, “Improved dropout for shallow and deep learning”, International Conference on Neural Information Processing Systems, 2016, 2531–2539.
- C. Louizos, K. Ullrich, M. Welling, “Bayesian compression for deep learning”, International Conference on Neural Information Processing Systems, 2017, 3290–3300.
- D. J. C. MacKay, “Probable networks and plausible predictions — a review of practical Bayesian methods for supervised neural networks”, Network: Computation in Neural Systems, 6 (1995), 469–505.
- S. Mathew, D. B. Rowe, Pruning a Neural Network Using Bayesian Inference, 2023, 26 pp., arXiv:2308.02451.
- D. Molchanov, A. Ashukha, D. Vetrov, “Variational dropout sparsifies deep neural networks”, International Conference on Machine Learning, 70 (2017), 2498–2507.
- E. Mortaz, “Imbalance accuracy metric for model selection in multi-class imbalance classification problems”, Knowledge-Based Systems, 210 (2020), 1–8.
- R. M. Neal, Bayesian Learning for Neural Networks, Lecture Notes in Computer Science, 118, 2012, 204 pp.
- K. Neklyudov, D. Molchanov, A. Ashukha, P. Vetrov, “Structured Bayesian pruning via log-normal multiplicative noise”, International Conference on Neural Information Processing Systems, 2017, 6778–6787.
- N. Rosa da Silva, V. Deklerck, J. M. Baetens, et al., “Improved wood species identification based on multi-view imagery of the three anatomical planes”, Plant Methods, 18 (2022), 1–17.
- J. L. Silva, R. Bordalo, J. Pissarra, “Wood identification: an overview of current and past methods”, ECR, 12 (2020), 45–68, Studies in Conservation & Restoration.
- J. L. Silva, R. Bordalo, J. Pissarra, P. Palacios, “Computer vision-based wood identification: a review”, Forests, 13 (2022), 1–26.
- K. Simonyan, A. Zisserman, “Very deep convolutional networks for large-scale image recognition”, International Conference on Learning Representations, ICLR, 2015, 1–14.
- N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, R. Salakhutdinov, “Dropout: a simple way to prevent neural networks from overfitting”, The Journal of Machine Learning Research, 15 (2014), 1929–1958.
- M. K. Titsias, M. Lazaro-Gredilla, “Doubly stochastic variational Bayes for non-conjugate inference”, International Conference on Machine Learning, ICML, 32 (2014), 1971–1979.
- W. Wen, C. Wu, Y. Wang, Y. Chen, H. Li, “Learning structured sparsity in deep neural networks”, International Conference on Neural Information Processing Systems, 2016, 2082–2090.
- A. Zhou, K. Luo, “Sparse dropout regularization method for convolutional neural networks”, Journal of Chinese Computer Systems, 39 (2018), 1674–1679.
- Y. Zhou, Y. Zhang, Y. Wang, Q. Tian, “Accelerate CNN via Recursive Bayesian Pruning”, IEEE International Conference on Computer Vision, ICCV, 2019, 3305–3314.
- L. Zhu, N. Laptev, “Deep and confident prediction for time series at Uber”, IEEE International Conference on Data Mining, ICDM, 2017, 103–110.
补充文件
