Two Methods for Estimating Optical Flow from a Video Sequence of Images
- Авторлар: Butakova M.A.1, Shcherban I.V.2, Mishin N.A.2, Belyavsky G.I.2
- 
							Мекемелер: 
							- Scientific Research and Design Institute of Informatization, Automation and Communication of Railway Transport
- Southern Federal University
 
- Шығарылым: № 1 (2025)
- Беттер: 115-127
- Бөлім: Analysis of Signals, Audio and Video Information
- URL: https://journal-vniispk.ru/2071-8594/article/view/293510
- DOI: https://doi.org/10.14357/20718594250110
- EDN: https://elibrary.ru/GUJXEK
- ID: 293510
Дәйексөз келтіру
Толық мәтін
Аннотация
The content of the article focuses on calculating optical flow from a series of images. Two techniques are presented for solving this challenging computational problem. These techniques play a crucial role in various fields of computer vision, including object tracking, scene analysis, microand macro-motion detection, facial expression recognition, and more. Both techniques complement each other: the first technique, which uses fast convolutions, is best for calculating the video stream across all pixels in an image; the second technique, which relies on robust estimates for linear regression parameters, is better suited for point configurations. It is recommended to perform pre-processing with the first technique to minimize contrast effects, whereas the image quality has little impact on the results with the second technique due to its robust estimates. By their nature, these methods are related to variational approaches for calculating optical flow. However, they differ significantly from the methods described in the literature in terms of speed and accuracy. These methods do not require the use of deep learning, so they can be applied without a large training dataset for methods that utilize deep neural networks for optical flow computation. The results obtained on grayscale images can easily be extended to color images, and most importantly, to systems of secondary features that have recently been used in computer vision.
Негізгі сөздер
Авторлар туралы
Maria Butakova
Scientific Research and Design Institute of Informatization, Automation and Communication of Railway Transport
							Хат алмасуға жауапты Автор.
							Email: m.a.butakova@yandex.ru
				                					                																			                								
Doctor of Technical Sciences, Professor, Chief Researcher
Ресей, Rostov-on-DonIgor Shcherban
Southern Federal University
														Email: shcheri@mail.ru
				                					                																			                								
Doctor of Technical Sciences, Docent, Leading Researcher, Laboratory of Neurotechnology and Psychophysiology
Ресей, Rostov-on-DonNikita Mishin
Southern Federal University
														Email: nmishin@sfedu.ru
				                					                																			                								
Postgraduate student, Department of Optimization Methods and Machine Learning, I. I. Vorovich Institute of Mathematics, Mechanics and Computer Science
Ресей, Rostov-on-DonGrigory Belyavsky
Southern Federal University
														Email: gbelyavski@sfedu.ru
				                					                																			                								
Doctor of Technical Sciences, Professor, Professor, Department of Optimization Methods and Machine Learning, I. I. Vorovich Institute of Mathematics, Mechanics and Computer Science, Leading Researcher, Laboratory of Neurotechnology and Psychophysiology
Ресей, Rostov-on-DonӘдебиет тізімі
- Lucas B. D., Kanade T. An iterative image registration technique with an application to stereo vision // IJCAI'81: 7th international joint conference on Artificial intelligence. 1981. V. 2. P. 674-679.
- Leonida K. L., Sevilla K. V., Manlises C. O. A MotionBased Tracking System Using the Lucas-Kanade Optical Flow Method // 2022 14th International Conference on Computer and Automation Engineering (ICCAE). IEEE, 2022. P. 86-90.
- Horn B. K. P., Schunck B. G. Determining optical flow // Artificial intelligence. 1981. V. 17. No 1-3. P. 185-203.
- NVIDIA Optical Flow SDK // Electronic resource. URL: https://developer.nvidia.com/optical-flow-sdk (accessed 07.01.2024).
- Dosovitskiy A. et al. Flownet: Learning optical flow with convolutional networks // Proceedings of the IEEE international conference on computer vision. 2015. P. 2758-2766.
- Tehrani A. K. Z., Rivaz H. MPWC-Net++: evolution of optical flow pyramidal convolutional neural network for ultrasound elastography // Medical Imaging 2021: Ultrasonic Imaging and Tomography. SPIE, 2021. V. 11602. P. 14-23.
- Sun D. et al. Pwc-net: Cnns for optical flow using pyramid, warping, and cost volume // Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. P. 8934-8943.
- Kavitha K. K., Venkatapur R. B. Motion Estimation using Optical Flow Through A CNN based Pyramidal Warping and Cost Volume Approach: An Optimized PWC-Net Model // Grenze International Journal of Engineering & Technology (GIJET). 2024. V. 10.
- Teed Z., Deng J. Raft: Recurrent all-pairs field transforms for optical flow // Computer Vision–ECCV 2020: 16th European Conference. Glasgow, UK. Proceedings, Part II 16. Springer International Publishing, 2020. P. 402-419.
- Sui X. et al. Craft: Cross-attentional flow transformer for robust optical flow // Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition. 2022. P. 17602-17611.
- Turin G. An introduction to matched filters // IRE transactions on Information theory. 1960. V. 6. No 3. P. 311-329.
- Boldin M. V., Simonova G. I., Tyurin Yu. N. Znakovyj statisticheskij analiz linejnyh modelej [Sign-based methods in linear statistical models] // Amerikanskoe matematicheskoe obshchestvo [American Mathematical Society]. 1997. V. 162.
- Durrani S. et al. Accelerating fourier and number theoretic transforms using tensor cores and warp shuffles // 2021 30th International conference on parallel architectures and compilation techniques (PACT). IEEE, 2021. P. 345-355.
- Cheon B. W., Kim N. H. Modified gaussian filter based on fuzzy membership function for awgn removal in digital images // Journal of information and communication convergence engineering. 2021. V. 19. No 1. P. 54-60.
- Wang Z., Wende-von Berg S., Braun M. Fast parallel Newton–Raphson power flow solver for large number of system calculations with CPU and GPU // Sustainable Energy, Grids and Networks. 2021. V. 27. P. 100483.
- Tihonov A. N. O nekorrektnyh zadachah linejnoj algebry i ustojchivom metode ih resheniya [On incorrect problems of linear algebra and a stable method for their solution] // DAN SSSR. 1965. V. 163. No 3. P. 591-594.
- HD1K Benchmark Suite // Electronic resource. URL: http://hci-benchmark.iwr.uni-heidelberg.de/ (accessed 07.01.2024).
- About MPI-Sintel Flow // Electronic resource. URL: http://sintel.is.tue.mpg.de/about (accessed 07.01.2024).
- Otte M., Nagel H. H. Optical flow estimation: advances and comparisons // Computer Vision–ECCV'94: Third European Conference on Computer Vision. Stockholm, Sweden. Proceedings, Volume I 3. Springer Berlin Heidelberg, 1994. P. 49-60.
- Barron J. L., Fleet D. J., Beauchemin S. S. Performance of optical flow techniques // International journal of computer vision. 1994. V. 12. P. 43-77.
Қосымша файлдар
 
				
			 
						 
					 
						 
						 
						 
									
 
  
  
  Мақаланы E-mail арқылы жіберу
			Мақаланы E-mail арқылы жіберу 