卷 23, 编号 4 (2024)

Artificial intelligence, knowledge and data engineering

Kalman Filter for a Particular Class of Dynamic Object Images

Soifer V., Fursov V., Kharitonov S.

摘要

We discuss the problem of estimating the state of a dynamic object by using observed images generated by an optical system. The work aims to implement a novel approach that would ensure improved accuracy of dynamic object tracking using a sequence of images. We utilize a vector model that describes the object image as a limited number of vertexes (reference points). Upon imaging, the object of interest is assumed to be retained at the center of each frame, so that the motion parameters can be considered as projections onto the axes of a coordinate system matched with the camera's optical axis. The novelty of the approach is that the observed parameters (the distance along the optical axis and angular attitude) of the object are calculated using the coordinates of specified points in the object images. For estimating the object condition, a Kalman-Bucy filter is constructed on the assumption that the dynamic object motion is described by a set of equations for the translational motion of the center of mass along the optical axis and variations in the angular attitude relative to the image plane. The efficiency of the proposed method is illustrated by an example of estimating the object's angular attitude.
Informatics and Automation. 2024;23(4):953-968
pages 953-968 views

The Issues of Creation of Machine-Understandable Smart Standards Based on Knowledge Graphs

Shalfeeva E., Gribova V.

摘要

The development of digital transformation requires the widespread use of digital technologies in standardization documents. One of the goals is to create standards with machine-understandable content that will allow the use of digital documents at various stages of development and production without the need for a human operator. The purpose of this work is to describe an approach for creating and translating industry normative documents into a machine-understandable representation for their further use in software services and systems. There are three types of SMART standard content: machine-readable, machine-interpretable, and machine-understandable. Knowledge graphs are actively used to formalize data and knowledge when solving various problems. The new two-level approach is proposed for the creation and translation into a machine-understandable representation of regulatory documents as knowledge graphs. The approach defines two types of interpretation of a smart document (human readability and machine understandability) through two related formats: a graph, each semantic node of which represents text in a natural language, and a network of concepts and strict connections. Each node of a human-readable graph corresponds (in general) to a subtree of a machine-readable knowledge graph. As the basis for ensuring the transformation of one form of smart standard representation into another form, LLM models are used, supplemented by a specialized adapter obtained as a result of additional training using the Parameter-Efficient Fine-Tuning approach. Requirements have been established for a set of problem- and subject-oriented tools for generating knowledge graphs. The conceptual architecture of the system for supporting the solution of a set of problems based on knowledge graphs is shown, and the principles for implementing software components that work with smart knowledge for intelligent software services are established.
Informatics and Automation. 2024;23(4):969-988
pages 969-988 views

Algorithms for the Primary Analysis of Local Fluorescence Objects in the DNA Sequencer «Nanofor SPS»

Manoilov V., Borodinov A., Zarutsky I., Petrov A., Saraev A., Kurochkin V.

摘要

The DNA sequencer "Nanofor SPS", developed at the Institute of Analytical Instrumentation of the Russian Academy of Sciences, implements a method for massively parallel sequencing to decrypt the sequence of nucleic acids. This method allows for the determination of the nucleotide sequence in DNA or RNA, containing from several hundred to hundreds of millions of bases. Thus, there is the opportunity to obtain detailed information about the genome of various biological entities, including humans, animals, and plants. A crucial part of this device is the software, without which it is impossible to solve genome decoding tasks. The output data of optical detection in the sequencer are a set of images over four channels, corresponding to nucleotide types: A, C, G, T. Through specialized software, the position of molecular clusters and their intensity characteristics, along with parameters of the surrounding background, are determined. Algorithms and programs for processing fluorescence signals, discussed in the paper, were developed during the creation of the device software. Also, to debug and test the working programs, models of image construction similar to real data obtained in the course of sequencer operation were created. These models made it possible to obtain a significant amount of information without running expensive experiments. Significant progress has been made in the field of machine learning in recent years, including in the field of bioinformatics, leading to the implementation of the most common models and their potential for practical tasks. However, while these methods have amply proven their worth in secondary bioinformatics data analysis, their potential for the primary analysis remains untapped. This paper focuses on the development and implementation of machine learning methods for primary analysis of optical images of fluorescence signals in reaction cells. The methods of clustering and their testing on models and images obtained from the device are described. The aim of this paper is to demonstrate the capabilities of algorithms for primary analysis of fluorescence signals that arise during sequencing in the «Nanofor SPS» device. The paper describes the main tasks of fluorescence signal analysis and compares traditional methods of solving them and solutions using machine learning technologies.
Informatics and Automation. 2024;23(4):989-1021
pages 989-1021 views

Unet-boosted Classifier – Multi-Task Architecture for Small Datasets Applied to Brain MRI Classification

Sobyanin K., Kulikova S.

摘要

The problem of training deep neural networks on small samples is especially relevant for medical issues. The paper examines the impact of pixel-wise marking of significant objects in the image, over the true class label, on the quality of the classification. To achieve better classification results on small samples, we propose a multitasking architecture – Unet-boosted classifier (UBC), that is trained simultaneously to solve classification and semantic segmentation problems. As the exploratory dataset, MRI images of patients with benign glioma and glioblastoma taken from the BRaTS 2019 data set are used. One horizontal slice of the MRI image containing a glioma is considered as the input (a total of 380 frames in the training set), and the probability of glioblastoma – as the output. Resnet34 was chosen as the baseline, trained without augmentations with a loss function based on cross-entropy. As an alternative solution, UBC-resnet34 is used – the same resnet34, boosted by a decoder built on the U-Net principle and predicting the pixels with glioma. The smoothed Sorensen-Dice coefficient (DiceLoss) is used as a decoder loss function. Results on the test sample: accuracy for the baseline reached 0.71, for the proposed model – 0.81, and the Dice score – 0.77. Thus, a deep model can be well trained even on a small data set, using the proposed architecture, provided that marking of the affected tissues in the form of a semantic mask is provided.
Informatics and Automation. 2024;23(4):1022-1046
pages 1022-1046 views

Restoration of Semantic-Based Super-Resolution Aerial Images

Favorskaya M., Pakhirka A.

摘要

Currently, technologies for remote sensing image processing are actively developing, including both satellite images and aerial images obtained from video cameras of unmanned aerial vehicles. Often such images have artifacts such as low resolution, blurred image fragments, noise, etc. One way to overcome such limitations is to use modern technologies to restore super-resolution images based on deep learning methods. The specificity of aerial images is the presentation of texture and structural elements in a higher resolution than in satellite images, which objectively contributes to better results of restoration. The article provides a classification of super-resolution methods based on the main architectures of deep neural networks, namely convolutional neural networks, visual transformers and generative adversarial networks. The article proposes a method for reconstructing super-resolution aerial images SemESRGAN taking into account semantic features by using an additional deep network for semantic segmentation during the training stage. The total loss function, including adversarial losses, pixel-level losses, and perception losses (feature similarity), is minimized. Six annotated aerial and satellite image datasets CLCD, DOTA, LEVIR-CD, UAVid, AAD, and AID were used for the experiments. The results of image restoration using the proposed SemESRGAN method were compared with the basic architectures of convolutional neural networks, visual transformers and generative adversarial networks. Comparative results of image restoration were obtained using objective metrics PSNR and SSIM, which made it possible to evaluate the quality of restoration using various deep network models.
Informatics and Automation. 2024;23(4):1047-1076
pages 1047-1076 views

Intelligent Neural Network Machine with Thinking Functions

Osipov V.

摘要

In recent years, interest in artificial intelligence based on neural network approaches has grown significantly. A number of significant scientific results have been obtained that have found wide application in practice. Generative adversarial neural network models, neural network transformers, and other solutions have attracted much attention. Obvious progress has been achieved in neural network recognition and image generation, text and speech processing, event forecasting, and control of processes that are difficult to formalize. However, it has not yet been possible to endow neural network machines with thinking. All results obtained using neural network machines can be attributed to solutions based on various types of signal binding without full control of their processing processes. Typical representatives of such machines are ChatGPT. The capabilities for intelligently operating various signals in known neural network machines are very limited. Among the main reasons for such limitations, one should highlight the imperfection of the basic principles of neural network information processing used. The properties of neurons have long been considered in a simplified manner. This was due to both gaps in the field of biological research and the lack of opportunities to build large neural networks on complex neuron models. In recent years the situation has changed. New ways to implement large neural networks have emerged. It has also been established that even individual neurons can have extensive internal memory and implement various functions. However, many mechanisms of neuron functioning and their interactions still remain unclear. The issues of controlled associative access to the internal memory of neurons have been little studied. These shortcomings significantly hinder the creation of thinking neural network machines. The object of research in the article is the process of intelligent neural network information processing. The subject of research: principles, models, and methods of such processing. The goal is to expand the functionality of neural network machines to solve difficult-to-formalize creative problems through the development of new principles, models, and methods of intelligent information processing. In the interests of achieving this goal, the operating principles of intelligent neural network machines are clarified, and new models and methods of neural network information processing are proposed. A new model of a pulse neuron is revealed as a basic element of such machines. It is recommended to form the artificial brain of neural network machines in the form of multilayer neural networks endowed with logical structures with neurons of different parameters. A new method of multi-level intelligent information processing in neural network machines based on smart impulse neurons is proposed. The mechanisms of thinking of neural network machines, and the underlying functions of intellectual operation of images and concepts in neural network memory are explained. Simulation results are presented that confirm the validity of the proposed solutions.
Informatics and Automation. 2024;23(4):1077-1109
pages 1077-1109 views

A Combined Term Extraction Method for the Problem of Monitoring Thematic Discussions in Social Media

Pimeshkov V., Nikonorova M., Shishaev M.

摘要

Term extraction is an important stage in the automated construction of knowledge systems based on natural language texts, since it provides the formation of a basic concept system, which is then used in applied problems of intellectual information processing. The article discusses the problem of automated extraction of terms from natural language texts for their further use in the construction of formalized knowledge systems (ontologies, thesauruses, knowledge graphs) within the problem of monitoring thematic discussions in social media. This problem is characterized by the need to include in the formed knowledge system both concepts from several different domains, and some general concepts used by the audience of social media within thematic discussions. In addition, the generated knowledge system is dynamic both in terms of the composition of the domains it covers and the composition of relevant concepts to be included in the system. The use of existing classical methods for term extraction in this case is difficult, since they are focused on extracting terms within one domain. Based on this, to solve the problem under consideration, a combined method is proposed, combining approaches based on dictionaries, NER tools and rules. The results of the experiments demonstrate the effectiveness of the proposed combination of approaches to term extraction, which makes it possible to extract terms for the problem of monitoring and analyzing thematic discussions in social media. The developed method significantly exceeds the precision of the considered term extraction tools. As a further direction of research, the possibility of developing a method for solving the problem of identifying nested terms or entities is considered.
Informatics and Automation. 2024;23(4):1110-1138
pages 1110-1138 views

A Conception of Collaborative Decision Support Systems: Approach and Platform Architecture

Smirnov A., Ponomarev A., Shilov N., Levashova T., Teslya N.

摘要

The paper describes a general conception of collaborative decision support systems, in which teams providing decision support a) are formed flexibly in accordance with the problem and b) consist of both human experts and intelligent agents implementing AI methods and techniques. An analysis of the key problems of creating collaborative decision support systems based on the collaboration of humans and AI is carried out, the following problems are highlighted: ensuring interoperability (mutual understanding) between heterogeneous team members, reconciling differing positions of participants, ensuring trust between participants, ensuring the effectiveness of joint actions planning and maintaining a balance between predefined workflows and self-organization. Principles for constructing such systems have been formed, offering solutions to the identified problems. In particular, it is proposed to employ an ontology-oriented representation of information about the problem (in the form of multi-aspect ontology), a set of methods for monitoring team activities, reputation scheme, elements of explainable AI, as well as mechanisms of limited self-organization. The proposed concept forms the basis of a software platform for the development of collaborative decision support systems, the main architectural provisions of which are also presented in the paper. The use of the platform is illustrated by an example from the field of rational management of road infrastructure and the creation of a collaborative DSS for the development of measures to reduce road accidents.
Informatics and Automation. 2024;23(4):1139-1172
pages 1139-1172 views

A Method for Recognition of Sentiment and Emotions in Russian Speech Transcripts Using Machine Translation

Dvoynikova A., Kagirov I., Karpov A.

摘要

This paper addresses the issue of user emotions and sentiment recognition in transcripts of Russian speech samples using lexical methods and machine translation. The availability of data for sentiment analysis in Russian texts is quite limited, thus this paper proposes a new approach which is based on automatic machine translation of Russian texts into English. Additionally, the paper presents the results of experimental research regarding the impact of partial and full machine translation on emotion and sentiment recognition. Partial translation means translating single lexemes not included in Russian sentiment dictionaries, while full translation implies translating the entire text. A translated text is further analyzed using different English sentiment dictionaries. Experiments have demonstrated that the combination of all English sentiment dictionaries enhances the accuracy of emotion and sentiment recognition in text data. Furthermore, this paper explores the correlation between the length of the text data vector and its representativity. Experimental research for emotion and sentiment recognition tasks was conducted with the use of expert and automatic transcripts of the multimodal Russian corpus RAMAS. Based on the experimental results, one can conclude that the use of word lemmatization is a more effective approach for normalizing words in speech transcripts compared to stemming. The use of the proposed methods involving full and partial machine translation allows for an improvement in sentiment and emotion recognition accuracy by 0.65-9.76% in terms of F-score compared to the baseline approach. As a result of the application of machine translation methods to expert and automatic transcriptions of the Russian speech corpus RAMAS, an accuracy in recognition of 7 emotion classes was achieved at 31.12% and 23.74%, and 3 sentiment classes at 75.37% and 71.60%, respectively. Additionally, the experiments revealed that the use of statistical vectors as a text data vectorization method results in an a 1-5% increase in F-score value compared to concatenated (statistical and sentiment) vectors.
Informatics and Automation. 2024;23(4):1173-1198
pages 1173-1198 views

Cascade Classifier for the Detection and Identification of Birds in a Videostream

Vlasov E., Krasnenko N.

摘要

A method and a prototype of the program for detecting the presence of birds in the video data flow in real time are presented in the paper. The method is based on the cascade classifier solving the problem of bird detection and identification with the use of a bioacoustic bird scaring system deployed at the Tomsk airport. In our research, the Viola-Jones cascade classifier representing one of the implementations of the Haar cascade algorithm has been used. This algorithm allows objects to be detected in images and videos with high accuracy and rate. In this case, the classifier was leaned on the data set containing images of birds that allowed us to reach high accuracy of bird detection and identification in the videos. The possibilities of the developed classifier are also estimated, and its high productivity is shown. In this study, various methods of machine learning and video data analysis are used to obtain exact and reliable results. As a whole, the present work is an innovative approach to a solution to the urgent problem of airport protection from birds. The application of the developed method has allowed the operating efficiency of the bioacoustic bird scaring system to be increased together with the safety of flights at the Tomsk airport, thereby decreasing the probability of airplane collisions with birds. The novelty of the work consists of the application of the Viola–Jones method for solving the problem of bird detection and identification and estimating its efficiency. Thus, this work is an important contribution to the development of methods for detecting and identifying objects in videos and can also be used in other fields of science and technology in which the automatic detection and classification of objects in the video data flow is required.
Informatics and Automation. 2024;23(4):1199-1220
pages 1199-1220 views

Using SAR Data for Monitoring of Agricultural Crops in the South of the Russian Far East

Verkhoturov A., Stepanov A., Illarionova L.

摘要

The use of SAR data to monitoring agricultural crops is a promising area of research designed to complement existing methods and technologies based on the analysis of multispectral images. The main advantages of vegetation indices calculated from SAR data include their sensitivity to the polarimetric properties of the backscatter intensity, its scattering characteristics, and independence from cloud cover. This is especially important for the territory of the south of the Russian Far East, whose monsoon climate provides humid and cloudy weather during the period when crops gain maximum biomass. For arable lands in the Khabarovsk Territory and the Amur Region, a total of 64 Sentinel-1 SAR images were obtained from May to October 2021. For each date, the values of the DpRVI, RVI, VH/VV indices were calculated and time series were constructed for the entire observation period for individual fields (342 fields in total). NDVI time series were constructed from Sentinel-2 multispectral images using a cloud mask. The characteristics of time series extremes were calculated for different types of arable land: soybeans, oats, and fallows. It was shown that for each crop the seasonal curves DpRVI, RVI, VH/VV had a characteristic appearance. It was found that the DpRVI demonstrated the highest stability – the coefficients of variation of the seasonal variation of DpRVI were significantly lower than those for RVI and VH/VV. It was also revealed that the similarity between the curves of these indices remained for regions quite distant from each other - the Khabarovsk Territory and the Amur Region. The main characteristics of the seasonal variation of time series of radar indices were calculated in comparison with NDVI - the magnitude of the maximum, the date of the maximum and the values of the coefficient of variation for these indicators. It was found, firstly, that the values of these indicators in different regions are similar to each other; secondly, the variability of the maximum and the day of the maximum for DpRVI is lower than for RVI and VH/VV; thirdly, the variability of the maximum and the day of the maximum for DpRVI is comparable to NDVI. Thus, time series of radar indices DpRVI, RVI, VH/VV for the main types of agricultural lands in the south of the Far East have distinctive features and can be used in classification problems, yield modeling and crop rotation control.
Informatics and Automation. 2024;23(4):1221-1245
pages 1221-1245 views

Information security

Post-Quantum Public-Key Cryptoschemes on Finite Algebras

Moldovyan A., Moldovyan D., Moldovyan N.

摘要

One direction in the development of practical post-quantum public-key cryptographic algorithms is the use of finite algebras as their algebraic carrier. Two approaches in this direction are considered: 1) construction of electronic digital signature algorithms with a hidden group on non-commutative associative algebras and 2) construction of multidimensional cryptography algorithms using the exponential operation in a vector finite field (in a commutative algebra, which is a finite field) to specify a nonlinear mapping with a secret trapdoor. The first approach involves the development of two types of cryptoschemes: those based on the computational difficulty of a) the hidden discrete logarithm problem and b) solving a large system of quadratic equations. For the second type, problems arise in ensuring complete randomization of the digital signature and specifying non-commutative associative algebras of large dimension. Ways to solve these problems are discussed. The importance of studying the structure of finite non-commutative algebras from the point of view of decomposition into a set of commutative subalgebras is shown. Another direction is aimed at a significant (10 or more times) reduction in the size of the public key in multivariate-cryptography algorithms and is associated with the problem of developing formalized, parameterizable, unified methods for specifying vector finite fields of large dimensions (from 5 to 130) with a sufficiently large number of potentially implementable types and modifications each type (up to 2500 or more). Variants of such methods and topologies of nonlinear mappings on finite vector fields of various dimensions are proposed. It is shown that the use of mappings that specify the exponential operation in vector finite fields potentially eliminates the main drawback of known multivariate-cryptography algorithms, which is associated with the large size of the public key.
Informatics and Automation. 2024;23(4):1246-1276
pages 1246-1276 views

Согласие на обработку персональных данных с помощью сервиса «Яндекс.Метрика»

1. Я (далее – «Пользователь» или «Субъект персональных данных»), осуществляя использование сайта https://journals.rcsi.science/ (далее – «Сайт»), подтверждая свою полную дееспособность даю согласие на обработку персональных данных с использованием средств автоматизации Оператору - федеральному государственному бюджетному учреждению «Российский центр научной информации» (РЦНИ), далее – «Оператор», расположенному по адресу: 119991, г. Москва, Ленинский просп., д.32А, со следующими условиями.

2. Категории обрабатываемых данных: файлы «cookies» (куки-файлы). Файлы «cookie» – это небольшой текстовый файл, который веб-сервер может хранить в браузере Пользователя. Данные файлы веб-сервер загружает на устройство Пользователя при посещении им Сайта. При каждом следующем посещении Пользователем Сайта «cookie» файлы отправляются на Сайт Оператора. Данные файлы позволяют Сайту распознавать устройство Пользователя. Содержимое такого файла может как относиться, так и не относиться к персональным данным, в зависимости от того, содержит ли такой файл персональные данные или содержит обезличенные технические данные.

3. Цель обработки персональных данных: анализ пользовательской активности с помощью сервиса «Яндекс.Метрика».

4. Категории субъектов персональных данных: все Пользователи Сайта, которые дали согласие на обработку файлов «cookie».

5. Способы обработки: сбор, запись, систематизация, накопление, хранение, уточнение (обновление, изменение), извлечение, использование, передача (доступ, предоставление), блокирование, удаление, уничтожение персональных данных.

6. Срок обработки и хранения: до получения от Субъекта персональных данных требования о прекращении обработки/отзыва согласия.

7. Способ отзыва: заявление об отзыве в письменном виде путём его направления на адрес электронной почты Оператора: info@rcsi.science или путем письменного обращения по юридическому адресу: 119991, г. Москва, Ленинский просп., д.32А

8. Субъект персональных данных вправе запретить своему оборудованию прием этих данных или ограничить прием этих данных. При отказе от получения таких данных или при ограничении приема данных некоторые функции Сайта могут работать некорректно. Субъект персональных данных обязуется сам настроить свое оборудование таким способом, чтобы оно обеспечивало адекватный его желаниям режим работы и уровень защиты данных файлов «cookie», Оператор не предоставляет технологических и правовых консультаций на темы подобного характера.

9. Порядок уничтожения персональных данных при достижении цели их обработки или при наступлении иных законных оснований определяется Оператором в соответствии с законодательством Российской Федерации.

10. Я согласен/согласна квалифицировать в качестве своей простой электронной подписи под настоящим Согласием и под Политикой обработки персональных данных выполнение мною следующего действия на сайте: https://journals.rcsi.science/ нажатие мною на интерфейсе с текстом: «Сайт использует сервис «Яндекс.Метрика» (который использует файлы «cookie») на элемент с текстом «Принять и продолжить».