On determining semantic similarity based on relationships of a combined thesaurus
- Authors: Golitsyna O.L.1, Maksimov N.V.1, Fedorova V.A.1
-
Affiliations:
- MEPhI National Research Nuclear University
- Issue: Vol 50, No 4 (2016)
- Pages: 139-153
- Section: Article
- URL: https://journal-vniispk.ru/0005-1055/article/view/150134
- DOI: https://doi.org/10.3103/S0005105516040026
- ID: 150134
Cite item
Abstract
Problems of the use of thesauruses for fuzzy comparisons of conceptual patterns are considered. A measure of semantic similarity that can be calculated using hierarchical and association relationships of a thesaurus is proposed, as well as an algorithm to compile a semantic intersection of conceptual patterns based on the coinciding maximum principle. A massive of texts and conceptual search patterns of thesis papers was used for experimental studies, which proved that the use of the lexis of different subject fields of a multi-area thesaurus produced a more precise identification of sematic similarity. The power of the pattern intersection increased significantly through pairs of descriptors linked by the semantic similarity measure; however, the average degree of pairwise intersection only increased by 1–2%, which implies an insignificant “expansion” of a conceptual pattern as it is used as a search pattern in creating search-result outputs in automated search mechanisms.
About the authors
O. L. Golitsyna
MEPhI National Research Nuclear University
Author for correspondence.
Email: olgolitsina@yandex.ru
Russian Federation, Moscow
N. V. Maksimov
MEPhI National Research Nuclear University
Email: olgolitsina@yandex.ru
Russian Federation, Moscow
V. A. Fedorova
MEPhI National Research Nuclear University
Email: olgolitsina@yandex.ru
Russian Federation, Moscow
Supplementary files
