Evaluation of the efficiency of the chi-square metric


如何引用文章

全文:

开放存取 开放存取
受限制的访问 ##reader.subscriptionAccessGranted##
受限制的访问 订阅存取

详细

The efficiency of using the chi-square metrics to weigh terms used in text documents is evaluated. The procedure includes the selection and advanced processing of class C and ~C texts, compilation of a reference dictionary and calculation of scores for all the terms in the dictionary, calculation of χ2 coefficients for terms from a class C text, and calculation of the general efficiency factor by the sum of the coefficients found for the terms from the reference dictionary. The weighting by the χ2 formula, odds-ratio (OR) formula, and on the basis of probabilistic variables is analyzed and compared. It was found that the best result is yielded by the OR-based weighting.

作者简介

V. Yatsko

Katanov Khakassia State University

编辑信件的主要联系方式.
Email: viacheslav-yatsko@rambler.ru
俄罗斯联邦, pr. Lenina 92, Abakan, Khakassia, 655000

补充文件

附件文件
动作
1. JATS XML

版权所有 © Allerton Press, Inc., 2016