A thermodynamic approach to selecting a number of clusters based on topic modeling


Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

A thermodynamic approach has been applied to solving the problem of selecting the number of clusters/topics in topic modeling. The main principles of this approach are formulated and the behavior of topic models during temperature variations is studied. Using thermodynamic formalism, the existence of the entropy phase transition in topic models is shown and criteria for the choice of optimum number of clusters/ topics are determined.

About the authors

S. N. Koltcov

National Research University Higher School of Economics

Author for correspondence.
Email: skoltsov@hse.ru
Russian Federation, St. Petersburg, 190008

Supplementary files

Supplementary Files
Action
1. JATS XML

Copyright (c) 2017 Pleiades Publishing, Ltd.