A Randomized Algorithm for a Sequence 2-Clustering Problem


Abstract

We consider a strongly NP-hard problem of partitioning a finite sequence of points in Euclidean space into two clusters of given cardinalities so as to minimize the sum, over both clusters, of the intracluster sums of squared distances from the cluster elements to their centers. The center of one cluster is unknown and is defined as the mean of all points in that cluster. The center of the other cluster is the origin. Additionally, the difference between the indices of two consecutive points of the first cluster is bounded from below and above by given constants. We propose a randomized algorithm that, for given values of the relative error and failure probability and for a fixed value of a parameter, finds an approximate solution of the problem in polynomial time. Conditions are established under which the algorithm runs in polynomial time and is asymptotically exact.
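
To make the problem statement concrete, the following minimal Python sketch evaluates the objective and checks the index constraints for one candidate partition. It is not the randomized algorithm proposed in the paper, which the abstract does not describe. The names `is_feasible`, `objective`, `gap_min`, and `gap_max` are hypothetical, indices are 0-based, and `gap_min >= 1` is assumed so that the chosen indices are strictly increasing.

```python
from typing import List, Sequence


def is_feasible(indices: Sequence[int], n_points: int, size: int,
                gap_min: int, gap_max: int) -> bool:
    """Check the constraints on the first cluster (hypothetical helper).

    The cluster must contain exactly `size` indices, all valid positions in
    the sequence, and every difference between consecutive indices must lie
    in [gap_min, gap_max]. With gap_min >= 1 this also forces the indices
    to be strictly increasing.
    """
    if len(indices) != size:
        return False
    if any(i < 0 or i >= n_points for i in indices):
        return False
    gaps = [b - a for a, b in zip(indices, indices[1:])]
    return all(gap_min <= g <= gap_max for g in gaps)


def objective(points: Sequence[Sequence[float]], indices: Sequence[int]) -> float:
    """Sum of squared distances for a given (non-empty) first cluster.

    Points whose index is in `indices` are measured from the mean of that
    cluster; all remaining points are measured from the origin.
    """
    chosen = set(indices)
    dim = len(points[0])
    # Center of the first cluster: coordinate-wise mean of its points.
    center: List[float] = [
        sum(points[i][k] for i in indices) / len(indices) for k in range(dim)
    ]
    origin = [0.0] * dim
    cost = 0.0
    for i, p in enumerate(points):
        ref = center if i in chosen else origin
        cost += sum((x - c) ** 2 for x, c in zip(p, ref))
    return cost


if __name__ == "__main__":
    seq = [(0.0, 0.0), (2.0, 0.0), (0.0, 1.0), (4.0, 0.0)]
    cluster = [1, 3]
    print(is_feasible(cluster, len(seq), size=2, gap_min=1, gap_max=2))
    print(objective(seq, cluster))
```

Enumerating all feasible index sets and taking the minimum of `objective` would solve small instances exactly, but the number of candidates grows quickly with the sequence length, which is consistent with the interest in an approximation algorithm.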

About the authors

A. V. Kel’manov

Sobolev Institute of Mathematics, Siberian Branch, Russian Academy of Sciences; Novosibirsk State University

Author for correspondence.
Email: kelm@math.nsc.ru
Russian Federation, Novosibirsk, 630090; Novosibirsk, 630090

S. A. Khamidullin

Sobolev Institute of Mathematics, Siberian Branch, Russian Academy of Sciences

Author for correspondence.
Email: kham@math.nsc.ru
Russian Federation, Novosibirsk, 630090

V. I. Khandeev

Sobolev Institute of Mathematics, Siberian Branch, Russian Academy of Sciences; Novosibirsk State University

Author for correspondence.
Email: khandeev@math.nsc.ru
Russian Federation, Novosibirsk, 630090; Novosibirsk, 630090

Copyright (c) 2018 Pleiades Publishing, Ltd.