Approximation algorithm for the problem of partitioning a sequence into clusters


Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

We consider the problem of partitioning a finite sequence of Euclidean points into a given number of clusters (subsequences) using the criterion of the minimal sum (over all clusters) of intercluster sums of squared distances from the elements of the clusters to their centers. It is assumed that the center of one of the desired clusters is at the origin, while the center of each of the other clusters is unknown and determined as the mean value over all elements in this cluster. Additionally, the partition obeys two structural constraints on the indices of sequence elements contained in the clusters with unknown centers: (1) the concatenation of the indices of elements in these clusters is an increasing sequence, and (2) the difference between an index and the preceding one is bounded above and below by prescribed constants. It is shown that this problem is strongly NP-hard. A 2-approximation algorithm is constructed that is polynomial-time for a fixed number of clusters.

About the authors

A. V. Kel’manov

Sobolev Institute of Mathematics, Siberian Branch; Novosibirsk State University

Author for correspondence.
Email: kelm@math.nsc.ru
Russian Federation, Novosibirsk, 630090; Novosibirsk, 630090

L. V. Mikhailova

Sobolev Institute of Mathematics, Siberian Branch

Email: kelm@math.nsc.ru
Russian Federation, Novosibirsk, 630090

S. A. Khamidullin

Sobolev Institute of Mathematics, Siberian Branch

Email: kelm@math.nsc.ru
Russian Federation, Novosibirsk, 630090

V. I. Khandeev

Sobolev Institute of Mathematics, Siberian Branch; Novosibirsk State University

Email: kelm@math.nsc.ru
Russian Federation, Novosibirsk, 630090; Novosibirsk, 630090

Supplementary files

Supplementary Files
Action
1. JATS XML

Copyright (c) 2017 Pleiades Publishing, Ltd.