Distance–Constrained Data Clustering by Combined k–means Algorithms and Opinion Dynamics Filters

Adriano Fagiolini, Damiano La Manna, Roberto Setola, Gabriele Oliva

Risultato della ricerca: Paper

2 Citazioni (Scopus)

Abstract

Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional data into groups with small in–group and large out–group distances. Most of the existing algorithms fail when a lower bound for the distance among cluster centroids is specified, while this type of constraint can be of help in obtaining a better clustering. Traditional approaches require that the desired number of clusters are specified a priori, which requires either a subjective decision or global meta–information knowledge that is not easily obtainable. In this paper, an extension of the standard data clustering problem is addressed, including additional constraints on the cluster centroid distances. Based on the well–known Hegelsmann–Krause opinion dynamics model, an algorithm that is capable to find admissible solutions is given. A key feature of the algorithm is the ability to partition the original set of data into a suitable number of clusters, without the necessity to specify such a number in advance. In the proposed approach, instead, the maximum distance among any pair of cluster centroids is specified.
Lingua originaleEnglish
Stato di pubblicazionePublished - 2014

Fingerprint

Clustering algorithms
Dynamic models

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Information Systems
  • Computer Science Applications
  • Computer Networks and Communications

Cita questo

@conference{ebd34b42659d4b3f9f005d73b752a517,
title = "Distance–Constrained Data Clustering by Combined k–means Algorithms and Opinion Dynamics Filters",
abstract = "Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional data into groups with small in–group and large out–group distances. Most of the existing algorithms fail when a lower bound for the distance among cluster centroids is specified, while this type of constraint can be of help in obtaining a better clustering. Traditional approaches require that the desired number of clusters are specified a priori, which requires either a subjective decision or global meta–information knowledge that is not easily obtainable. In this paper, an extension of the standard data clustering problem is addressed, including additional constraints on the cluster centroid distances. Based on the well–known Hegelsmann–Krause opinion dynamics model, an algorithm that is capable to find admissible solutions is given. A key feature of the algorithm is the ability to partition the original set of data into a suitable number of clusters, without the necessity to specify such a number in advance. In the proposed approach, instead, the maximum distance among any pair of cluster centroids is specified.",
author = "Adriano Fagiolini and {La Manna}, Damiano and Roberto Setola and Gabriele Oliva",
year = "2014",
language = "English",

}

TY - CONF

T1 - Distance–Constrained Data Clustering by Combined k–means Algorithms and Opinion Dynamics Filters

AU - Fagiolini, Adriano

AU - La Manna, Damiano

AU - Setola, Roberto

AU - Oliva, Gabriele

PY - 2014

Y1 - 2014

N2 - Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional data into groups with small in–group and large out–group distances. Most of the existing algorithms fail when a lower bound for the distance among cluster centroids is specified, while this type of constraint can be of help in obtaining a better clustering. Traditional approaches require that the desired number of clusters are specified a priori, which requires either a subjective decision or global meta–information knowledge that is not easily obtainable. In this paper, an extension of the standard data clustering problem is addressed, including additional constraints on the cluster centroid distances. Based on the well–known Hegelsmann–Krause opinion dynamics model, an algorithm that is capable to find admissible solutions is given. A key feature of the algorithm is the ability to partition the original set of data into a suitable number of clusters, without the necessity to specify such a number in advance. In the proposed approach, instead, the maximum distance among any pair of cluster centroids is specified.

AB - Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional data into groups with small in–group and large out–group distances. Most of the existing algorithms fail when a lower bound for the distance among cluster centroids is specified, while this type of constraint can be of help in obtaining a better clustering. Traditional approaches require that the desired number of clusters are specified a priori, which requires either a subjective decision or global meta–information knowledge that is not easily obtainable. In this paper, an extension of the standard data clustering problem is addressed, including additional constraints on the cluster centroid distances. Based on the well–known Hegelsmann–Krause opinion dynamics model, an algorithm that is capable to find admissible solutions is given. A key feature of the algorithm is the ability to partition the original set of data into a suitable number of clusters, without the necessity to specify such a number in advance. In the proposed approach, instead, the maximum distance among any pair of cluster centroids is specified.

UR - http://hdl.handle.net/10447/98017

UR - http://ieeexplore.ieee.org/document/6961441/

M3 - Paper

ER -