Hierarchical clustering based on the information bottleneck method using a control process

Bonmatí Coll, Ester; Bardera i Reig, Antoni; Boada, Imma; Feixas Feixas, Miquel; Sbert, Mateu

Hierarchical clustering based on the information bottleneck method using a control process

Bonmatí Coll, Ester

orcId Bardera i Reig, Antoni scopusId Bardera i Reig, Antoni

Bardera i Reig, Antoni

orcId Boada, Imma scopusId Boada, Imma

Boada, Imma

orcId Feixas Feixas, Miquel scopusId Feixas Feixas, Miquel

Feixas Feixas, Miquel

orcId Sbert, Mateu researcherId Sbert, Mateu scopusId Sbert, Mateu

Sbert, Mateu

2015-08-24

Texto Completo

Hierarchical-clustering.pdf

Solicita copia

Al rellenar este formulario estáis solicitando una copia del artículo, depositado en el repositorio institucional (DUGiDocs), a su autor o al autor principal del artículo. Será el mismo autor quien decidirá enviar una copia del documento a quien lo solicite si lo considera oportuno. En todo caso, la Biblioteca de la UdG no interviene en este proceso ya que no está autorizada a facilitar artículos cuando éstos son de acceso restringido.

Clustering techniques aim organizing data into groups whose members are similar. A key element of these techniques is the definition of a similarity measure. The information bottleneck method provides us a full solution of the clustering problem with no need to define a similarity measure, since a variable is clustered depending on a control variable by maximizing the mutual information between them. In this paper, we propose a hierarchical clustering algorithm based on the information bottleneck method such that, instead of using a control variable, the different possible values of a Markov process are clustered by maximally preserving the mutual information between two consecutive states of the Markov process. These two states can be seen as the input and the output of an information channel that is used as a control process, similarly to how the variable is used as a control variable in the original information bottleneck algorithm. We present both agglomerative and divisive versions of our hierarchical clustering approach and two different applications. The first one, to quantize an image by grouping intensity bins of the image histograms, is tested on synthetic, photographic and medical images and compared with hand-labelled images, hierarchical clustering using Euclidean distance and non-negative matrix factorization methods. The second one, to cluster brain regions by grouping them depending on their connectivity, is tested on medical data. In all the applications, the obtained results demonstrate the efficacy of the method in getting clusters with high mutual information.

Tots els drets reservats

Mostrar el registro completo del ítem

Identificadores

http://hdl.handle.net/10256/10949

issn: 1433-7541

doi: 10.1007/s10044-015-0467-1

eissn: 1433-755X

Texto Completo

Hierarchical-clustering.pdf

Solicita copia

Al rellenar este formulario estáis solicitando una copia del artículo, depositado en el repositorio institucional (DUGiDocs), a su autor o al autor principal del artículo. Será el mismo autor quien decidirá enviar una copia del documento a quien lo solicite si lo considera oportuno. En todo caso, la Biblioteca de la UdG no interviene en este proceso ya que no está autorizada a facilitar artículos cuando éstos son de acceso restringido.

Compartir

Impacto

979

1

Ver Estadísticas de uso

Citado veces en Scopus

Citado veces en Web of Science

H-index de esta revista:

Índice Scimago de 1971:

Google Académico