Probabilistic clustering of interval data

Paula Brito*, A. Pedro Duarte Silva, José G. Dias

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

10 Citations (Scopus)
16 Downloads

Abstract

In this paper we address the problem of clustering interval data, adopting a model-based approach. To this purpose, parametric models for interval-valued variables are used which consider configurations for the variance-covariance matrix that take the nature of the interval data directly into account. Results, both on synthetic and empirical data, clearly show the well-founding of the proposed approach. The method succeeds in finding parsimonious heterocedastic models which is a critical feature in many applications. Furthermore, the analysis of the different data sets made clear the need to explicitly consider the intrinsic variability present in interval data.
Original languageEnglish
Pages (from-to)293-313
Number of pages21
JournalIntelligent Data Analysis
Volume19
Issue number2
DOIs
Publication statusPublished - 2015

Keywords

  • Clustering methods
  • Finite mixture models
  • Interval-valued variable
  • Intrinsic variability
  • Symbolic data

Fingerprint

Dive into the research topics of 'Probabilistic clustering of interval data'. Together they form a unique fingerprint.

Cite this