Comparison between two algorithms for computing the weighted generalized affinity coefficient in the case of interval data

Áurea Sousa*, Osvaldo Silva, Leonor Bacelar-Nicolau, João Cabral, Helena Bacelar-Nicolau

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

From the affinity coefficient between two discrete probability distributions proposed by Matusita, Bacelar-Nicolau introduced the affinity coefficient in a cluster analysis context and extended it to different types of data, including for the case of complex and heterogeneous data within the scope of symbolic data analysis (SDA). In this study, we refer to the most significant partitions obtained using the hierarchical cluster analysis (h.c.a.) of two well-known datasets that were taken from the literature on complex (symbolic) data analysis. h.c.a. is based on the weighted generalized affinity coefficient for the case of interval data and on probabilistic aggregation criteria from a VL parametric family. To calculate the values of this coefficient, two alternative algorithms were used and compared. Both algorithms were able to detect clusters of macrodata (aggregated data into groups of interest) that were consistent and consonant with those reported in the literature, but one performed better than the other in some specific cases. Moreover, both approaches allow for the treatment of large microdatabases (non-aggregated data) after their transformation into macrodata from the huge microdata.

Original languageEnglish
Pages (from-to)1082-1094
Number of pages13
JournalStats
Volume6
Issue number4
DOIs
Publication statusPublished - 13 Oct 2023
Externally publishedYes

Keywords

  • Discrete probability distributions
  • Hierarchical cluster analysis
  • Interval data
  • Weighted generalized affinity coefficient

Fingerprint

Dive into the research topics of 'Comparison between two algorithms for computing the weighted generalized affinity coefficient in the case of interval data'. Together they form a unique fingerprint.

Cite this