TY - JOUR
T1 - Classification of primary progressive aphasia
T2 - do unsupervised data mining methods support a logopenic variant?
AU - Maruta, Carolina
AU - Pereira, Telma
AU - Madeira, Sara C.
AU - De Mendonça, Alexandre
AU - Guerreiro, Manuela
PY - 2015/6/1
Y1 - 2015/6/1
N2 - Our objective was to test whether data mining techniques, through an unsupervised learning approach, support the three-group diagnostic model of primary progressive aphasia (PPA) versus the existence of two main/classic groups. A series of 155 PPA patients observed in a clinical setting and subjected to at least one neuropsychological/language assessment was studied. Several demographic, clinical and neuropsychological attributes, grouped in distinct sets, were introduced in unsupervised learning methods (Expectation Maximization, K-Means, X-Means, Hierarchical Clustering and Consensus Clustering). Results demonstrated that unsupervised learning methods revealed two main groups consistently obtained throughout all the analyses (with different algorithms and different set of attributes). One group included most of the agrammatic/non-fluent and some logopenic cases while the other was mainly composed of semantic and logopenic cases. Clustering the patients in a larger number of groups (k > 2) revealed some clusters composed mostly of non-fluent or of semantic cases. However, we could not evidence any group chiefly composed of logopenic cases. In conclusion, unsupervised data mining approaches do not support a clear distinction of logopenic PPA as a separate variant.
AB - Our objective was to test whether data mining techniques, through an unsupervised learning approach, support the three-group diagnostic model of primary progressive aphasia (PPA) versus the existence of two main/classic groups. A series of 155 PPA patients observed in a clinical setting and subjected to at least one neuropsychological/language assessment was studied. Several demographic, clinical and neuropsychological attributes, grouped in distinct sets, were introduced in unsupervised learning methods (Expectation Maximization, K-Means, X-Means, Hierarchical Clustering and Consensus Clustering). Results demonstrated that unsupervised learning methods revealed two main groups consistently obtained throughout all the analyses (with different algorithms and different set of attributes). One group included most of the agrammatic/non-fluent and some logopenic cases while the other was mainly composed of semantic and logopenic cases. Clustering the patients in a larger number of groups (k > 2) revealed some clusters composed mostly of non-fluent or of semantic cases. However, we could not evidence any group chiefly composed of logopenic cases. In conclusion, unsupervised data mining approaches do not support a clear distinction of logopenic PPA as a separate variant.
KW - Data mining
KW - Logopenic variant (lvPPA)
KW - Non-fluent variant (nfvPPA)
KW - Primary progressive aphasia
KW - Semantic variant (svPPA)
UR - http://www.scopus.com/inward/record.url?scp=84929744594&partnerID=8YFLogxK
U2 - 10.3109/21678421.2015.1026266
DO - 10.3109/21678421.2015.1026266
M3 - Article
C2 - 25871701
AN - SCOPUS:84929744594
SN - 2167-8421
VL - 16
SP - 147
EP - 159
JO - Amyotrophic Lateral Sclerosis and Frontotemporal Degeneration
JF - Amyotrophic Lateral Sclerosis and Frontotemporal Degeneration
IS - 3-4
ER -