Utilize este identificador para referenciar este registo: http://hdl.handle.net/10400.21/6191
Título: Comparing clustering solutions: the use of adjusted paired indices
Autor: Amorim, Maria José de Pina da Cruz
Cardoso, Margarida G. M. S.
Palavras-chave: Adjusted indices
Indices of paired agreement
Clustering evaluation
External evaluation
Data: 2015
Editora: Ios Press
Citação: AMORIM, Maria josé; CARDOSO, Margarida G. M. S. - Comparing clustering solutions: the use of adjusted paired indices. Intelligent Data Analysis. ISSN 1088-467X. Vol. 19, N.º 6 (2015), pp. 1275-1296
Resumo: In the present paper we compare clustering solutions using indices of paired agreement. We propose a new method - IADJUST - to correct indices of paired agreement, excluding agreement by chance. This new method overcomes previous limitations known in the literature as it permits the correction of any index. We illustrate its use in external clustering validation, to measure the accordance between clusters and an a priori known structure. The adjusted indices are intended to provide a realistic measure of clustering performance that excludes agreement by chance with ground truth. We use simulated data sets, under a range of scenarios - considering diverse numbers of clusters, clusters overlaps and balances - to discuss the pertinence and the precision of our proposal. Precision is established based on comparisons with the analytical approach for correction specific indices that can be corrected in this way are used for this purpose. The pertinence of the proposed correction is discussed when making a detailed comparison between the performance of two classical clustering approaches, namely Expectation-Maximization (EM) and K-Means (KM) algorithms. Eight indices of paired agreement are studied and new corrected indices are obtained.
Peer review: yes
URI: http://hdl.handle.net/10400.21/6191
DOI: 10.3233/IDA-150782
ISSN: 1088-467X
1571-4128
Aparece nas colecções:ISEL - Matemática - Artigos



FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpace
Formato BibTex MendeleyEndnote 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.