loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: João M. N. Duarte 1 ; Ana L. N. Fred 2 and F. Jorge F. Duarte 3

Affiliations: 1 Instituto de Telecomunicações, Instituto Superior Técnico and Polytechnic of Porto, Portugal ; 2 Instituto de Telecomunicações and Instituto Superior Técnico, Portugal ; 3 Polytechnic of Porto, Portugal

Keyword(s): Clustering Validation, Constrained Data Clustering.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Data Reduction and Quality Assessment ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Soft Computing ; Symbolic Systems

Abstract: Much attention is being given to the incorporation of constraints into data clustering, mainly expressed in the form of must-link and cannot-link constraints between pairs of domain objects. However, its inclusion in the important clustering validation process was so far disregarded. In this work, we integrate the use of constraints in clustering validation. We propose three approaches to accomplish it: produce a weighted validity score considering a traditional validity index and the constraint satisfaction ratio; learn a new distance function or feature space representation which better suits the constraints, and use it with a validation index; and a combination of the previous. Experimental results in 14 synthetic and real data sets have shown that including the information provided by the constraints increases the performance of the clustering validation process in selecting the best number of clusters.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.133.119.66

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
M. N. Duarte, J.; L. N. Fred, A. and F. Duarte, F. (2013). Data Clustering Validation using Constraints. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing (IC3K 2013) - KDIR; ISBN 978-989-8565-75-4; ISSN 2184-3228, SciTePress, pages 17-27. DOI: 10.5220/0004543800170027

@conference{kdir13,
author={João {M. N. Duarte}. and Ana {L. N. Fred}. and F. Jorge {F. Duarte}.},
title={Data Clustering Validation using Constraints},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing (IC3K 2013) - KDIR},
year={2013},
pages={17-27},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004543800170027},
isbn={978-989-8565-75-4},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing (IC3K 2013) - KDIR
TI - Data Clustering Validation using Constraints
SN - 978-989-8565-75-4
IS - 2184-3228
AU - M. N. Duarte, J.
AU - L. N. Fred, A.
AU - F. Duarte, F.
PY - 2013
SP - 17
EP - 27
DO - 10.5220/0004543800170027
PB - SciTePress