Central partition for a partition-distance and strong pattern graph

Authors

  • Joaquim F. Pinto da Costa Universidade do Porto
  • P.R. Rao Goa University

DOI:

https://doi.org/10.57805/revstat.v2i2.11

Keywords:

clustering, graph algorithms, node cover, combinatorial problems, strong pattern, central partition

Abstract

When several clustering algorithms are applied to a dataset E or the same algorithm with different parameters, we get several different partitions of the dataset. In this paper we consider the problem of finding a consensus partition between the set of these partitions. This consensus partition, called central partition, minimises the average number of disagreements between all of the partitions and has been considered for instance in [14, 5] in a different context from ours. We consider it in the context of partition-distance defined in [7]. We focus our attention in two particular distance functions between partitions and then do an experimental comparison between the two corresponding central partitions. In addition, by using the concept of strong patterns (maximal subset of elements that are always clustered together in all partitions), we define a new graph where the nodes are the strong patterns. This graph contains essentially the same information as the partition graph corresponding to the set E defined in [7], but is much simpler as the number of strong patterns is expected to be much smaller than the cardinal of E. Then, some properties of this new graph are proved.

Published

2004-11-30

How to Cite

Pinto da Costa , J. F., & Rao , P. (2004). Central partition for a partition-distance and strong pattern graph . REVSTAT-Statistical Journal, 2(2), 127–143. https://doi.org/10.57805/revstat.v2i2.11