Section: New Results
Algorithmic foundations
Keywords: Computational geometry, computational topology, optimization, data analysis.
Comparing two clusterings using matchings between clusters of clusters
Participants : F. Cazals, D. Mazauric, R. Tetley.
In collaboration with R. Watrigant, University Lyon I.
Clustering is a fundamental problem in data science, yet, the variety of clustering methods and their sensitivity to parameters make clustering hard. To analyze the stability of a given clustering algorithm while varying its parameters, and to compare clusters yielded by different algorithms, several comparison schemes based on matchings, information theory and various indices (Rand, Jaccard) have been developed. In this work [15], we go beyond these by providing a novel class of methods computing meta-clusters within each clustering– a meta-cluster is a group of clusters, together with a matching between these.
Let the intersection graph of two clusterings be the edge-weighted
bipartite graph in which the nodes represent the clusters, the edges
represent the non empty intersection between two clusters, and the
weight of an edge is the number of common items. We introduce the
so-called
Our experiments illustrate the role of
Low-Complexity Nonparametric Bayesian Online Prediction with Universal Guarantees
Participant : F. Cazals.
In collaboration with A. Lhéritier, Amadeus SA.
In this work [18],
we propose a novel nonparametric online predictor for discrete labels
conditioned on multivariate continuous features. The predictor is
based on a feature space discretization induced by a full-fledged k-d
tree with randomly picked directions and a recursive Bayesian
distribution, which allows to automatically learn the most relevant
feature scales characterizing the conditional distribution. We prove
its pointwise universality, i.e., it achieves a normalized log loss
performance asymptotically as good as the true conditional entropy of
the labels given the features. The time complexity to process the n-th
sample point is
How long does it take for all users in a social network to choose their communities?
Participant : D. Mazauric.
In collaboration with J.-C. Bermond (Coati project-team), A. Chaintreau (Columbia University), and G. Ducoffe (National Institute for Research and Development in Informatics, Bucharest).
In this work [14],
we consider a community formation problem in social networks, where the users are either friends or enemies. The users are partitioned into conflict-free groups (i.e., independent sets in the conflict graph