GitXplorerGitXplorer
o

federated_kmeans

public
26 stars
2 forks
1 issues

Commits

List of commits on branch master.
Verified
e8e6f89fb3b7c7209d14c1c2f66b21b694199a4f

Merge pull request #2 from ourownstory/add-license-1

oourownstory committed 3 years ago
Verified
ca5a9638b3a7d3f59483a64aef951d7c0ea85254

Create LICENSE

oourownstory committed 3 years ago
Verified
587a9e74547e3f1e5439cd99d2a76e7d318877a9

Update README.md

oourownstory committed 4 years ago
Unverified
537331afbc4e9f02b6d9cb3036a5d83ea0019c63

update paper arxiv

committed 4 years ago
Verified
df45365e04730c4e2a99875c04be1d7f7c19fe66

Update README.md

oourownstory committed 4 years ago
Verified
19e1f678948fa4d28cbc64d538c9f45c285d91d9

Update README.md

oourownstory committed 4 years ago

README

The README file for this repository.

Federated K-Means Clustering Algorithm

You can find the details on the algorithm and a proof of concept in the short paper

Abstract

An algorithm to cluster distributed datasets without communicating data is introduced. It builds upon the concept of federated learning, distributed K-Means and mini-batch K-Means. Results on synthetic data and real data are presented for a non-iid setting. The algorithm is able to produce a clustering of similar or better quality than central K-Means clustering, while preserving privacy.

Results

The Federated clustering algorithm is able to match or outperform the central clustering algorithm if the hyperparameters are appropriately set.

Further preliminary experiments on real energy datasets indicated that the Algorithm is simple and effective to apply to industrial problem settings (results not included).

Note

This is unpolished work, done in May to September 2019. Nevertheless, we decided to share the findings already with the broader science community as others might find use in our simple algorithm.