Skip to content

Parallel k-medoids clustering with high accuracy and efficiency, python implementation

License

Notifications You must be signed in to change notification settings

francescodalbello/pamae-clustering

Repository files navigation

PAMAE: Parallel k-Medoids Clustering with High Accuracy and Efficiency

Welcome to the PAMAE repository! This project is created by Francesco Dal Bello and contains the source code for the PAMAE application.

Description

PAMAE is the python implementation of the algorithm described in this paper and has the following functionality I merely implemented it in python, the intellectual rights and merits are of the researchers cited.

Libraries

Configuration

  pamae(ds_import, 2, 1000, 5)

To use the algorithm change the parameters in the pamae() function --> pamae(dataset, number_of_sample, sample_size, number_of_clusters). The dataset should be in csv format and should contain only quantitative values, so any pre-editing of the file may be necessary. For plots to make sense the data must be in two dimensions, loading data in p dimensions the plots will be equally generated but meaningless.

Output example

Phase 1 output: image

Phase 2 output: image

MongoDB export

The algorithm saves the result of the processing in a MongoDB database image image

This is the result on a benchmark dataset image

Contributions

Thank you for the suggestions provided: Pierluigi and Filippo

License

This project is licensed under the MIT Licence. See the LICENSE file for more details.

Contact

If you have any questions or suggestions regarding Pamae, please feel free to contact us:

I appreciate your interest in my implementation of PAMAE and look forward to your feedback!

About

Parallel k-medoids clustering with high accuracy and efficiency, python implementation

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages