This is an implementation of some well known Data Stream Clustering Algorithms for my engineering thesis @ PWR Algorithmic Computer Science. This project also includes sampling wrappers for aforementioned algorithms.
Maksymilian Neumann
- Internet connection
- Rust toolchain
- Python 3.12
- Install python libraries
pip install -r requirements.txt
- Generate data
python gen_all_data.py
- Run
cargo run -r
- Choose any benchmarks or run all
- Plot to see results in ./plots
python gen_all_plots.py
- (Optional) generate demo results for k-means and DBSCAN as a comparison
python classic_demo.py
- BIRCH
- CluStream
- DenStream
- Static
- Dynamic
Real data used:
B. S. and R. Nagapadma. "RT-IoT2022 ," UCI Machine Learning Repository, 2023. [Online]. Available: https://doi.org/10.24432/C5P338.