Skip to content

computorg/published-202507-jacques-count-data

Repository files navigation

Model-Based Clustering and Variable Selection for Multivariate Count Data

build and publish DOI:10.57750/6v7b-8483 reviews SWH Creative Commons License

Authors:

Model-based clustering provides a principled way of developing clustering methods. We develop a new model-based clustering methods for count data. The method combines clustering and variable selection for improved clustering. The method is based on conditionally independent Poisson mixture models and Poisson generalized linear models. The method is demonstrated on simulated data and data from an ultra running race, where the method yields excellent clustering and variable selection performance.