Extended Agglomerative Hierarchical Clustering in R
The R package mdendro performs agglomerative hierarchical clustering (AHC) and extends the standard functionality in several ways:
- Native handling of both similarity and dissimilarity (distance) matrices.
- Calculation of pair-group dendrograms and variable-group multidendrograms [1].
- Implementation of the most common AHC methods in both weighted and unweighted forms: single linkage, complete linkage, average linkage (UPGMA and WPGMA), centroid (UPGMC and WPGMC), and Ward.
- Implementation of two additional parametric families of methods: versatile linkage [2] and beta flexible. Versatile linkage leads naturally to the definition of two additional methods: harmonic linkage and geometric linkage.
- Calculation of the cophenetic (or ultrametric) matrix.
- Calculation of five descriptors of the final dendrogram: cophenetic correlation coefficient, space distortion ratio, agglomerative coefficient, chaining coefficient, and tree balance.
- Calculation and plots of the descriptors for the parametric methods.
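As a minimal sketch of the features listed above, the following R code clusters one distance object and one similarity object with the package's linkage function, then reads the cophenetic matrix and the five descriptors from the result. The argument names (method, type.prox) and the component names (coph, cor, sdr, ac, cc, tb) are taken from the package's reference manual as understood here and should be checked against the installed version. The datasets UScitiesD and Harman74.cor ship with base R.

```r
# Assumes mdendro is installed, e.g. with install.packages("mdendro").
library(mdendro)

# Distance input: road distances between 10 US cities (a "dist" object).
lnk <- linkage(UScitiesD, method = "complete")

# Cophenetic (ultrametric) matrix of the resulting dendrogram.
coph <- lnk$coph

# The five descriptors (component names assumed from the reference manual).
descriptors <- c(cor = lnk$cor, sdr = lnk$sdr, ac = lnk$ac,
                 cc = lnk$cc, tb = lnk$tb)
print(descriptors)

# Similarity input: correlations between 24 psychological tests.
# type.prox tells linkage that larger values mean "closer".
sim <- as.dist(Harman74.cor$cov)
lnk_sim <- linkage(sim, type.prox = "similarity", method = "arithmetic")
plot(lnk_sim, main = "Harman74")
```

Ward and centroid linkage are defined in terms of distances, so the sketch uses arithmetic (average) linkage for the similarity data.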
All this functionality is obtained with three functions: linkage, descval and descplot. Function linkage may be considered as a replacement for functions hclust (in package stats) and agnes (in package cluster). To enhance usability and interoperability, the linkage class includes several methods for plotting, summarizing information, and class conversion.
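The sketch below illustrates how the three functions might fit together: descval and descplot evaluate a descriptor over a range of parameter values for a parametric method, and a linkage object is converted to the classes used by base R. The argument names (par.method, measure) and the availability of as.hclust and as.dendrogram methods are assumptions based on the package's reference manual; the parameter values chosen here are arbitrary.

```r
# Assumes mdendro is installed, e.g. with install.packages("mdendro").
library(mdendro)

# Standardized distances between car models, rounded to 3 decimal digits.
cars <- round(dist(scale(mtcars)), digits = 3)

# Cophenetic correlation of versatile linkage for several parameter values.
pars <- c(-Inf, -1, 0, 1, Inf)
cors <- descval(cars, method = "versatile", par.method = pars,
                measure = "cor")
print(cors)

# The same kind of scan, plotted, for the space distortion ratio.
descplot(cars, method = "versatile", par.method = pars, measure = "sdr")

# Summary and conversion to base R classes for interoperability.
lnk <- linkage(UScitiesD, method = "arithmetic")
summary(lnk)
hc <- as.hclust(lnk)        # usable with stats::cutree, for example
dnd <- as.dendrogram(lnk)   # usable with base dendrogram tools
groups <- cutree(hc, k = 3)
print(groups)
```

Converting to hclust is most meaningful when the data contain no tied distances, since an hclust object can only represent pair-group merges.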
[1] A. Fernández, S. Gómez. Solving non-uniqueness in agglomerative hierarchical clustering using multidendrograms. Journal of Classification 25, 43-65 (2008). DOI:10.1007/s00357-008-9004-x.
[2] A. Fernández, S. Gómez. Versatile linkage: A family of space-conserving strategies for agglomerative hierarchical clustering. Journal of Classification 37, 584-597 (2020). DOI:10.1007/s00357-019-09339-z.
- Alberto Fernández: Dept. Enginyeria Química, Universitat Rovira i Virgili, Tarragona (Spain).
- Sergio Gómez: Dept. Enginyeria Informàtica i Matemàtiques, Universitat Rovira i Virgili, Tarragona (Spain).
The full documentation of mdendro, including description, installation, tutorial, rationale and reference manual, can be found here.