Apply a function in parallel on data sets that are split by some factor variable

EcoGRAPH/clusterapply

Latest commit

d8ac1fa · Mar 5, 2021

clusterapply

clusterapply breaks a dataset into smaller data frames, split by some factor variable, and applies a function (a regression, for example) to each one. Much of this can already be accomplished with the family of *apply functions; the aim here is parallelization with an eye on memory use. The package does not yet contain all intended features and should be considered provisional.
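To make the split-then-apply idea concrete, here is a minimal sketch that uses only base R and the bundled parallel package. It does not use or reflect clusterapply's own functions or interface. The built-in mtcars dataset, the helper name fit_one, and the choice of two workers are all assumptions made for illustration.

```r
library(parallel)

# Split a data frame into smaller frames by a factor variable
# (illustrative choice: mtcars split by number of cylinders).
frames <- split(mtcars, mtcars$cyl)

# Hypothetical per-frame function: fit a regression and return its coefficients.
fit_one <- function(df) coef(lm(mpg ~ wt, data = df))

# Apply the function to each frame in parallel on a small local cluster.
cl <- makeCluster(2)
fits <- parLapply(cl, frames, fit_one)
stopCluster(cl)

# Sequential equivalent using the *apply family, for comparison.
fits_seq <- lapply(frames, fit_one)
```

Each worker receives only the frames assigned to it rather than the whole dataset, which is one reason splitting first can help with memory use.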

This package is used primarily in the EPIDEMIA forecasting system’s epidemiar package: https://github.com/EcoGRAPH/epidemiar/releases/latest

An associated demonstration project is available at: https://github.com/EcoGRAPH/epidemiar-demo/releases/latest

Installation

The development version, whenever the repository is public, can be installed from GitHub with:

# install.packages("devtools")
devtools::install_github("EcoGRAPH/clusterapply")

Example

Please see the associated ca-vignette.Rmd file for an example of clusterapply’s use.