DESIGN: Future API - Minimal/Core/Essential API and Extended/Optional API #172

HenrikBengtsson · 2017-10-25T19:18:23Z

The future package receives many interesting and handy feature requests. Some of them are straighforward whereas others does not necessarily fit straight in. I'm creating this issue to clarify why it's not straightforward to implement these and what the alternatives going forward are, and to encourage further discussion and ideas.

Minimal Future API (aka Future API)

In it's most minimal and essential form, the Future API provides:

future() - creates a future (on any future backend)
value() - collects the value of the future (waits for it to resolve if not already done)
resolved() - checks whether a future is resolved or not.
A future is stateless, i.e. just as plain R functions, evaluation of a future expression is purely functional without side effects and the outcome is the value (or a condition) of the evaluated expression.
The values of futures should not depend in what order they are resolved.

On top of this, we have arguments controlling whether the future should be resolved lazily or eagerly, what or how globals are exported, polling and timeout strategies, etc.

I probably forgot something above, so please feel free to comment.

It is critical that this Minimal Future API can be supported by all future backends (including those not yet implemented by that may show up in the future). Because of this, the Minimal Future API is limited in what it can provide.

Examples of features that probably would fits in the Minimal Future API, but has not yet been added:

Optional Future API

Any features related to futures that can not be supported by all backends belongs to what I consider an extended / optional API - let's call it the Optional Future API. Some features may be specific to a single backend while others to a majority of backends but not all.

Below is a set of features that fit into this category:

"Passing" existing futures to an new one, e.g. a <- future(1); b <- future(value(a)) - requires b to be able to "communicate" with a (e.g. different machines)
Suspending/terminating a future currently being evaluated, e.g. suspend(f) (Issue WISH: Add method for interrupting / terminating a future #93)
Instant forwarding of the future's standard output ~~and standard error~~ streams to the owner process (Issues Obtain function console output before it is resolved? #141, future_lapply() is silent for the multisession plan #171)
"Monitoring" of a future, e.g. progress updates / progress bars (Issue Progressbar HenrikBengtsson/doFuture#8)
Persistent workers, i.e. a future can change the state of an underlying worker that a following future can utilize.
- efficiency: don't export globals that already exist on the worker (requires a method for asserting identical(local, remote).
- this can be for efficiency, e.g. futures that share the same global variables may resolve faster if they are resolved by the same worker (this can be optional, i.e. export global if not already available; think memoization)
- a future preserves a value for a downstream future (not sure if this fits into the concept of futures, but I'll add it here in case someone has thoughts about this)
Resources specifications typically seen in HPC environments, e.g. how much available memory and wall-time need to be available in order to start resolving a specific future. Other examples are access to a GPU. The future.batchtools package actually provides a little bit of these features under the hood, but such features are currently experimental and exploratory.
Other resource specifications, such as only running on the local machine, on the local file system, on a given version of R, access to a certain set of files, and so on.
...

Some of the referenced issues discuss why it's hard to implement the features in a generic fashion such that they would work with all future backends (i.e. why the cannot be added to the Minimal Future API but belongs to a set of optional features).

The text was updated successfully, but these errors were encountered:

wlandau · 2018-02-05T17:14:52Z

For ropensci/drake#227, I am interested in detecting failed futures without having to look at the value. I think there is a need to learn when a future has crashed, for whatever reason.

library(future)
plan(multicore)
f <- future({
  stop("some kind of error")
})

resolved(f)

# [1] TRUE

f$state

# [1] "running"

As I understand it, f$state is not part of the minimal API.

HenrikBengtsson added the help wanted label Oct 25, 2017

This was referenced Oct 25, 2017

future_lapply() is silent for the multisession plan #171

Closed

"Submission rate too high" with a large future_lapply HenrikBengtsson/future.batchtools#13

Closed

Add support for controlling the submission rate of jobs HenrikBengtsson/future.batchtools#14

Closed

HenrikBengtsson mentioned this issue Nov 16, 2017

Migration to batchtools Bioconductor/BiocParallel#64

Closed

This was referenced Nov 30, 2017

future does not invoke winProgressBar #180

Closed

Different plan()s for different futures #181

Open

HenrikBengtsson mentioned this issue Jan 12, 2018

Non-blocking handling of results #163

Closed

This was referenced Feb 6, 2018

future state is "running" after it has been resolved #193

Closed

Manual scheduling ropensci/drake#227

Closed

HenrikBengtsson mentioned this issue Feb 18, 2018

Initial work on a manual scheduler (#227) ropensci/drake#259

Merged

HenrikBengtsson mentioned this issue Mar 11, 2018

Relaunching a future: some notes #205

Open

HenrikBengtsson mentioned this issue Apr 10, 2018

How to redirect callr output while calling from future HenrikBengtsson/future.callr#3

Closed

HenrikBengtsson changed the title ~~DESIGN: Future API - Minimal/Essential API and Extended/Optional API~~ DESIGN: Future API - Minimal/Core/Essential API and Extended/Optional API Apr 22, 2018

HenrikBengtsson mentioned this issue May 11, 2018

detect forking #224

Closed

HenrikBengtsson mentioned this issue Jun 29, 2018

IDEA: Add argument 'future.label' for producing future labels futureverse/future.apply#15

Closed

This was referenced Jul 23, 2018

array.jobs supported for SLURM systems? HenrikBengtsson/future.batchtools#23

Open

retrieve future expression on local machine HenrikBengtsson/future.batchtools#21

Open

This was referenced Aug 25, 2018

Multiple active plans (or connections?) / beeping when resolved #247

Closed

Allow additional globals to be supplied #227

Closed

wlandau mentioned this issue Oct 9, 2018

evaluator column for future_lapply parallelism? ropensci/drake#540

Closed

wlandau mentioned this issue Oct 27, 2018

Remove all non-clustermq parallel backends? ropensci/drake#561

Closed

This was referenced Dec 7, 2018

Determine if calling future({}) will block #264

Closed

Integration with futile.logger #268

Open

This was referenced Feb 11, 2019

Question: best way to prioritise cluster local vars if they exist futureverse/future.apply#37

Closed

Error when using envir & globals parameters in futures that are not using the multisession plan #280

Open

HenrikBengtsson pinned this issue Feb 16, 2019

This was referenced Feb 16, 2019

Fetching globals assigned as function arguments fails when run in parallel futureverse/future.apply#36

Open

setting the finalize option in tweak [options future.delete = FALSE] HenrikBengtsson/future.batchtools#37

Closed

Enchufa2 mentioned this issue Apr 16, 2019

Worker selection #301

Open

HenrikBengtsson mentioned this issue Mar 5, 2020

Working directory set with withr::with_dir() not respected in multisession #363

Open

HenrikBengtsson added a commit that referenced this issue Oct 8, 2020

Experimental support for hook functions [#172] [ci skip]

a09c617

HenrikBengtsson unpinned this issue Oct 11, 2020

This was referenced Nov 28, 2020

Feature request: Evaluate results in master node #447

Closed

Future for package developers and temporary overriding of plans #450

Open

How can I monitor what is being passed out to the workers ? HenrikBengtsson/doFuture#40

Closed

HenrikBengtsson added feature/resources feature/profiling feature/terminate labels Dec 26, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DESIGN: Future API - Minimal/Core/Essential API and Extended/Optional API #172

DESIGN: Future API - Minimal/Core/Essential API and Extended/Optional API #172

HenrikBengtsson commented Oct 25, 2017 •

edited

Loading

wlandau commented Feb 5, 2018 •

edited

Loading

DESIGN: Future API - Minimal/Core/Essential API and Extended/Optional API #172

DESIGN: Future API - Minimal/Core/Essential API and Extended/Optional API #172

Comments

HenrikBengtsson commented Oct 25, 2017 • edited Loading

Minimal Future API (aka Future API)

Optional Future API

wlandau commented Feb 5, 2018 • edited Loading

HenrikBengtsson commented Oct 25, 2017 •

edited

Loading

wlandau commented Feb 5, 2018 •

edited

Loading