-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
python API for retrieving outputs from previous runs #4488
Comments
For those using software-defined assets, |
hey @sryza is |
@zhh210 load_asset_value is documented here. If you need help troubleshooting please create a new issue. |
Thanks @garethbrickman , created a separate ticket on the issue. It seems the |
Use Case
Dagster has multiple execution entrypoints, and it's reasonable that someone might execute in one entry point, and want to utilize the results of that execution in another entrypoint. For example, I might execute a job in Dagit, but I want to observe / validate the intermediary results of that execution in python. We don't currently have a python API that would allow us to do this.
This has broader use case of being able to observe outputs from runs that have been launched/submitted externally, rather than run locally.
There is also the memoization use case, where an output that I am using / assuming exists in the current run was created in a previous run. It's reasonable to want to observe previously memoized outputs in addition to those populated by this run.
Ideas of Implementation
We would need access to the dagster instance for the run history, and also the pipeline itself to reconstruct the IO managers and retrieve results.
Result retrieval would basically just call
load_input
on the IO manager with a properly populated InputContext.User Requests
Message from the maintainers:
Excited about this feature? Give it a 👍. We factor engagement into prioritization.
The text was updated successfully, but these errors were encountered: