Open
Description
Right now, MULTISHOT experiments under the parallelization limit use only a single core. We could take advantage of this by spreading the shots across multiple cores. The general flow would be:
- Is the computation under the parallelization limit? If not, run single-threaded.
- Does the number of shots exceed the minimum needed to make parallelizing worthwhile? If not, run single-threaded.
- Otherwise, split the shots across threads, with each thread modifying its own thread-local state, then combine the per-thread state.
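The flow above could be sketched roughly as follows. This is only an illustration in Python, not the project's implementation: `PARALLELIZATION_LIMIT`, `MIN_SHOTS_TO_PARALLELIZE`, `run_shots`, `split_shots`, and `multishot` are all hypothetical names, the threshold values are made up, and the "experiment" is a stand-in that just samples random bitstrings.

```python
import random
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical thresholds; the real values are not stated in this issue.
PARALLELIZATION_LIMIT = 19       # assumed: qubit count at which the limit is reached
MIN_SHOTS_TO_PARALLELIZE = 100   # assumed: shot count that must be exceeded to split work


def run_shots(num_qubits, shots, seed):
    """Stand-in for running `shots` trials on one thread.

    All state (the RNG and the tally) is local to this call, so no
    locking is needed while the shots run.
    """
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(shots):
        counts[rng.getrandbits(num_qubits)] += 1
    return counts


def split_shots(shots, workers):
    """Divide `shots` into `workers` chunks whose sizes differ by at most one."""
    base, extra = divmod(shots, workers)
    return [base + (1 if i < extra else 0) for i in range(workers)]


def multishot(num_qubits, shots, workers=4, seed=0):
    # Step 1: not under the parallelization limit, so no shot-level threading.
    if num_qubits >= PARALLELIZATION_LIMIT:
        return run_shots(num_qubits, shots, seed)
    # Step 2: too few shots to be worth splitting.
    if shots <= MIN_SHOTS_TO_PARALLELIZE:
        return run_shots(num_qubits, shots, seed)
    # Step 3: each worker fills its own tally; the tallies are merged at the end.
    chunks = split_shots(shots, workers)
    seeds = [seed + i for i in range(workers)]
    total = Counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for partial in pool.map(run_shots, [num_qubits] * workers, chunks, seeds):
            total.update(partial)  # Counter.update adds counts together
    return total
```

For example, `multishot(3, 1000)` would take the threaded path and return a tally whose counts sum to 1000, while `multishot(3, 10)` would fall back to a single call. In CPython the global interpreter lock keeps pure-Python threads from running in parallel, so this sketch shows only the control flow and the merge step, not an actual speedup.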
This was requested by the Slack community.