
New AWS jobstore. #5123


Open · wants to merge 17 commits into master

Conversation

@DailyDreaming (Member) commented Oct 14, 2024

Rewritten without SDB.

Changelog Entry

To be copied to the draft changelog by merger:

  • PR submitter writes their recommendation for a changelog entry here

Reviewer Checklist

  • Make sure it is coming from issues/XXXX-fix-the-thing in the Toil repo, or from an external repo.
    • If it is coming from an external repo, make sure to pull it in for CI with:
      contrib/admin/test-pr otheruser theirbranchname issues/XXXX-fix-the-thing
      
    • If there is no associated issue, create one.
  • Read through the code changes. Make sure that it doesn't have:
    • Addition of trailing whitespace.
    • New variable or member names in camelCase that want to be in snake_case.
    • New functions without type hints.
    • New functions or classes without informative docstrings.
    • Changes to semantics not reflected in the relevant docstrings.
    • New or changed command line options for Toil workflows that are not reflected in docs/running/{cliOptions,cwl,wdl}.rst
    • New features without tests.
  • Comment on the lines of code where problems exist with a review comment. You can shift-click the line numbers in the diff to select multiple lines.
  • Finish the review with an overall description of your opinion.

Merger Checklist

  • Make sure the PR passes tests.
  • Make sure the PR has been reviewed since its last modification. If not, review it.
  • Merge with the GitHub "Squash and merge" feature.
    • If there are multiple authors' commits, add Co-authored-by to give credit to all contributing authors.
  • Copy its recommended changelog entry to the Draft Changelog.
  • Append the issue number in parentheses to the changelog entry.

@DailyDreaming DailyDreaming self-assigned this Oct 14, 2024
@adamnovak (Member) left a comment

It looks like this probably works as written, but the documentation for how the S3-backed storage works (metadata objects named by file IDs referencing ETag hashes for content-addressed data objects many-to-one) doesn't match what's actually implemented (metadata objects and data objects both named by file ID).

It might be good to actually implement the described content-addressing design, but there are some big unsolved questions:

  • ETags on S3 are all MD5-based and not collision-resistant, so it's easy to get two actual files with the same MD5 and put them in the job store to break it.
  • Streaming uploads don't know their hash until they are uploaded, so where should they be uploaded to?
  • Without transactions, how do you maintain the consistency of many-to-one relationships between metadata objects and data objects, while still allowing data objects to be deleted when the last referencing metadata object goes away?

It might make sense to not solve this deduplication problem now, and to just admit to using one data copy per file ID as is actually implemented. If we cram executability into the ID somehow, and use the ETag provided by S3 in the response header for integrity checking, it might be possible to not have a metadata file at all.
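To illustrate the last idea: for non-multipart uploads, S3's ETag is the plain hex MD5 of the object body, so a download could be integrity-checked against the ETag from the response header with no separate metadata file. A minimal sketch of that check (pure hashlib, no AWS calls; the function names are hypothetical, not part of this PR):

```python
import hashlib

def etag_for_single_part(data: bytes) -> str:
    """For non-multipart uploads, S3's ETag is the hex MD5 of the body."""
    return hashlib.md5(data).hexdigest()

def verify_download(data: bytes, response_etag: str) -> None:
    """Compare the (quoted) ETag S3 returned to a locally computed MD5.

    Only valid for single-part uploads; multipart ETags carry a '-N'
    suffix and are not a plain MD5 of the content.
    """
    expected = response_etag.strip('"')
    if "-" in expected:
        raise ValueError("multipart ETag; cannot verify with a plain MD5")
    actual = etag_for_single_part(data)
    if actual != expected:
        raise IOError(f"ETag mismatch: {actual} != {expected}")
```

As the first bullet notes, MD5 is not collision-resistant, so this detects corruption but not deliberately colliding inputs.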

@DailyDreaming DailyDreaming requested a review from adamnovak March 31, 2025 19:23
@DailyDreaming DailyDreaming marked this pull request as ready for review April 1, 2025 17:18
@adamnovak (Member) left a comment

I think this is way better!

I did notice that the "existing" key getter function was used in a couple places where the key won't actually necessarily exist, which I don't think will work.

And I don't think we want to comment out the contents of docstrings.

But those are probably the only things I think really need to be changed.

@@ -110,7 +110,7 @@ class NoSuchFileException(Exception):
     """Indicates that the specified file does not exist."""

     def __init__(
-        self, jobStoreFileID: FileID, customName: Optional[str] = None, *extra: Any
+        self, jobStoreFileID: Union[FileID, str], customName: Optional[str] = None, *extra: Any
Member left a comment:

Is there a good reason to say this can be a string now? Is it too hard to drag the typed file ID object through in the new implementation for some good reason?

When this is a string, is this meant to be the string-packed version of the file ID? Or just the ID part without e.g. the size packed in?

Member left a comment:

I guess some of the user-facing file ID API still accepts file IDs typed as strings, so maybe we do still need to be able to take them here.
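For context on the distinction the question draws, here is a hypothetical sketch (not Toil's actual `FileID` class, which has more fields) of a typed file ID versus its string-packed form, where the packed form carries extra metadata such as the size:

```python
class FileID(str):
    """A file ID that is a str subclass carrying an extra size attribute.

    Hypothetical illustration only: the bare string is the ID part, while
    pack()/unpack() round-trip the size through a string representation.
    """

    def __new__(cls, file_store_id: str, size: int) -> "FileID":
        return super().__new__(cls, file_store_id)

    def __init__(self, file_store_id: str, size: int) -> None:
        self.size = size

    def pack(self) -> str:
        """String-packed form: the ID with the size packed in."""
        return f"{self.size}:{str(self)}"

    @classmethod
    def unpack(cls, packed: str) -> "FileID":
        """Recover a typed FileID from its string-packed form."""
        size, _, file_store_id = packed.partition(":")
        return cls(file_store_id, int(size))
```

An API that accepts `Union[FileID, str]` has to decide which of these two string forms a bare `str` argument is meant to be, which is exactly the ambiguity raised above.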

@@ -1502,6 +1502,7 @@ def read_file_stream(
     ) -> ContextManager[IO[str]]: ...

     @abstractmethod
+    @contextmanager  # type: ignore
Member left a comment:

Don't we really only need/want @contextmanager on the implementation? It doesn't really help on the unimplemented stub, and it apparently means we have to break out of typing.
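A minimal sketch of that suggestion (hypothetical class and method names): leave the abstract stub undecorated, typed as returning a context manager, and apply @contextmanager only on the concrete generator implementation, so no `# type: ignore` is needed:

```python
from abc import ABC, abstractmethod
from contextlib import contextmanager
from typing import IO, ContextManager, Iterator

class AbstractStore(ABC):
    @abstractmethod
    def read_file_stream(self, path: str) -> ContextManager[IO[bytes]]:
        """Return a context manager yielding a readable stream for the file."""
        ...

class LocalStore(AbstractStore):
    @contextmanager  # only the implementation needs the decorator
    def read_file_stream(self, path: str) -> Iterator[IO[bytes]]:
        with open(path, "rb") as stream:
            yield stream
```

The decorated implementation still satisfies the `ContextManager[...]` return type of the stub, since `@contextmanager` turns the generator into one.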

Comment on lines +108 to +113
# 4. Shared Files: These are a small set of special files. Most are needed by all jobs:
# * environment.pickle (environment variables)
# * config.pickle (user options)
# * pid.log (process ID of the workflow; when it finishes, the workflow either succeeded/failed)
# * userScript (hot deployment; this is the job module)
# * rootJobReturnValue (workflow succeeded or not)
Member left a comment:

This isn't quite right; Python workflows have access to user-defined file names in here, which we just hope don't collide with any that Toil uses internally.

# 1. AWS s3 has strong consistency.
# 2. s3's filter/query speed is pretty good.
# However, there may be reasons in the future to provide users with a database:
# * s3 throttling has limits (3,500/5,000 requests; something like dynamodb supports 100,000+ requests).
Member left a comment:

Are these per second?

# WARNING: Etag values differ for the same file when the part size changes, so part size should always
# be Set In Stone, unless we hit s3's 10,000 part limit, and we need to account for that.
#
# - This class inherits self.config only when initialized/restarted and is None upon class instantiation. These
Member left a comment:

I don't think "inherits" is the right word; that makes it sound like it is coming from a base class but only sometimes.
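The part-size warning quoted above can be made concrete: for multipart uploads, S3's ETag is the MD5 of the concatenated binary part MD5s, suffixed with `-<part count>`, so the same bytes produce different ETags at different part sizes. A sketch of that computation (pure hashlib, no AWS calls):

```python
import hashlib

def multipart_etag(data: bytes, part_size: int) -> str:
    """Compute the ETag S3 assigns to a multipart upload of `data`.

    The ETag is md5(concat(md5(part_i) for each part)), hex-encoded,
    followed by "-" and the number of parts.
    """
    part_digests = [
        hashlib.md5(data[i:i + part_size]).digest()
        for i in range(0, len(data), part_size)
    ]
    combined = hashlib.md5(b"".join(part_digests)).hexdigest()
    return f"{combined}-{len(part_digests)}"
```

Because the part boundaries feed into the hash, ETag comparisons are only meaningful when the part size stays fixed, which is the reason it should be "Set In Stone".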

Comment on lines +553 to +554
# content_type = response['ContentType'] # e.g. "binary/octet-stream"
# etag = response['ETag'].strip('\"') # e.g. "\"586af4cbd7416e6aefd35ccef9cbd7c8\""
Member left a comment:

Probably these can just be cut.

Suggested change
-# content_type = response['ContentType'] # e.g. "binary/octet-stream"
-# etag = response['ETag'].strip('\"') # e.g. "\"586af4cbd7416e6aefd35ccef9cbd7c8\""

)
# TODO: verify etag after copying here?

# cannot determine exec bit from foreign s3 so default to False
Member left a comment:

This comment needs to go up where the 0 is now.

RuntimeError: Hello, world!
>>> y = os.dup(0); os.close(y); x == y
True
# An object-oriented wrapper for os.pipe. Clients should subclass it, implement
Member left a comment:

I don't think it makes sense to try to have a big comment inside a docstring like this.

os.close(self.readable_fh)
except OSError as e:
# OSError: [Errno 9] Bad file descriptor implies this file handle is already closed
if not e.errno == 9:
Member left a comment:

This should be errno.EBADF and not a magic 9.

Suggested change
-if not e.errno == 9:
+if not e.errno == errno.EBADF:
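The suggested pattern in full, as a sketch (assuming, as in the quoted code, that a second close of an already-closed descriptor should be silently ignored):

```python
import errno
import os

def close_quietly(fd: int) -> None:
    """Close a file descriptor, tolerating one that is already closed."""
    try:
        os.close(fd)
    except OSError as e:
        # EBADF means the descriptor was already closed;
        # any other OSError is a real problem and should propagate.
        if e.errno != errno.EBADF:
            raise
```

Using `errno.EBADF` instead of a magic `9` keeps the intent readable and portable.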

@@ -482,7 +482,7 @@ def needs_aws_batch(test_item: MT) -> MT:
         test_item
     )
     test_item = needs_env_var(
-        "TOIL_AWS_BATCH_JOB_ROLE_ARN", "an IAM role ARN that grants S3 and SDB access"
+        "TOIL_AWS_BATCH_JOB_ROLE_ARN", "an IAM role ARN that grants S3 access"
Member left a comment:

Should we also revise the role permissions in the provisioner? Or make a new issue for that?

This was referenced Apr 18, 2025