-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[New Hub] BICAN (MIT Brain) #3827
Comments
Completed Phase 1. New AWS account exists. Quota increases were automatically sent from the request template, though I think our templates ask for less quota than we are given by default judging by this response in freshdesk https://2i2c.freshdesk.com/a/tickets/1434 |
Thanks for pointing out the quota setup, @sgibson91. I've handled that in #3780 (comment). I'll amend our documentation now to match. |
I've also opened #3834 to cross-link the GPU work at cluster creation time, so that's set up easily. |
Now that #3834 (comment) is merged, I've removed the explicit pointer to the GPU docs from the issue directly. |
#3836 also clarifies the current situation with quotas. |
A process note (more documentation): If we want each phase to be self-contained and actionable by separate engineers if necessary, then I think the following section of the new cluster docs should be moved into the the new hub docs, as I am not creating any hub files at this time. The same may be said for the two sections that follow:
I think these are all here because they're AWS-specific and it was easier at the time they were written, but now we could use synced panels to show/hide cloud-vendor-specific info at the appropriate times, like we do on the GCP/Azure cluster setup docs |
Phase 2 now complete and ready for review in PR #3840 |
I opened #3839 with some docs updates as I went through, and addressed link (3) from #3827 (comment). Links (1) and (2) I didn't really know where to move them too right now, especially considering the new hub docs are not as "run this command, then do this thing" based as the new cluster docs are. Perhaps the engineer who completes Phase 3 will have a better inclination of where in the docs those sections should live. |
Thanks @sgibson91! I'll try to incorporate those changes into various places. |
Earlier was just defaulting the first profile item to lab. I'll add this to the spec on the image. Ref 2i2c-org#3827 Ref 2i2c-org#3824
Copied over from https://github.com/2i2c-org/meta/issues/913
Process Note
I'm using this as a way to try to rejig our new hub request process. See https://github.com/2i2c-org/meta/issues/897 (particularly https://github.com/2i2c-org/meta/issues/897#issuecomment-2010984904) for more information.
https://miro.com/app/board/uXjVNjUP3iQ=/, describes the various 'phases' of new hub turn-up. Each phase will be marked as "READY" or "NOT READY" when all information needed for it is available. Each section should also link to an appropriate runbook.
There will be customizations after this is all set up, but this is pathway towards a standardized hub turn up.
Phase 1: Account setup (READY)
This is applicable for cases where this is a dedicated cluster. The following table lists the information before this phase can start.
bican
Appropriate runbook: https://infrastructure.2i2c.org/hub-deployment-guide/cloud-accounts/new-aws-account/
Phase 2: Cluster setup (READY)
This assumes all engineers have access to this new account, and will be able to set up the cluster + support, without any new hubs being set up.
Appropriate runbooks:
Phase 3 : Hub setup (READY)
There's going to be a number of hubs, and this starts specifying them.
Hub 1: Staging
Phase 3.1: Initial setup
Phase 3.2: Authentication
@kabilar, @aaronkanzer, @asmacdo, @satra, @djarecka
Phase 3.3: Object storage access
Phase 3.4: Profile List
This was derived from looking at https://github.com/dandi/dandi-hub/blob/dandi/config.yaml.j2#L138-L210 and adopting to match our standards.
Environments
Resource Allocations
CPU
Generated by
deployer generate resource-allocation choices r5.xlarge --num-allocations 4
GPU
Manually set up, but should be autogenerated
Hub 2: BICAN hub
The same as staging, just different name (bican).
The text was updated successfully, but these errors were encountered: