-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[New Hub] LINC (MIT Brain) #3828
Comments
@consideRatio I believe this is now complete and you should be able to set up the hub now. I notice in #3854 it's set up as a daskhub - note that it should instead be set up as a base hub. As you go through these, if you find you're having to make choices that make use of information not present in this issue, please point it out so I can make sure to incorporate that into the process. Thanks. |
Due to the fact that they want more CPUs in their GPU nodes, we need to set up |
This made me think that you requested that instead of naming the hub Looking at how you set things up for bican, I'm assuming the config name should be |
@yuvipanda when filling in funded_by, I don't know what to write. On the lincbrain website I see this, but it doesn't mean this 2i2c hub should be considered funded by them also. For now, leaving it blank: funded_by:
name: ""
url: "" |
I'll set this up to provide only the admin users access for now, not enabling |
staging and prod cluster's We got three different choices on combining LINC / DANDI / BICAN with MIT, and with (prod) or no (prod).
|
I obseved also a discrepancy on how we configure |
Default to |
Starting up a GPU server (don't remember what image) took sometime between 9-10 minutes, and the startup timeout is 10 minutes. I've increased the timeout to 15 minutes to provide some margin of error for bican/dandi/linc for now. EDIT: Fixed in PR |
Thanks for the feedback, @consideRatio. I'll incorporate them into the process. |
I think this was completed and then we decomissioned it also - closing. |
Copied over from https://github.com/2i2c-org/meta/issues/913
Process Note
I'm using this as a way to try to rejig our new hub request process. See https://github.com/2i2c-org/meta/issues/897 (particularly https://github.com/2i2c-org/meta/issues/897#issuecomment-2010984904) for more information.
https://miro.com/app/board/uXjVNjUP3iQ=/, describes the various 'phases' of new hub turn-up. Each phase will be marked as "READY" or "NOT READY" when all information needed for it is available. Each section should also link to an appropriate runbook.
There will be customizations after this is all set up, but this is pathway towards a standardized hub turn up.
Phase 1: Account setup (READY)
This is applicable for cases where this is a dedicated cluster. The following table lists the information before this phase can start.
linc
Appropriate runbook: https://infrastructure.2i2c.org/hub-deployment-guide/cloud-accounts/new-aws-account/
Phase 2: Cluster setup (READY)
This assumes all engineers have access to this new account, and will be able to set up the cluster + support, without any new hubs being set up.
Appropriate runbooks:
Phase 3 : Hub setup (READY)
There's going to be a number of hubs, and this starts specifying them.
Hub 1: Staging
Phase 3.1: Initial setup
Phase 3.2: Authentication
@kabilar, @aaronkanzer, @asmacdo, @satra
Phase 3.3: Object storage access
Phase 3.4: Profile List
This was derived from looking at https://github.com/dandi/dandi-hub/blob/dandi/config.yaml.j2#L138-L210 and adopting to match our standards.
Environments
Resource Allocations
CPU
Generated by
deployer generate resource-allocation choices r5.xlarge --num-allocations 4
GPU
Manually set up, but should be autogenerated
Hub 2: LINC hub
The same as staging, just different name (linc).
The text was updated successfully, but these errors were encountered: