Suggestion: in flux jobs, show jobs with nodes still in housekeeping #6248
Comments
One thing we talked about when housekeeping was being designed was representing housekeeping as a separate system job running as the flux user. I started to implement that but found it was more challenging than expected, and we needed something to staunch the bleeding on el cap. Maybe reviving that could address this issue without introducing unnecessary coupling between the original job and housekeeping.
I haven't looked into how difficult this would be, but maybe... I can't decide which is the "right" approach here, though. Representing the housekeeping workload as a job is an attractive option, but since it isn't a job, there would be so many "job" things that won't work that it almost seems like it could cause more trouble down the road...
Hmm yeah, that is partly what made the job idea hard. Going all the way and making it a real job with all the trimmings seems like overkill.
It is trivial to add a new
note that the
Also, for some reason,
At this point, I'm not sure if the above is helpful or not. It is easy to include it, and maybe put neither
I'm open to any feedback here.
There's a corner case we keep running into (which I guess makes it a very wide corner):
flux resource status shows N nodes available
flux jobs -A shows no running jobs

I realize this is covered by also checking flux resource list for nodes marked as allocated, so the change would be more of a QoL improvement for users and admins than anything else. But since it's my life we're talking about, naturally I'm all for it.

Possible implementation - new job state between CLEANUP and INACTIVE.
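
To make the "new job state" idea a bit more concrete, here is a rough Python sketch of how a listing might treat such a state. This is only an illustration: the HOUSEKEEPING state, the numeric values, and the helper active_jobs() are all hypothetical and are not part of Flux. Only the existing state names (NEW, DEPEND, PRIORITY, SCHED, RUN, CLEANUP, INACTIVE) come from Flux itself.

```python
from enum import IntEnum


class JobState(IntEnum):
    """Job states in lifecycle order.

    The numeric values only encode ordering for this sketch; they are
    not the values Flux uses internally.
    """
    NEW = 1
    DEPEND = 2
    PRIORITY = 3
    SCHED = 4
    RUN = 5
    CLEANUP = 6
    HOUSEKEEPING = 7  # hypothetical state proposed in this issue
    INACTIVE = 8


def active_jobs(jobs):
    """Return the ids of jobs that have not yet reached INACTIVE.

    `jobs` maps a job id to its JobState. Under the proposal, a job
    whose nodes are still in housekeeping would stay in this list.
    """
    return [jobid for jobid, state in jobs.items()
            if state < JobState.INACTIVE]


# Example: job "f2" has finished cleanup, but its nodes are not free yet.
example = {
    "f1": JobState.RUN,
    "f2": JobState.HOUSEKEEPING,
    "f3": JobState.INACTIVE,
}
print(active_jobs(example))
```

With a state like this, a job whose nodes are still busy would keep showing up in a listing of active jobs instead of silently disappearing, which is the QoL improvement being asked for. Whether that is worth the extra coupling between the job and housekeeping is a separate question.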