Add a job in SM to collect Scylla Doctor from whole cluster #4259
Comments
@karol-kokoszka can you triage?
@tarzanek Scylla Manager does not SSH into the nodes; it calls the Agent's API, which is why I'm asking how the job would be executed. Do you want to merge it with the Scylla Manager task scheduler?
It's a CLI command that needs to be executed as root on the target hosts ( https://github.com/scylladb/scylla-doctor/tree/master/scylla-doctor#usage ). Its results will be in a vitals file that will then need to be downloaded to SM.
It means that we would need to call the agent to execute the CLI and collect the output.
Assuming the doctor is executed through an API call to the agent, that's not a problem, as the output will just be in the payload. @tarzanek could Scylla Doctor be imported into the agent codebase somehow? As a Go dependency, for example? How do you see this job being scheduled? As part of the task scheduler in Manager (the same one we use for repair and backup)? Does it need to be scheduled at all, or is it rather an "ad-hoc" job?
SM and its agents can be leveraged to collect, from a central place, the state of all nodes using Scylla Doctor ( https://github.com/scylladb/scylla-doctor ).
This will help support quickly and properly help customers verify their clusters and their health, and possibly spot config drift.
Can a job be added to run SD on all nodes and collect its outputs?
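To make the ask a bit more concrete, here is a rough sketch of what an ad-hoc "run SD everywhere and collect the output" job could look like. It is only an illustration: `run_doctor` is a hypothetical stand-in for whatever call would make the agent execute Scylla Doctor and return the vitals file contents (no such agent endpoint is described in this issue), and `fake_run_doctor`, `HostResult` and `collect_from_cluster` are made-up names, not part of Scylla Manager or Scylla Doctor.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class HostResult:
    host: str
    vitals: Optional[bytes] = None  # contents of the vitals file, if collected
    error: Optional[str] = None     # error message if collection failed


def collect_from_cluster(
    hosts: List[str],
    run_doctor: Callable[[str], bytes],
    max_parallel: int = 4,
) -> Dict[str, HostResult]:
    """Run `run_doctor` against every host and gather per-host results.

    `run_doctor` is a hypothetical callable standing in for the agent call
    that would execute Scylla Doctor on one node and return its vitals file.
    A failure on one host is recorded and does not abort the other hosts.
    """
    def one(host: str) -> HostResult:
        try:
            return HostResult(host=host, vitals=run_doctor(host))
        except Exception as exc:
            return HostResult(host=host, error=str(exc))

    # Fan out over the hosts with a bounded number of parallel calls.
    with ThreadPoolExecutor(max_workers=max_parallel) as pool:
        results = list(pool.map(one, hosts))
    return {r.host: r for r in results}


def fake_run_doctor(host: str) -> bytes:
    # Stub for illustration only: pretends one node's agent is unreachable.
    if host == "10.0.0.3":
        raise RuntimeError("agent unreachable")
    return f'{{"host": "{host}"}}'.encode()


if __name__ == "__main__":
    report = collect_from_cluster(
        ["10.0.0.1", "10.0.0.2", "10.0.0.3"], fake_run_doctor
    )
    for host, result in report.items():
        print(host, "ok" if result.error is None else f"failed: {result.error}")
```

Recording failures per host instead of failing the whole job seems useful here, since support would probably still want the vitals from the nodes that did respond. Whether this would run through the task scheduler or as a one-off command is the open question above.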