Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a job in SM to collect Scylla Doctor from whole cluster #4259

Open
tarzanek opened this issue Feb 18, 2025 · 4 comments
Open

Add a job in SM to collect Scylla Doctor from whole cluster #4259

tarzanek opened this issue Feb 18, 2025 · 4 comments

Comments

@tarzanek
Copy link

SM and its agents can be leveraged to collect from central place the state of all nodes using
Scylla Doctor ( https://github.com/scylladb/scylla-doctor )

this will help support to properly and quickly help customers to verify their clusters and their health and ev. config drifts

Can a job be added to run SD on all nodes and collect its outputs?

@tarzanek
Copy link
Author

@karol-kokoszka can you triage?
we can certainly add knowledge how to run SD or internal ways how this is gathered for Scylla Cloud or others

@karol-kokoszka
Copy link
Collaborator

@tarzanek
How to call Scylla-Doctor ?
Is it CLI that needs to be executed on the hosts or it has some API ?
Or maybe it can be called from any server (let's say manager server VM) ?

Scylla Manager is not SSHing, it's calling Agent's API, that why I'm asking about the way to execute the job.

You want to merge it with the Scylla Manager task scheduler ?

@tarzanek
Copy link
Author

tarzanek commented Feb 18, 2025

it's cli command that needs to be executed on target hosts ( https://github.com/scylladb/scylla-doctor/tree/master/scylla-doctor#usage ) as root

it results will be in a vitals file that will need a download to SM

@karol-kokoszka
Copy link
Collaborator

it's cli command that needs to be executed on target hosts

It means that we would need to call agent to execute the CLI and collect the output.

it results will be in a vitals file that will need a download to SM

Assuming the doctor is executed through API call to agent, it's not a problem, as it will be just in the payload.

@tarzanek could the scylla doctor be imported to the agent codebase somehow ? Through the golang dependency for example ?
UPDATE: It's python, it couldn't.

How do you see scheduling this job ? Part of the task scheduler in manager (the same as we use for repair, backup) ? Is it needed to be scheduled ? Or it's rather "ad-hoc" job ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants