-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SCT for backup #8752
Comments
I think this is on @mikliapko plate as part of SM tests. |
I updated the description, does it make more sense now? |
@mikliapko ^^ |
2024 Q4 is planned to finish with better backup and restore capabilities.
@kreuzerkrieg is assigned to this and will start shortly working on the details.
Yes. I think the focus should be on a large, full cluster backups. And we should measure heavy user loads during the backup. |
@regevran - can you please add more details regarding how @mikliapko should assist? |
@kreuzerkrieg is collecting information including the requested support 😸. |
@mikliapko I have a couple of questions.
We need these three to compare the current state of the system with the one we will have in the nearest future - scylla based backup, so we need these numbers. If we dont have all or any of above, what would it take to create these metrics? CC: @rayakurl |
Yes, recently we introduced two jobs:
The purpose of these jobs is to check restore performance build by build.
No, we don't have such benchmarks. At least I'm not aware of any of them.
Nope.
Actually, it's hard to provide you any more or less exact estimations without digging deeper. |
@mikliapko as of urgency, good question, @regevran what do you say? what is the expectation? |
We plan to start the new approach in the beginning of November 2024. It would be great if the tests are ready then with current implementation already measured. |
I see, alright, we will talk with @rayakurl and prioritize it accordingly. |
@karol-kokoszka @Michal-Leszczynski Guys, I want to hear your opinion about these tests value for Manager in general to not implement something that would be used only one time. I suppose they might be pretty useful. |
@rayakurl who can assist (guide) with this task? |
@kreuzerkrieg @mikliapko can assist |
Also please make sure @cezarmoise is in the loop too. |
I would say that the first benchmark is more interesting and important, as backups run all the time and cluster should work fine when they are running. The second one is less important, as for now the main objective is to make full cluster restore as fast as possible - it assumes that cluster does not handle any user related traffic. Optimizing restore on a running cluster is left as a future effort for now. |
We are not done with this issue, as we still couldn't create a good baseline for measures. |
Measure backup process performance - how long does it take to backup.
Measure user-queries during backup - what is the impact of backup on user experience.
Measures inside Scylla.
SM tests may or may not be combined with the tests in this issue.
The text was updated successfully, but these errors were encountered: