Test and optimize IO performance when automatically generating BMI init configs #654

Closed
robertbartel opened this issue Jun 11, 2024 · 1 comment
Labels
enhancement (New feature or request), maas (MaaS Workstream)

Comments

@robertbartel
Contributor

BMI init config auto-generation was integrated in #607, but no practical performance testing was conducted. Given #637 and the fact that DMOD currently implements only an object store dataset backing, the current implementation may have practical issues; e.g., it may produce configs perfectly correctly but take an impractical or excessive amount of time to complete (compared to the job needing the configs).

First, the running time needs to be analyzed in various scenarios, given the current implementation and a practical, off-the-shelf hardware configuration (i.e., at most, a small cluster of desktop-level machines). Depending on the results, the implementation should be adjusted to optimize it for current dataset capabilities. Where possible, this should be done in a way that lends itself well to future dataset backings (i.e., #593), which may or may not have the same IO performance characteristics and thus may need (or benefit from) certain differences in the implementation.
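As a rough starting point for that analysis, a small timing helper along these lines could be wrapped around whatever generation routine is under test. This is only a sketch: `time_call` and its return format are hypothetical names for illustration, not part of DMOD.

```python
import statistics
import time


def time_call(func, *args, repeats=3, **kwargs):
    """Run func several times and report wall-clock durations in seconds.

    Hypothetical helper: func stands in for the config generation
    routine being measured in a given scenario.
    """
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        func(*args, **kwargs)
        durations.append(time.perf_counter() - start)
    return {
        "min": min(durations),
        "mean": statistics.mean(durations),
        "runs": repeats,
    }


if __name__ == "__main__":
    # Placeholder workload; swap in the real generation call.
    print(time_call(sum, range(1_000_000), repeats=5))
```

Comparing the reported times against the runtime of the job that consumes the configs would show whether generation is a practical bottleneck.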

@aaraney
Member

aaraney commented Jun 14, 2024

I added support for writing config files to various archive formats in this PR. Here is an example of writing config files on the fly to a gzipped archive file. Compression is not required (and of course slows things down). I would be interested to see the performance of writing to just a tar archive.
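For anyone wanting to try the plain tar versus gzipped tar comparison, here is a minimal sketch using only the standard library `tarfile` module. It is not the code from the PR; `write_configs_to_tar`, `fake_configs`, and the file naming are assumptions made up for illustration.

```python
import io
import tarfile
import time


def write_configs_to_tar(configs, fileobj, compress=False):
    """Stream (name, text) pairs into a tar archive without temp files.

    Hypothetical helper: configs is any iterable of (file name, contents).
    Returns the number of members written.
    """
    mode = "w:gz" if compress else "w"
    count = 0
    with tarfile.open(fileobj=fileobj, mode=mode) as archive:
        for name, text in configs:
            payload = text.encode("utf-8")
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)  # size must be set before addfile
            info.mtime = int(time.time())
            archive.addfile(info, io.BytesIO(payload))
            count += 1
    return count


def fake_configs(n):
    """Yield placeholder configs; real ones would come from the generator."""
    for i in range(n):
        yield f"cat-{i}_bmi_config.ini", f"catchment_id=cat-{i}\nplaceholder=1\n"


if __name__ == "__main__":
    for compress in (False, True):
        buffer = io.BytesIO()
        start = time.perf_counter()
        written = write_configs_to_tar(fake_configs(10_000), buffer, compress)
        elapsed = time.perf_counter() - start
        print(f"compress={compress}: {written} files in {elapsed:.3f}s")
```

Writing to an in-memory buffer isolates archive overhead from storage IO; pointing `fileobj` at a real file or object store stream would bring the backing's IO cost into the measurement.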

This issue will also be useful for conducting benchmarks. It shows an approach, albeit a naive one, to generating config files concurrently.
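To give a sense of what a naive concurrent approach might look like for benchmarking purposes, here is a sketch using a thread pool. It is not the code from the linked issue; `generate_config` is a hypothetical stand-in for the real per-catchment generation call.

```python
from concurrent.futures import ThreadPoolExecutor


def generate_config(catchment_id):
    """Hypothetical stand-in for generating one catchment's config."""
    return f"{catchment_id}_bmi_config.ini", f"catchment_id={catchment_id}\n"


def generate_all(catchment_ids, max_workers=4):
    """Generate configs concurrently, preserving the input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() yields results in the same order as catchment_ids.
        return list(pool.map(generate_config, catchment_ids))


if __name__ == "__main__":
    for name, _contents in generate_all([f"cat-{i}" for i in range(8)]):
        print(name)
```

Threads tend to help mostly when generation is IO-bound (e.g., reading from an object store); for CPU-bound generation a process pool may be worth benchmarking instead.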
