Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

On configuration search, and cost #128

Open
umayrh opened this issue Dec 27, 2018 · 0 comments
Open

On configuration search, and cost #128

umayrh opened this issue Dec 27, 2018 · 0 comments

Comments

@umayrh
Copy link

umayrh commented Dec 27, 2018

I've been working on using OpenTuner to tune Spark applications (https://github.com/umayrh/sketchy-polytopes/tree/master/python/sparktuner), which are notoriously hard to tune given the large number of configuration parameters, and dependence on data, program logic, and infrastructure.

I came across issue #82, which describes why exhaustive search space isn't yet implemented in OpenTuner. Spark applications, like other large data processing programs, can take a relatively long time (say, 0.5-1hr) to run and consume significant amount of cloud memory/CPU resources. So the cost of running such programs over a large configuration search space might be prohibitive. At the same time, it's possible to limit this search space based on past experience. For such cases, perhaps the total 'cost' of the search space, instead of its size, is a better measure. Any thoughts on how we can generalize the notion of search space so we focus on being limited by prohibitively expensive search instead of exhaustive search?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant