bayesian-optimizer #602
Conversation
Codecov Report
❌ Your patch check has failed because the patch coverage (79.04%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.
... and 1 file with indirect coverage changes
Hi @spline2hg, thanks for the great PR.
The PR is quite good already but I have made many small comments about the code style.
I'm happy to talk if there are questions.
- **verbose** (int): Verbosity level from 0 (silent) to 3 (most verbose). Default is 2.
- **kappa** (float): Parameter to balance exploration vs exploitation in UCB. Higher values mean more exploration.
We need to mention more explicitly that this parameter is only used if the ucb acquisition function is used and the user did not pass a configured instance of an AcquisitionFunction object. Same for xi below.
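The conditional use of kappa and xi the comment describes could be sketched roughly as follows. This is illustrative Python only: `resolve_acquisition` and the returned name/parameter tuples are stand-ins for configured AcquisitionFunction objects, not the PR's actual code or the bayesian-optimization API.

```python
def resolve_acquisition(name, kappa=2.576, xi=0.01):
    # kappa is only consulted on the upper_confidence_bound path;
    # xi only on the expected_improvement / probability_of_improvement
    # paths. A user-supplied AcquisitionFunction instance would bypass
    # this dispatch entirely (not shown).
    if name in ("ucb", "upper_confidence_bound"):
        return ("upper_confidence_bound", {"kappa": kappa})
    if name in ("ei", "expected_improvement"):
        return ("expected_improvement", {"xi": xi})
    if name in ("poi", "probability_of_improvement"):
        return ("probability_of_improvement", {"xi": xi})
    raise ValueError(f"unknown acquisition function: {name!r}")
```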
Also, we should not use all-uppercase notation for UCB and should not use the abbreviation. Documentation can be verbose as long as it is precise. I.e., it would be better to write: "Parameter to balance the exploration vs. exploitation trade-off if the upper_confidence_bound acquisition function is used."
- "ucb" or "upper_confidence_bound": Upper Confidence Bound
- "ei" or "expected_improvement": Expected Improvement
- "poi" or "probability_of_improvement": Probability of Improvement
Default is None (uses package default).
What is the package default?
UpperConfidenceBound if no constraints are passed; otherwise ExpectedImprovement.
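The fallback described in this reply could be written as a one-line selection. This is a sketch: `default_acquisition` is a hypothetical name, and the strings stand in for the configured acquisition objects.

```python
def default_acquisition(has_constraints):
    # Per the reply above: UpperConfidenceBound unless constraints are
    # passed, in which case ExpectedImprovement is used instead.
    return "expected_improvement" if has_constraints else "upper_confidence_bound"
```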
- **init_points** (int): Number of random exploration points to evaluate before starting optimization. Default is 5.
- **n_iter** (int): Number of Bayesian optimization iterations to perform after the initial random exploration. Default is 25.
How were these numbers chosen? Are they just the defaults from the package? I would have expected that those should rather be adaptive (e.g. getting larger for higher dimensional problems) but it is ok to go with the package defaults. Just needs to be documented where the numbers come from.
Yes, these are the package defaults. Should we explicitly mention in the documentation that these values come from the package defaults? Also, to make them adaptive, how could we relate these parameters to the dimensionality of the problem?
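One way to relate the budgets to dimensionality, as asked here, would be a simple linear heuristic. This is purely an assumption for illustration: `adaptive_budget` and the scaling factors are neither the package's behavior nor the PR's code; the package defaults of 5 and 25 are kept as floors.

```python
def adaptive_budget(n_dims, init_points=None, n_iter=None):
    # Hypothetical heuristic: grow both budgets linearly with the
    # problem dimension, never dropping below the package defaults.
    if init_points is None:
        init_points = max(5, 2 * n_dims)
    if n_iter is None:
        n_iter = max(25, 10 * n_dims)
    return init_points, n_iter
```

Explicitly passed values would still win over the heuristic, so documented package defaults only apply when the user passes nothing.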
- **xi** (float): Parameter to balance exploration vs exploitation in EI and POI. Higher values mean more exploration. Default is 0.01.
- **exploration_decay** (float or None): Rate at which exploration decays over time.
Is any float valid for this or is there a specific range? What does the value mean? The same applies to other parameters like xi. Please go over all of them and try to be as precise as possible.
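The precision asked for here could also be enforced with explicit range checks. A sketch under stated assumptions: the (0, 1] range for `exploration_decay` assumes it is a per-iteration multiplicative factor, and the non-negativity of `kappa`/`xi` is likewise an assumption, not documented package behavior.

```python
def validate_params(kappa, xi, exploration_decay):
    # Assumed valid ranges; adjust if the package documents otherwise.
    if kappa < 0:
        raise ValueError("kappa must be a non-negative float")
    if xi < 0:
        raise ValueError("xi must be a non-negative float")
    if exploration_decay is not None and not (0 < exploration_decay <= 1):
        raise ValueError("exploration_decay must be in (0, 1] or None")
```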
optimizer.set_gp_params(alpha=self.alpha, n_restarts_optimizer=self.n_restarts)

# Use initial point as first probe
probe_params = {f"param{i}": float(val) for i, val in enumerate(x0)}
The logic for creating a parameter dictionary as expected in bayesian-optimization is repeated a few times (here, line 128 and line 116 at least). Even though it is just one line I would extract this into a helper function.
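The suggested helper could look like this. The name `to_param_dict` is illustrative; the body is the one-liner repeated in the diff.

```python
def to_param_dict(x):
    """Map a point (array-like of floats) to the named-parameter dict
    that bayesian-optimization expects for probe/register calls."""
    return {f"param{i}": float(val) for i, val in enumerate(x)}
```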
)

if optimizer.max is None:
    return InternalOptimizeResult(
Let's have one return statement at the end of _solve_internal_problem.
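The single-exit shape being requested might look like this skeleton; the string results are stand-ins for the two `InternalOptimizeResult(...)` constructions in the PR.

```python
def solve_sketch(optimizer_max):
    # Build whichever result applies, then fall through to one return.
    if optimizer_max is None:
        result = "failure-result"  # stand-in for the early-exit InternalOptimizeResult
    else:
        result = "success-result"  # stand-in for the normal InternalOptimizeResult
    return result  # single return statement at the end
```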
n_jac_evals=0,
)

def _get_acquisition_function(self) -> AcquisitionFunction | None:
I would prefer _get_acquisition_function to be a function, not a method of the BayesOpt class. The only thing it needs is self.acquisition_function, so this can be passed as the argument instead of self. And then we need unit tests to cover all the code paths of that function (including the error paths).
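The proposed refactor could be sketched as a module-level function plus tests over every path. Illustrative only: the real function would return a configured AcquisitionFunction object (or None for the package default) rather than a canonical name string, as here.

```python
def get_acquisition_function(acquisition_function):
    # None falls through to the package default (per the earlier comment:
    # UpperConfidenceBound, or ExpectedImprovement with constraints).
    if acquisition_function is None:
        return None
    aliases = {
        "ucb": "upper_confidence_bound",
        "ei": "expected_improvement",
        "poi": "probability_of_improvement",
    }
    name = aliases.get(acquisition_function, acquisition_function)
    if name not in set(aliases.values()):
        raise ValueError(f"invalid acquisition function: {acquisition_function!r}")
    return name

# Unit tests covering each code path, including the error path:
assert get_acquisition_function(None) is None
assert get_acquisition_function("ucb") == "upper_confidence_bound"
assert get_acquisition_function("expected_improvement") == "expected_improvement"
try:
    get_acquisition_function("not-a-real-name")
    raise AssertionError("expected ValueError")
except ValueError:
    pass
```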