docs: extending the documentation and docstrings
rapanti committed Nov 5, 2024
1 parent cacfda8 commit 64ffe6e
Showing 43 changed files with 549 additions and 242 deletions.
2 changes: 1 addition & 1 deletion .github/workflows/docs.yml
Original file line number Diff line number Diff line change
@@ -23,5 +23,5 @@ jobs:
path: .cache
- name: "Install dependencies"
run: python -m pip install mkdocs-material mkdocs-autorefs mkdocs-glightbox mkdocs-literate-nav mkdocstrings[python] mkdocs-gen-files mkdocs-awesome-pages-plugin typing-extensions more-itertools pillow cairosvg mike markdown-exec
- name: "Build Docs"
- name: "Build docs"
run: mkdocs gh-deploy --force
97 changes: 61 additions & 36 deletions README.md
@@ -1,69 +1,94 @@
# QuickTuneTool: A Framework for Efficient Model Selection and Hyperparameter Optimization
# Quick-Tune-Tool: A Framework for Efficient Model Selection and Hyperparameter Optimization

QuickTuneTool tackles the challenge of selecting the best pretrained model and fine-tuning hyperparameters for new datasets. It simplifies this process using a Combined Algorithm Selection and Hyperparameter Optimization (CASH) technique within a Bayesian optimization framework.
[![image](https://img.shields.io/pypi/l/quicktunetool.svg)](https://pypi.python.org/pypi/quicktunetool)
[![image](https://img.shields.io/pypi/pyversions/quicktunetool.svg)](https://pypi.python.org/pypi/quicktunetool)
[![Ruff](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/ruff/main/assets/badge/v2.json)](https://github.com/astral-sh/ruff)
[![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg)](CONTRIBUTING.md)

Quick-Tune-Tool is a streamlined framework designed to tackle model selection and hyperparameter tuning for pretrained models on new datasets. Built on a Combined Algorithm Selection and Hyperparameter Optimization (CASH) approach within a Bayesian optimization framework, it aims to identify the best-performing model and hyperparameter configuration quickly and efficiently.

### Table of Contents
- [Key Features](#key-features)
- [Getting Started](#getting-started)
- [Installation](#installation)
- [Install from Source](#install-from-source)
- [Basic Usage](#basic-usage)
- [Advanced Usage](#advanced-usage)
- [Usage](#usage)
- [References](#references)
- [Citations](#citations)

The approach relies on three key components:
1. **Gray-Box HPO**: Instead of fully training all models, QuickTuneTool leverages partial learning curves by running models for only a few initial epochs and then focusing on the most promising ones.
2. **Meta-Learning**: The tool draws insights from prior evaluations on related tasks to accelerate and improve the search for optimal models and hyperparameters.
3. **Cost-Awareness**: QuickTuneTool balances the trade-off between time and performance, ensuring an efficient search for the best configurations.
## Key Features
Quick-Tune-Tool leverages three main techniques to optimize model performance effectively:
1. **Gray-Box Hyperparameter Optimization (HPO)**: Instead of fully training every model, QuickTuneTool only trains each for a few initial epochs and focuses on the most promising models, significantly reducing computation time.
2. **Meta-Learning**: By using prior evaluations on related tasks, the tool accelerates the search process and refines model recommendations.
3. **Cost-Awareness**: QuickTuneTool balances time and performance, maintaining an efficient search for the best configurations.
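The gray-box idea above can be illustrated with a minimal successive-halving sketch (illustrative only — not QuickTuneTool's actual implementation): every candidate is trained for a few epochs, the weakest fraction is discarded, and the budget doubles for the survivors.

```python
def successive_halving(configs, evaluate, min_epochs=1, max_epochs=16, eta=2):
    """Train all configs briefly, keep the best 1/eta, repeat with more epochs."""
    epochs = min_epochs
    survivors = list(configs)
    while epochs <= max_epochs and len(survivors) > 1:
        scored = [(evaluate(c, epochs), c) for c in survivors]
        scored.sort(key=lambda t: t[0], reverse=True)   # higher score is better
        survivors = [c for _, c in scored[: max(1, len(scored) // eta)]]
        epochs *= eta
    return survivors[0]

# Toy stand-in for a partial training run: score grows with epochs and lr.
configs = [{"lr": 10 ** -i} for i in range(1, 6)]
best = successive_halving(configs, lambda c, e: c["lr"] * e)
# best == {"lr": 0.1}
```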

For more details, check out the paper *Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How* ([ICLR 2024](https://openreview.net/forum?id=tqh1zdXIra)).

**At the moment only *Image Classification* is implemented.**
***Note: Currently, only Image Classification is supported.***

## Getting Started

### Installation

### Install from source
To install QuickTuneTool (`QTT`), you can simply use `pip`:

```bash
git clone https://github.com/automl/QTT
pip install -e QTT # -e for editable mode
pip install quicktunetool
```

## Simple Usage
We provide a simple-to-use script.
#### Install from Source
To install QuickTuneTool directly from the source:

```python
from qtt import QuickCVCLSTuner
tuner = QuickCVCLSTuner("path/to/dataset")
tuner.run(fevals=100, time_budget=3600)
```
```bash
git clone https://github.com/automl/quicktunetool
pip install -e quicktunetool # Use -e for editable mode
```

For more code examples, take a look at the [notebooks](notebooks) folder.
## Usage
QuickTuneTool's interface makes it easy to get started. Here's a simple script to run QuickTuneTool on your dataset:

```python
from ConfigSpace import ConfigurationSpace, Constant
from qtt import QuickTuner, QuickOptimizer

cs = ConfigurationSpace({
    "cat": [False, True],    # Categorical
    "float": (0.1, 1.0),     # Uniform float
    "int": (1, 10),          # Uniform int
    "constant": Constant("constant", 42),
})

def fn(trial: dict, task_info: dict):
    config = trial["config"]
    fidelity = trial["fidelity"]
    ...
    # Training logic and checkpoint loading here
    score = ...  # float: 0 - 1
    cost = ...

    report = trial.copy()
    report["score"] = score
    report["cost"] = cost
    return report

opt = QuickOptimizer(cs, max_fidelity=100, cost_aware=True, ...)
tuner = QuickTuner(opt, fn)
tuner.run(fevals=100, time_budget=3600)
```
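To make the trial/report contract concrete, here is a self-contained toy objective following the same shape (the "training" is simulated, so no `qtt` import is needed; the scoring formula is purely illustrative):

```python
def fn(trial: dict, task_info: dict) -> dict:
    """Toy objective: 'score' grows with fidelity, 'cost' stands in for seconds."""
    config = trial["config"]
    fidelity = trial["fidelity"]          # e.g. number of epochs to train
    # Simulated training: higher fidelity -> higher (bounded) score.
    score = min(1.0, config.get("float", 0.5) * fidelity / 100)
    cost = 0.1 * fidelity                 # stand-in for wall-clock cost

    report = trial.copy()
    report["score"] = score
    report["cost"] = cost
    return report

report = fn({"config": {"float": 0.8}, "fidelity": 50}, task_info={})
# report["score"] == 0.4, report["cost"] == 5.0
```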

## Advanced Usage
For more code examples, explore the [examples](examples) folder.

Please check out our documentation for more:
### Usage Examples
For further customization options and advanced usage, please refer to our [documentation](docs).

## References

The concepts and methodologies of this project are discussed in the following workshop paper:

**Title**: *Quick-Tune-Tool: A Practical Tool and its User Guide for Automatically Finetuning Pretrained Models*

**Authors**: Ivo Rapant, Lennart Purucker, Fabio Ferreira, Sebastian Pineda Arango, Arlind Kadra, Josif Grabocka, Frank Hutter

**Conference**: AutoML 2024 Workshop

You can access the full paper and additional details on OpenReview [here](https://openreview.net/forum?id=d0Hapti3Uc).
The concepts and methodologies of QuickTuneTool are detailed in the following workshop paper:

- **Title**: *Quick-Tune-Tool: A Practical Tool and its User Guide for Automatically Fine-Tuning Pretrained Models*
- **Authors**: Ivo Rapant, Lennart Purucker, Fabio Ferreira, Sebastian Pineda Arango, Arlind Kadra, Josif Grabocka, Frank Hutter
- **Event**: AutoML 2024 Workshop
- **Availability**: The full paper is accessible on [OpenReview](https://openreview.net/forum?id=d0Hapti3Uc), where research details and methodology discussions can be found.

### Further References / Citations
### Citation
If you use QuickTuneTool in your research, please also cite the following paper:

This project builds on the following paper; please consider citing it as well:
```
@inproceedings{
arango2024quicktune,
79 changes: 77 additions & 2 deletions docs/index.md
@@ -1,3 +1,78 @@
Welcome to the Quick-Tune-Tool docs.
# Quick-Tune-Tool

For a quick-start, check out [examples](./examples/index.md) for copy-pastable snippets to start from.
**A Practical Tool and User Guide for Automatically Finetuning Pretrained Models**

> Quick-Tune-Tool is an automated solution designed to streamline the process of selecting and finetuning pretrained models across various machine learning domains. Built upon the Quick-Tune algorithm, this tool abstracts complex research-level code into a user-friendly framework, making model finetuning accessible and efficient for practitioners.
---


## Installation
```bash
pip install quicktunetool
# or
git clone https://github.com/automl/quicktunetool
pip install -e quicktunetool # Use -e for editable mode
```

---

## Usage

A simple example for using Quick-Tune-Tool with a pretrained optimizer for image classification:

```python
from qtt import QuickTuner, get_pretrained_optimizer
from qtt.finetune.cv.classification import finetune_script

# Load task information and meta-features
task_info, metafeat = extract_task_info_metafeat("path/to/dataset")

# Initialize the optimizer
optimizer = get_pretrained_optimizer("mtlbm/micro")
optimizer.setup(128, metafeat)

# Create QuickTuner instance and run
qt = QuickTuner(optimizer, finetune_script)
qt.run(task_info, time_budget=3600)
```

This code snippet demonstrates how to run QTT on an image dataset in just a few lines of code.

---

## Contributing

Contributions are welcome! Please follow these steps:

1. Fork the repository
2. Create a new branch (`git checkout -b feature/YourFeature`)
3. Commit your changes (`git commit -m 'Add your feature'`)
4. Push to the branch (`git push origin feature/YourFeature`)
5. Open a pull request

For any questions or suggestions, please contact the maintainers.

---

## Project Status

- ✅ Active development

---

## Support

- 📝 [Documentation](https://automl.github.io/quicktunetool/)
- 🐛 [Issue Tracker](https://github.com/automl/quicktunetool/issues)
- 💬 [Discussions](https://github.com/automl/quicktunetool/discussions)

---

## License

This project is licensed under the BSD License - see the LICENSE file for details.

---

Made with ❤️ by https://github.com/automl
62 changes: 61 additions & 1 deletion docs/reference/index.md
@@ -1 +1,61 @@
# Index
# Code References

This section provides references for the core code components of Quick-Tune-Tool, detailing the primary modules and classes that make up the tool's architecture. The code is organized into three main parts: **Optimizers**, **Predictors**, and **Tuners**.

---

## 1. Optimizers

The Optimizers module is responsible for suggesting configurations for evaluation, using various optimization strategies. Available optimizers include:

- **QuickTune Optimizer**
- File: `optimizers/quick.py`
- Implements the QuickTune algorithm, balancing multi-fidelity expected improvement with cost estimation to select configurations.

- **Random Search Optimizer**
- File: `optimizers/random.py`
- Provides a basic random search optimizer as a baseline for comparison with other optimization strategies.

---

## 2. Predictors

The Predictors module includes components that estimate model performance and finetuning costs, enabling efficient configuration selection.

- **Performance Predictor**
- File: `predictors/perf.py`
- Uses meta-learning to estimate the potential performance of a model configuration based on historical data and auxiliary task information.

- **Cost Predictor**
- File: `predictors/cost.py`
- Evaluates the computational cost associated with different finetuning configurations, helping to balance resource efficiency with optimization goals.

---

## 3. Tuners

The Tuners module coordinates the tuning process, managing environment setup, experiment flow, and result handling.

- **QuickTuner**
- File: `tuners/quick.py`
- Serves as the central class that manages the tuning process, integrating optimizers and predictors to manage iterative evaluations and updates.

- **CV-Classification**
- File: `tuners/cv_cls.py`
- A specialized tuner for image classification tasks, offering a reduced interface where users simply provide the path to the image dataset.

---

## Additional Resources

- **Objective Functions**
- Directory: `objective/`
- Functions used to evaluate configurations, returning performance metrics for each step.

- **Utility Scripts**
- Directory: `utils/`
- A collection of helper functions and utilities to support data processing, result logging, and other ancillary tasks.

---

Refer to each module's in-code documentation for further details on function arguments, usage examples, and dependencies.
24 changes: 24 additions & 0 deletions docs/reference/optimizers/index.md
@@ -0,0 +1,24 @@
# Overview

The `Optimizer` class serves as a base class within the Quick-Tune-Tool, providing low-level functionality. It is designed to support flexible configuration management and interact with tuners during the optimization process. Key aspects of the class include directory setup, model saving, and interfacing methods for requesting and reporting trial evaluations.

Here’s an overview of the [`Optimizer`][qtt.optimizers.optimizer] class:

#### Core Methods

- **`ask`**: Abstract method that must be implemented in subclasses. It requests a new configuration trial from the optimizer, returning it as a dictionary. Raises `NotImplementedError` if not overridden.

- **`tell`**: Accepts and processes a report (result) from a trial evaluation. This method allows the optimizer to record outcomes for each configuration and adjust future suggestions accordingly. Supports both single and multiple trial reports.

- **`ante`**: A placeholder method for pre-processing tasks to be performed before requesting a configuration trial (used by tuners). Can be overridden in subclasses for custom pre-processing.

- **`post`**: A placeholder for post-processing tasks, executed after a trial evaluation has been submitted. Can be overridden in subclasses for custom post-processing.

This class is intended to be extended for specific optimization strategies, with `ask` and `tell` as the primary methods for interaction with tuners.
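The `ask`/`tell` pattern described above can be sketched with a minimal stand-in (illustrative only — class and attribute names here are not qtt's actual API):

```python
import random

class Optimizer:
    """Minimal stand-in for the base-class contract described above."""
    def ask(self) -> dict:
        raise NotImplementedError
    def tell(self, report) -> None:
        raise NotImplementedError

class CoinFlipOptimizer(Optimizer):
    """Suggests a random boolean config; records every reported result."""
    def __init__(self):
        self.history = []
    def ask(self) -> dict:
        return {"config": {"flag": random.random() < 0.5}, "fidelity": 1}
    def tell(self, report) -> None:
        # Supports both a single report and a list of reports.
        for r in report if isinstance(report, list) else [report]:
            self.history.append(r)

opt = CoinFlipOptimizer()
trial = opt.ask()
opt.tell({**trial, "score": 0.7})
```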

---

### Available Optimizers

- [**`RandomOptimizer`**][qtt.optimizers.random]
- [**`QuickOptimizer`**][qtt.optimizers.quick]
30 changes: 30 additions & 0 deletions docs/reference/predictors/index.md
@@ -1 +1,31 @@
# Overview

The `Predictor` class serves as a base class for implementing predictive models within the Quick-Tune-Tool. It provides core functionality for model setup, data handling, training, and persistence (saving/loading), allowing specific predictive models to extend and customize these methods.

#### Core Methods

- **`fit`** and **`_fit`**:
- `fit`: Public method for training the model. It takes feature data `X`, target labels `y`, verbosity level, and any additional arguments.
- `_fit`: Abstract method where specific model training logic is implemented. Models inheriting from `Predictor` should override `_fit` to implement their own fitting procedures.

- **`preprocess`** and **`_preprocess`**:
- `preprocess`: Wrapper method that calls `_preprocess` to prepare data for fitting or prediction.
- `_preprocess`: Abstract method where data transformation logic should be added. Designed to clean and structure input data before model training or inference.

- **`load`** and **`save`**:
- `load`: Class method to load a saved model from disk, optionally resetting its path and logging the location.
- `save`: Saves the current model to disk in a specified path, providing persistence for trained models.

- **`predict`**:<br>
Abstract method for generating predictions on new data. Specific predictive models should implement this method based on their inference logic.

This `Predictor` class offers a foundation for different predictive models, providing essential methods for data handling, training, and saving/loading, with extensibility for custom implementations.
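The public/underscore split above is a template-method pattern; a toy sketch (illustrative only — not qtt's actual class) shows how a subclass plugs into it:

```python
import numpy as np

class Predictor:
    """Minimal stand-in for the base-class contract described above."""
    def fit(self, X, y, verbosity: int = 0, **kwargs):
        X, y = self._preprocess(X, y)
        return self._fit(X, y, **kwargs)
    def _preprocess(self, X, y):
        # Override to clean/structure input data; default just casts to arrays.
        return np.asarray(X, dtype=float), np.asarray(y, dtype=float)
    def _fit(self, X, y, **kwargs):
        raise NotImplementedError      # subclasses implement training here
    def predict(self, X):
        raise NotImplementedError      # subclasses implement inference here

class MeanPredictor(Predictor):
    """Trivial subclass: always predicts the mean of the training targets."""
    def _fit(self, X, y, **kwargs):
        self.mean_ = float(y.mean())
        return self
    def predict(self, X):
        return np.full(len(X), self.mean_)

model = MeanPredictor().fit([[1], [2], [3]], [0.2, 0.4, 0.6])
preds = model.predict([[4]])           # predicts ~0.4 for any input
```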

---

#### Available Predictors

- [**`PerfPredictor`**][qtt.predictors.perf]
Predicts the performance of a configuration on a new dataset.
- [**`CostPredictor`**][qtt.predictors.cost]
Predicts the cost of training a configuration on a new dataset.
3 changes: 3 additions & 0 deletions docs/reference/tuner/cv_cls_tuner.md
@@ -0,0 +1,3 @@
# CV-Classification-Tuner

::: qtt.tuner.cv_cls
20 changes: 19 additions & 1 deletion docs/reference/tuner/index.md
@@ -1 +1,19 @@
# Index
# Overview

The `QuickTuner` class is a high-level tuner designed to optimize a given objective function by managing iterative evaluations and coordinating with an `Optimizer`. It provides comprehensive functionality for logging, result tracking, checkpointing, and handling evaluation budgets.

#### Core Methods

- **`run`**: Executes the optimization process within a specified budget of function evaluations (`fevals`) or time (`time_budget`). This method iteratively:
- Requests new configurations from the optimizer.
- Evaluates configurations using the objective function `f`.
- Updates the optimizer with evaluation results and logs progress.
- Saves results based on the specified `save_freq` setting.

- **`save`** and **`load`**:
- `save`: Saves the current state of the tuner, including the incumbent, evaluation history, and tuner state.
- `load`: Loads a previously saved tuner state to resume optimization from where it left off.

#### Usage Example

The `QuickTuner` is typically used to optimize an objective function with the support of an optimizer, managing configuration sampling, evaluation, and tracking. It is particularly suited for iterative optimization tasks where tracking the best configuration and logging results are essential.
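The iterative loop `run` performs can be sketched as follows (a simplified stand-in, not `QuickTuner`'s actual implementation; the toy optimizer and objective are hypothetical):

```python
import itertools

class GridOptimizer:
    """Toy optimizer that cycles through a fixed list of learning rates."""
    def __init__(self, lrs):
        self._lrs = itertools.cycle(lrs)
    def ask(self):
        return {"config": {"lr": next(self._lrs)}}
    def tell(self, report):
        pass                             # a real optimizer updates its model here

def run(optimizer, f, fevals=10):
    """Sketch of the ask -> evaluate -> tell loop described above."""
    incumbent = None
    for _ in range(fevals):
        trial = optimizer.ask()          # request a new configuration
        report = f(trial)                # evaluate the objective function
        optimizer.tell(report)           # feed the result back
        if incumbent is None or report["score"] > incumbent["score"]:
            incumbent = report           # track the best result so far
    return incumbent

best = run(GridOptimizer([0.1, 0.01, 0.001]),
           lambda t: {**t, "score": t["config"]["lr"]}, fevals=6)
# best["config"]["lr"] == 0.1
```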
3 changes: 3 additions & 0 deletions docs/reference/tuner/quicktuner.md
@@ -0,0 +1,3 @@
# QuickTuner

::: qtt.tuner.quick
4 changes: 2 additions & 2 deletions examples/quicktuning.py
@@ -1,4 +1,4 @@
"""A quick example of using QuickCVCLSTuner to tune vision classifiers on a dataset."""
from qtt import QuickCVCLSTuner
tuner = QuickCVCLSTuner("path/to/dataset")
from qtt import QuickTuner_CVCLS
tuner = QuickTuner_CVCLS("path/to/dataset")
tuner.run(fevals=100, time_budget=3600)