
Support for regularization #131

Closed
hkirvesl opened this issue Sep 6, 2023 · 3 comments
Labels
enhancement New feature or request

Comments

@hkirvesl
Collaborator

hkirvesl commented Sep 6, 2023

Is your feature request related to a problem? Please describe.

When training models, the loss function with which the network weights are updated must be a function of the input data and the response variable, $L=L(X,y)$.

Often you would like to add regularization to the model: if there are many ways to achieve a similar training error, we would prefer a model that is, in some sense, as simple as possible.

In order to do so, the parameter update step must take into account some properties of the network.

Describe the solution you'd like

A straightforward way to address this would be to add the logic to the trainer class. This is consistent with the literature, where regularization for a model $M$ with inputs $X$ and response $y$ is commonly phrased as

$$L_{\textrm{total}}(X,y,M) = L_{\textrm{original\_loss}}(X,y) + \lambda \, L_{\textrm{regularization\_penalty}}(M).$$
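For concreteness, a minimal sketch of how the trainer could expose this; the `regularizer` callable and `lambda_reg` argument are hypothetical names for illustration, not existing giotto-deep API:

```python
# Minimal sketch, assuming the trainer accepts an optional `regularizer`
# callable mapping the model to a scalar penalty, weighted by `lambda_reg`.
import torch


def l1_penalty(model: torch.nn.Module) -> torch.Tensor:
    """An example penalty L_regularization_penalty(M): the L1 norm of all parameters."""
    return sum(p.abs().sum() for p in model.parameters())


def training_step(model, loss_fn, x, y, regularizer=None, lambda_reg=0.0):
    """Compute L_total(X, y, M) = L_original_loss(X, y) + lambda * penalty(M)."""
    loss = loss_fn(model(x), y)
    if regularizer is not None:
        loss = loss + lambda_reg * regularizer(model)
    return loss
```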

Describe alternatives you've considered

An alternative way to achieve this would be to have a wrapper that modifies the model $M$ and input data $X$ to include whatever factors are needed, i.e. $X^{*}=(X,M)$, and then use a custom loss $L^{*}(X^{*},y)$.

However, this approach does not generalize well (each problem would need its own wrapper) and may lead to unnecessary overhead (e.g. the input $X^{*}$ needs to contain $M$).
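For illustration, a rough sketch of this wrapper idea might look like the following; all names are hypothetical:

```python
# Hedged sketch of the wrapper alternative: the "input" X* bundles the data
# with the model so a custom loss L* can see both. Names are illustrative only.
import torch


def custom_loss(x_star, y, loss_fn, lambda_reg):
    """L*(X*, y) with X* = (X, M)."""
    x, model = x_star
    penalty = sum(p.pow(2).sum() for p in model.parameters())  # e.g. an L2 penalty
    return loss_fn(model(x), y) + lambda_reg * penalty
```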

@hkirvesl hkirvesl added the enhancement New feature or request label Sep 6, 2023
@hkirvesl hkirvesl mentioned this issue Sep 6, 2023
@raphaelreinauer
Collaborator

Would you like to incorporate a general regularization approach? If you are specifically interested in an L2 penalty, it can be used directly within the Adam optimizer.
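For that L2 special case, something like the following already works with plain PyTorch, since `torch.optim.Adam` accepts a `weight_decay` argument that applies an L2 penalty to the parameters:

```python
import torch

model = torch.nn.Linear(10, 1)  # placeholder model for illustration
# weight_decay applies an L2 penalty inside the optimizer step, so no change
# to the trainer's loss computation is needed for this case.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)
```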

@RaphaelLilt

@hkirvesl Can this be closed?

@hkirvesl
Collaborator Author

hkirvesl commented Apr 6, 2024

@RaphaelLilt Absolutely, this can indeed be closed; it is solved by #132.

@hkirvesl hkirvesl closed this as completed Apr 6, 2024