Add the FTRL (Follow The Regularized Leader) optimizer. #268

Open
wants to merge 1 commit into base: main

Conversation

copybara-service[bot] commented on May 29, 2025

Add the FTRL (Follow The Regularized Leader) optimizer.

This implementation is based on the FTRL algorithm of [McMahan et al., 2013](https://research.google.com/pubs/archive/41159.pdf).

Features / Params in FTRLOptimizerSpec (as used in the primitive; a hedged construction sketch follows this list):

  • learning_rate: The base learning rate.
  • learning_rate_power: Controls the per-coordinate learning rate decay (typically -0.5).
  • l1_regularization_strength: Applies L1 regularization, which can lead to sparsity in the model weights.
  • l2_regularization_strength: Applies L2 regularization.
  • beta: An additional smoothing term in the weight-update denominator.
  • clip_weight_min, clip_weight_max: Optional bounds for clipping the updated embedding weights.
  • weight_decay_factor: Factor for applying weight decay to the gradients.
  • multiply_weight_decay_factor_by_learning_rate: Boolean flag; if true, the weight_decay_factor is multiplied by the learning_rate before applying decay.
  • multiply_linear_by_learning_rate: Boolean flag; if true, the learning_rate is incorporated directly into the linear term update.
  • allow_zero_accumulator: Boolean flag; if true, allows the accumulator to be exactly zero. Otherwise, a small epsilon is added for numerical stability when the accumulator is zero.
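
For illustration, here is a hedged Python sketch of a spec object carrying the fields listed above. The class name, defaults, and dataclass form are assumptions for this example only; the actual FTRLOptimizerSpec in the library may be defined differently.

```python
# Hypothetical sketch: field names mirror the list above, but the class name,
# defaults, and dataclass layout are illustrative, not the library's actual API.
import dataclasses
from typing import Optional


@dataclasses.dataclass
class FTRLOptimizerSpecSketch:
    learning_rate: float = 0.001
    learning_rate_power: float = -0.5        # per-coordinate decay exponent
    l1_regularization_strength: float = 0.0  # > 0 encourages sparse weights
    l2_regularization_strength: float = 0.0
    beta: float = 0.0                        # extra smoothing in the denominator
    clip_weight_min: Optional[float] = None  # optional bounds on updated weights
    clip_weight_max: Optional[float] = None
    weight_decay_factor: Optional[float] = None
    multiply_weight_decay_factor_by_learning_rate: bool = False
    multiply_linear_by_learning_rate: bool = False
    allow_zero_accumulator: bool = False


# Example configuration: L1/L2 regularization for sparsity plus weight clipping.
spec = FTRLOptimizerSpecSketch(
    learning_rate=0.05,
    l1_regularization_strength=0.001,
    l2_regularization_strength=0.001,
    clip_weight_min=-10.0,
    clip_weight_max=10.0,
)
```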

The optimizer maintains two slot variables for each trainable embedding parameter (see the update sketch after this list):

  • accumulator: Stores the sum of squared gradients, used to adapt the learning rate on a per-coordinate basis.
  • linear: Stores a linear combination related to the gradients, which is central to the FTRL weight update rule.
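
For reference, the sketch below shows how these two slots drive the standard FTRL-Proximal per-coordinate update (McMahan et al., 2013) with learning_rate_power = -0.5. It is a minimal illustration, not the primitive's actual kernel, and it omits the clipping, weight-decay, multiply_linear_by_learning_rate, and allow_zero_accumulator options.

```python
# Minimal FTRL-Proximal step (learning_rate_power = -0.5); illustrative only.
import jax.numpy as jnp


def ftrl_update(w, accumulator, linear, grad,
                learning_rate=0.05, l1=0.001, l2=0.001, beta=0.0):
    """Returns updated (weights, accumulator, linear) for one gradient step."""
    new_accumulator = accumulator + grad ** 2
    # sigma moves the old weight into the linear term as the per-coordinate
    # learning rate shrinks.
    sigma = (jnp.sqrt(new_accumulator) - jnp.sqrt(accumulator)) / learning_rate
    new_linear = linear + grad - sigma * w
    # Denominator of the closed-form weight update; beta adds extra smoothing.
    quadratic = (beta + jnp.sqrt(new_accumulator)) / learning_rate + 2.0 * l2
    # Soft-thresholding by l1 produces exact zeros, i.e. sparse weights.
    new_w = jnp.where(
        jnp.abs(new_linear) > l1,
        (jnp.sign(new_linear) * l1 - new_linear) / quadratic,
        0.0,
    )
    return new_w, new_accumulator, new_linear


# One step for a small embedding row and its two slot variables.
w = jnp.zeros(4)
accumulator = jnp.full(4, 0.1)   # often initialized to a small positive value
linear = jnp.zeros(4)
grad = jnp.array([0.2, -0.1, 0.0, 0.3])
w, accumulator, linear = ftrl_update(w, accumulator, linear, grad)
```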

copybara-service[bot] changed the title from "Add the Adam optimizer from [Kingma et al., 2014](http://arxiv.org/abs/1412.6980)." to "Add the FTRL (Follow The Regularized Leader) optimizer." on May 29, 2025
copybara-service[bot] force-pushed the test_764794731 branch 3 times, most recently from 40d69c6 to 63f5235, on June 3, 2025 at 15:51
PiperOrigin-RevId: 764794731