Add the Adagrad with Momentum optimizer. #269

copybara-service · 2025-05-30T21:29:16Z

Add the Adagrad with Momentum optimizer.

This implementation matches the one described in Duchi et al., 2011 with the momentum integration discussed in Sutskever et al., 2013.

Features / Params in AdagradMomentumOptimizerSpec:

learning_rate: The base learning rate.
momentum: The momentum parameter (exponential decay for the momentum buffer).
beta2: The decay rate for the running average of squared gradients (accumulator).
epsilon: A small constant added for numerical stability.
exponent: The power to which the accumulator is raised (often referred to as k_power in some contexts, typically 0.5 for Adagrad).
use_nesterov: Boolean flag; if true, Nesterov momentum is used.

The optimizer maintains two slot variables for each trainable embedding parameter:

accumulator: Stores the running average of squared gradients.
momentum_buffer: Stores the first-order momentum term.

This implementation matches the one described in [Duchi et al., 2011](https://arxiv.org/abs/1103.4296) with the momentum integration discussed in [Sutskever et al., 2013](https://proceedings.mlr.press/v28/sutskever13.pdf). Features / Params in `AdagradMomentumOptimizerSpec`: - **learning_rate**: The base learning rate. - **momentum**: The momentum parameter (exponential decay for the momentum buffer). - **beta2**: The decay rate for the running average of squared gradients (accumulator). - **epsilon**: A small constant added for numerical stability. - **exponent**: The power to which the accumulator is raised (often referred to as `k_power` in some contexts, typically 0.5 for Adagrad). - **use_nesterov**: Boolean flag; if true, Nesterov momentum is used. The optimizer maintains two slot variables for each trainable embedding parameter: - **accumulator**: Stores the running average of squared gradients. - **momentum_buffer**: Stores the first-order momentum term. PiperOrigin-RevId: 769858905

copybara-service bot force-pushed the test_765046416 branch 4 times, most recently from 9d061e1 to 50adf60 Compare May 30, 2025 22:51

copybara-service bot force-pushed the test_765046416 branch from 50adf60 to 1fff875 Compare June 10, 2025 18:28

copybara-service bot changed the title ~~Add the Adagrad-Momentum optimizer.~~ Add the Adagrad with Momentum optimizer. Jun 10, 2025

copybara-service bot force-pushed the test_765046416 branch from 1fff875 to b0703e5 Compare June 10, 2025 23:32

copybara-service bot force-pushed the test_765046416 branch from b0703e5 to 191e114 Compare June 10, 2025 23:51

copybara-service bot merged commit 191e114 into main Jun 10, 2025

copybara-service bot deleted the test_765046416 branch June 10, 2025 23:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add the Adagrad with Momentum optimizer. #269

Add the Adagrad with Momentum optimizer. #269

Uh oh!

copybara-service bot commented May 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

Add the Adagrad with Momentum optimizer. #269

Add the Adagrad with Momentum optimizer. #269

Uh oh!

Conversation

copybara-service bot commented May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

copybara-service bot commented May 30, 2025 •

edited

Loading