Survey for Stochastic Optimization

Awesome Stochastic Optimization

Parameter-free Online Learning

  • Cutkosky, Ashok. 2020. “Better Full-Matrix Regret via Parameter-Free Online Learning.” Advances in Neural Information Processing Systems 33.

Parameter-free Optimization for Deep Learning

  • Johnson, Tyler, Pulkit Agrawal, Haijie Gu, and Carlos Guestrin. 2020. “AdaScale SGD: A User-Friendly Algorithm for Distributed Training.” In International Conference on Machine Learning, 4911–20. PMLR.
  • Cutkosky, Ashok. 2020. “Parameter-Free, Dynamic, and Strongly-Adaptive Online Learning.” In Proceedings of the 37th International Conference on Machine Learning, edited by Hal Daumé III and Aarti Singh, 119:2250–59. Proceedings of Machine Learning Research. Virtual: PMLR.
  • Cutkosky, A., and T. Sarlos. 2019. “Matrix-Free Preconditioning in Online Learning.” In Proceedings of the International Conference on Machine Learning.
  • Orabona, F., and T. Tommasi. 2017. “Training Deep Networks without Learning Rates Through Coin Betting.” In Advances in Neural Information Processing Systems 30.
  • Cutkosky, A., and K. A. Boahen. 2016. “Online Convex Optimization with Unconstrained Domains and Losses.” In Advances in Neural Information Processing Systems 29, 748–56.
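
Several of the methods above, in particular Orabona and Tommasi (2017), rest on a coin-betting reduction: instead of tuning a learning rate, the learner bets a fraction of its accumulated “wealth” on the sign of the next gradient. Below is a minimal one-dimensional sketch of the Krichevsky–Trofimov (KT) bettor behind that reduction. It assumes gradients bounded in [-1, 1]; the initial wealth and the toy loss are illustrative choices, not taken from any of the cited papers.

```python
class KTBettor:
    """One-dimensional parameter-free learner: bet a KT fraction of wealth.

    A sketch of the coin-betting reduction; assumes |gradient| <= 1.
    `eps` is an arbitrary initial wealth, not a tuned learning rate.
    """

    def __init__(self, eps=1.0):
        self.wealth = eps      # bettor's current money
        self.coin_sum = 0.0    # sum of past coin outcomes (negative gradients)
        self.rounds = 0        # number of completed rounds

    def predict(self):
        # KT betting fraction: average of the past coin outcomes.
        beta = self.coin_sum / (self.rounds + 1)
        return beta * self.wealth

    def update(self, grad):
        # Settle the bet made by the last call to predict().
        w = self.predict()          # same value as the prediction just played
        coin = -grad                # "coin outcome" is the negative gradient
        self.wealth += coin * w     # wealth grows when the bet's sign was right
        self.coin_sum += coin
        self.rounds += 1


# Toy usage: minimize f(w) = |w - 3| online; subgradients lie in [-1, 1].
learner, avg = KTBettor(), 0.0
for t in range(1, 5001):
    w = learner.predict()
    avg += (w - avg) / t            # running average of the iterates
    g = 1.0 if w > 3 else (-1.0 if w < 3 else 0.0)
    learner.update(g)
print(round(avg, 2))                # the averaged iterate drifts toward the minimizer w = 3
```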

Parameter-free Learning with Experts

  • Harvey, N. J. A., C. Liaw, E. Perkins, and S. Randhawa. 2020. “Optimal Anytime Regret with Two Experts.” arXiv:2002.08994.
  • Koren, T., and R. Livni. 2017. “Affine-Invariant Online Optimization and the Low-Rank Experts Problem.” In Advances in Neural Information Processing Systems 30, 4747–55. Curran Associates, Inc.
  • Jun, K.-S., F. Orabona, S. Wright, and R. Willett. 2017. “Online Learning for Changing Environments Using Coin Betting.” Electronic Journal of Statistics 11 (2): 5282–5310.
  • Foster, D. J., A. Rakhlin, and K. Sridharan. 2015. “Adaptive Online Learning.” In Advances in Neural Information Processing Systems 28, 3375–83. Curran Associates, Inc.
  • Koolen, W. M., and T. van Erven. 2015. “Second-Order Quantile Methods for Experts and Combinatorial Games.” In Conference on Learning Theory, 1155–75.
  • Luo, H., and R. E. Schapire. 2015. “Achieving All with No Parameters: AdaNormalHedge.” In Conference on Learning Theory, 1286–1304.
  • Luo, H., and R. E. Schapire. 2014. “A Drifting-Games Analysis for Online Learning and Applications to Boosting.” In Advances in Neural Information Processing Systems.
  • Chernov, A., and V. Vovk. 2010. “Prediction with Advice of Unknown Number of Experts.” In Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence. AUAI Press.
  • Chaudhuri, K., Y. Freund, and D. J. Hsu. 2009. “A Parameter-Free Hedging Algorithm.” In Advances in Neural Information Processing Systems, 297–305.
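
As a concrete illustration of what “parameter-free” means in the experts setting, here is a sketch of the potential-based weighting of AdaNormalHedge (Luo and Schapire, 2015) as I read it: expert weights come from finite differences of a regret potential, so no learning rate is tuned. The uniform prior, the fallback rule, and the toy loss sequence are illustrative assumptions; consult the paper for the exact algorithm and guarantees.

```python
import numpy as np

def anh_weights(R, C):
    """Potential-based weights in the spirit of AdaNormalHedge (a sketch)."""
    def phi(r, c):
        # Phi(R, C) = exp(max(R, 0)^2 / (3C)); c >= 1 whenever called here.
        return np.exp(np.maximum(r, 0.0) ** 2 / (3.0 * c))
    return 0.5 * (phi(R + 1.0, C + 1.0) - phi(R - 1.0, C + 1.0))

def run_adanormalhedge(loss_matrix, prior=None):
    """Play the experts game on a (T, N) array of losses in [0, 1]."""
    T, N = loss_matrix.shape
    prior = np.full(N, 1.0 / N) if prior is None else prior
    R = np.zeros(N)   # cumulative regret to each expert
    C = np.zeros(N)   # cumulative |instantaneous regret|
    total = 0.0
    for t in range(T):
        w = prior * anh_weights(R, C)
        p = w / w.sum() if w.sum() > 0.0 else prior   # fall back to the prior
        algo_loss = float(p @ loss_matrix[t])
        total += algo_loss
        r = algo_loss - loss_matrix[t]                # instantaneous regrets
        R += r
        C += np.abs(r)
    return total, R

# Toy usage: 5 experts with random losses; expert 2 is slightly better on average.
rng = np.random.default_rng(0)
losses = rng.uniform(size=(1000, 5))
losses[:, 2] *= 0.8
algo_loss, regrets = run_adanormalhedge(losses)
print(algo_loss, losses.sum(axis=0).min())  # compare with the best expert's total loss
```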

Optimization Heuristics Related to Parameter-free Algorithms

  • Hoffer, Elad, Tal Ben-Nun, Itay Hubara, Niv Giladi, Torsten Hoefler, and Daniel Soudry. 2020. “Augment Your Batch: Improving Generalization Through Instance Repetition.” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 8129–38.
  • You, Yang, Yuhui Wang, Huan Zhang, Zhao Zhang, James Demmel, and Cho-Jui Hsieh. 2020. “The Limit of the Batch Size.” arXiv [cs.LG]. arXiv. http://arxiv.org/abs/2006.08517.
  • Bernstein, J., A. Vahdat, Y. Yue, and M.-Y. Liu. 2020. “On the Distance Between Two Neural Networks and the Stability of Learning.” arXiv:2002.03432.
  • You, Y., Z. Zhang, C.-J. Hsieh, J. Demmel, and K. Keutzer. 2018. “ImageNet Training in Minutes.” In Proceedings of the 47th International Conference on Parallel Processing.
  • You, Y., I. Gitman, and B. Ginsburg. 2017. “Scaling SGD Batch Size to 32K for ImageNet Training.” Technical Report UCB/EECS-2017-156, University of California, Berkeley.
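
The large-batch heuristics of You et al. (2017, 2018) scale each layer's step size by a trust ratio between the weight norm and the gradient norm (LARS). The sketch below shows that layer-wise scaling in plain NumPy; the trust coefficient, weight-decay handling, and zero-norm fallback are illustrative choices rather than the papers' exact settings, and momentum is omitted.

```python
import numpy as np

def lars_like_update(params, grads, lr=0.1, trust_coef=0.001, weight_decay=1e-4):
    """Sketch of a LARS-style layer-wise update (cf. You et al., 2017).

    `params` and `grads` are dicts mapping layer name -> np.ndarray.
    """
    for name, w in params.items():
        g = grads[name] + weight_decay * w            # L2-regularized gradient
        w_norm, g_norm = np.linalg.norm(w), np.linalg.norm(g)
        if w_norm > 0 and g_norm > 0:
            local_lr = trust_coef * w_norm / g_norm   # layer-wise trust ratio
        else:
            local_lr = 1.0                            # fall back to plain SGD scaling
        params[name] = w - lr * local_lr * g
    return params

# Toy usage with two "layers" of random weights and gradients.
rng = np.random.default_rng(0)
params = {"conv1": rng.standard_normal((3, 3)), "fc": rng.standard_normal(10)}
grads = {k: rng.standard_normal(v.shape) for k, v in params.items()}
params = lars_like_update(params, grads)
```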

Stochastic Optimization on Riemannian Manifolds

  • Sato, Hiroyuki, Hiroyuki Kasai, and Bamdev Mishra. 2019. “Riemannian Stochastic Variance Reduced Gradient Algorithm with Retraction and Vector Transport.” SIAM Journal on Optimization: A Publication of the Society for Industrial and Applied Mathematics 29 (2): 1444–72.
  • Fong, Robert Simon, and Peter Tino. 2019. “Extended Stochastic Derivative-Free Optimization on Riemannian Manifolds.” In Proceedings of the Genetic and Evolutionary Computation Conference Companion, 257–58. GECCO ’19. New York, NY, USA: Association for Computing Machinery.
  • Zhou, Pan, Xiaotong Yuan, Shuicheng Yan, and Jiashi Feng. 2019. “Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds.” IEEE Transactions on Pattern Analysis and Machine Intelligence PP (August). https://doi.org/10.1109/TPAMI.2019.2933841.
  • Bécigneul, Gary, and Octavian-Eugen Ganea. 2018. “Riemannian Adaptive Optimization Methods.” arXiv [cs.LG]. arXiv. http://arxiv.org/abs/1810.00760.
  • Zhang, Jingzhao, Hongyi Zhang, and Suvrit Sra. 2018. “R-SPIDER: A Fast Riemannian Stochastic Optimization Algorithm with Curvature Independent Rate.” arXiv [math.OC]. arXiv. http://arxiv.org/abs/1811.04194.
  • Tripuraneni, Nilesh, Nicolas Flammarion, Francis Bach, and Michael I. Jordan. 2018. “Averaging Stochastic Gradient Descent on Riemannian Manifolds.” arXiv [cs.LG]. arXiv. http://arxiv.org/abs/1802.09128.
  • Liu, Yuanyuan, Fanhua Shang, James Cheng, Hong Cheng, and Licheng Jiao. 2017. “Accelerated First-Order Methods for Geodesically Convex Optimization on Riemannian Manifolds.” In Advances in Neural Information Processing Systems, edited by I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, 30:4868–77. Curran Associates, Inc.
  • Zhang, Hongyi, Sashank J. Reddi, and Suvrit Sra. 2016. “Riemannian SVRG: Fast Stochastic Optimization on Riemannian Manifolds.” In Advances in Neural Information Processing Systems, edited by D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett, 29:4592–4600. Curran Associates, Inc.
  • Zhang, Hongyi, and Suvrit Sra. 2016. “First-Order Methods for Geodesically Convex Optimization.” In Conference on Learning Theory, 1617–38. PMLR.
  • Udriste, C. 2013. Convex Functions and Optimization Methods on Riemannian Manifolds. Springer Science & Business Media.
  • Bonnabel, S. 2013. “Stochastic Gradient Descent on Riemannian Manifolds.” IEEE Transactions on Automatic Control 58 (9): 2217–29.
  • Absil, P.-A., R. Mahony, and Rodolphe Sepulchre. 2009. Optimization Algorithms on Matrix Manifolds. Princeton University Press.
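
The common template in the references above (e.g., Bonnabel, 2013) is Riemannian SGD: project the stochastic Euclidean gradient onto the tangent space, take a step, and retract back onto the manifold. Below is a minimal sketch on the unit sphere, where the retraction is just renormalization; the toy eigenvector problem, the noise level, and the step-size schedule are illustrative assumptions.

```python
import numpy as np

def riemannian_sgd_sphere(grad_fn, x0, lr=0.01, steps=2000):
    """Sketch of Riemannian SGD on the unit sphere (cf. Bonnabel, 2013)."""
    x = x0 / np.linalg.norm(x0)
    for t in range(steps):
        g = grad_fn(x)                     # stochastic Euclidean gradient
        g_tan = g - (g @ x) * x            # project onto the tangent space at x
        step = lr / np.sqrt(t + 1.0)       # diminishing step size
        x = x - step * g_tan               # move in the tangent direction
        x = x / np.linalg.norm(x)          # retraction: renormalize onto the sphere
    return x

# Toy usage: leading eigenvector of a symmetric matrix via minimizing -x^T A x
# on the sphere, with added noise standing in for stochastic gradients.
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 20))
A = A @ A.T
noisy_grad = lambda x: -2.0 * (A @ x) + 0.5 * rng.standard_normal(20)
x_hat = riemannian_sgd_sphere(noisy_grad, rng.standard_normal(20))
print(abs(x_hat @ np.linalg.eigh(A)[1][:, -1]))   # should be near 1 if aligned with the top eigenvector
```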

Meta-Algorithm for Stochastic Optimization

  • Diakonikolas, Ilias, Gautam Kamath, Daniel Kane, Jerry Li, Jacob Steinhardt, and Alistair Stewart. 2019. “Sever: A Robust Meta-Algorithm for Stochastic Optimization.” In Proceedings of the 36th International Conference on Machine Learning, edited by Kamalika Chaudhuri and Ruslan Salakhutdinov, 97:1596–1606. Proceedings of Machine Learning Research. Long Beach, California, USA: PMLR.
  • Eftimov, Tome, and Peter Korošec. 2019. “Identifying Practical Significance through Statistical Comparison of Meta-Heuristic Stochastic Optimization Algorithms.” Applied Soft Computing 85 (December): 105862.
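
Sever (Diakonikolas et al., 2019) is a meta-algorithm that alternates between running a base learner and filtering out samples whose gradients look like outliers. The sketch below shows only the filtering step as I understand it, scoring points by their projection onto the top singular direction of the centered gradient matrix; the removal fraction is an illustrative knob, not the paper's exact rule.

```python
import numpy as np

def sever_filter_step(per_sample_grads, remove_frac=0.01):
    """One outlier-filtering step in the spirit of Sever (a sketch).

    `per_sample_grads` is an (n, d) array of gradients evaluated at the
    current model; returns the indices of the points to keep.
    """
    G = per_sample_grads - per_sample_grads.mean(axis=0)   # center the gradients
    _, _, vt = np.linalg.svd(G, full_matrices=False)       # top right singular vector
    v = vt[0]
    scores = (G @ v) ** 2                                   # outlier scores
    n_remove = max(1, int(remove_frac * len(scores)))
    keep = np.argsort(scores)[:-n_remove]                   # drop the highest-scoring points
    return keep
```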

Optimizers for Deep Neural Networks

  • Shah, Vatsal, Xiaoxia Wu, and Sujay Sanghavi. 2020. “Choosing the Sample with Lowest Loss Makes SGD Robust.” arXiv [stat.ML]. arXiv. http://arxiv.org/abs/2001.03316.
  • Li, Mingchen, Mahdi Soltanolkotabi, and Samet Oymak. 2020. “Gradient Descent with Early Stopping Is Provably Robust to Label Noise for Overparameterized Neural Networks.” In Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, edited by Silvia Chiappa and Roberto Calandra, 108:4313–24. Proceedings of Machine Learning Research. Online: PMLR.
  • Foret, Pierre, Ariel Kleiner, Hossein Mobahi, and Behnam Neyshabur. 2020. “Sharpness-Aware Minimization for Efficiently Improving Generalization.” arXiv [cs.LG]. arXiv. http://arxiv.org/abs/2010.01412.
  • Zhuang, Juntang, Tommy Tang, Yifan Ding, Sekhar C. Tatikonda, Nicha Dvornek, Xenophon Papademetris, and James Duncan. 2020. “AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients.” Advances in Neural Information Processing Systems 33. https://papers.nips.cc/paper/2020/file/d9d4f495e875a2e075a1a4a6e1b9770f-Paper.pdf.
  • Qian, Qian, and Xiaoyuan Qian. 2019. “The Implicit Bias of Adagrad on Separable Data.” Advances in Neural Information Processing Systems 32: 7761–69.
  • Zou, Fangyu, Li Shen, Zequn Jie, Weizhong Zhang, and Wei Liu. 2019. “A Sufficient Condition for Convergences of Adam and RMSProp.” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 11127–35.
  • Zhang, Z. 2018. “Improved Adam Optimizer for Deep Neural Networks.” In 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS), 1–2.
  • Reddi, Sashank J., Satyen Kale, and Sanjiv Kumar. 2018. “On the Convergence of Adam and Beyond.” In International Conference on Learning Representations.
  • Zhou, Dongruo, Yiqi Tang, Ziyan Yang, Yuan Cao, and Quanquan Gu. 2018. “On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization.” arXiv:1808.05671.
  • Kingma, D., and J. Ba. 2015. “Adam: A Method for Stochastic Optimization.” In Proceedings of the 3rd International Conference on Learning Representations (ICLR 2015).
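
For reference, the Adam update (Kingma and Ba, 2015) that several of the papers above analyze or modify is sketched below in plain NumPy, with an optional AMSGrad-style maximum on the second-moment estimate in the spirit of Reddi et al. (2018). The toy objective and hyperparameter values are illustrative; this AMSGrad variant applies the maximum after bias correction, a common implementation choice rather than the paper's exact pseudocode.

```python
import numpy as np

def adam(grad_fn, w0, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8,
         steps=1000, amsgrad=False):
    """Adam with an optional AMSGrad max-correction (a plain NumPy sketch)."""
    w = np.asarray(w0, dtype=float).copy()
    m = np.zeros_like(w)          # first-moment (mean) estimate
    v = np.zeros_like(w)          # second-moment (uncentered variance) estimate
    v_hat_max = np.zeros_like(w)  # running max of v_hat, used only for AMSGrad
    for t in range(1, steps + 1):
        g = grad_fn(w)
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g * g
        m_hat = m / (1 - beta1 ** t)           # bias correction
        v_hat = v / (1 - beta2 ** t)
        if amsgrad:
            v_hat_max = np.maximum(v_hat_max, v_hat)
            v_hat = v_hat_max                   # non-increasing effective step size
        w = w - lr * m_hat / (np.sqrt(v_hat) + eps)
    return w

# Toy usage: minimize a quadratic with noisy gradients.
rng = np.random.default_rng(0)
w = adam(lambda w: 2 * (w - 1.0) + 0.1 * rng.standard_normal(w.shape),
         w0=np.zeros(5), lr=0.05, steps=2000, amsgrad=True)
print(np.round(w, 2))             # approaches the minimizer at 1.0 in each coordinate
```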
