Why 0.9? Towards Better Momentum Strategies in Deep Learning

By Medium - 2021-02-26

Description

Momentum is a widely-used strategy for accelerating the convergence of gradient-based optimization techniques. Momentum was designed to speed up learning in directions of low curvature, without…

Summary

In deep learning, most practitioners set the value of momentum to 0.9 without attempting to further tune this hyperparameter (i.e., this is the default value for momentum in many popular deep learning packages).
Demon in Code: The code for implementing the Demon schedule is also extremely simple.
Demon performance is depicted in the top row, while the performance of the vanilla optimizer counterparts (i.e., SGDM and Adam) is depicted in the bottom row.
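
For readers who want to see what the Demon (decaying momentum) schedule looks like in practice, here is a minimal sketch. It assumes the decay rule from the Demon paper, beta_t = beta_init * (1 - t/T) / ((1 - beta_init) + beta_init * (1 - t/T)), where t is the current step and T is the total number of steps. The function name `demon_momentum` and its parameters are illustrative and are not taken from the article's own code.

```python
def demon_momentum(step, total_steps, beta_init=0.9):
    """Return the momentum value for `step` under the Demon decay rule.

    Hypothetical helper for illustration: starts at `beta_init` when
    step == 0 and decays to 0 when step == total_steps.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Fraction of training that remains, clamped to [0, 1].
    remaining = min(max(1.0 - step / total_steps, 0.0), 1.0)
    return beta_init * remaining / ((1.0 - beta_init) + beta_init * remaining)


if __name__ == "__main__":
    # Print the schedule at a few points of a 100-step run.
    for t in (0, 25, 50, 75, 100):
        print(t, demon_momentum(t, total_steps=100))
```

In a training loop, the returned value would be written into the optimizer's momentum setting (or Adam's first-moment coefficient) before each step; how that is done depends on the framework and is left out here.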

Topics

Machine_Learning (0.42)
Backend (0.17)
NLP (0.12)

Similar Articles

SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II

By DeepAI - 2020-12-24

AlphaStar, the AI that reaches GrandMaster level in StarCraft II, is a remarkable milestone demonstrating what deep reinforcement ...

Machine Learning Optimization Methods and Techniques

By Medium - 2020-12-07

The principal goal of machine learning is to create a model that performs well and gives accurate predictions in a particular set of cases. In order to achieve that, we need machine learning…

Combining Dask and PyTorch for Better, Faster Transfer Learning

By Saturn Cloud - 2020-12-01

Combining Dask and PyTorch for better transfer learning allows the data scientist to significantly improve the effective learning of a model.

CLIP: Connecting Text and Images

By OpenAI - 2021-01-05

We’re introducing a neural network called CLIP which efficiently learns visual concepts from natural language supervision.

Best Machine Learning Books (Updated for 2020)

By FloydHub Blog - 2020-03-05

The list of the best machine learning & deep learning books for 2020.

3 deep learning mysteries: Ensemble, knowledge- and self-distillation

By Microsoft Research - 2021-01-19

Microsoft and CMU researchers begin to unravel 3 mysteries in deep learning related to ensemble, knowledge distillation & self-distillation. Discover how their work leads to the first theoretical proo ...