Article Details

µTransfer: A technique for hyperparameter tuning of enormous neural networks - Microsoft Research

Retrieved on: 2022-03-10 18:58:05

Tags for this article:

Click the tags to see associated articles and topics

µTransfer: A technique for hyperparameter tuning of enormous neural networks - Microsoft Research. View article details on HISWAI: https://www.microsoft.com/en-us/research/blog/%25C2%25B5transfer-a-technique-for-hyperparameter-tuning-of-enormous-neural-networks/

Excerpt

Figure 1: In the default parameterization in PyTorch, the graph on the left, the activation scales diverge in width after one step of training. But in ...

Article found on: www.microsoft.com

View Original Article

This article is found inside other Hiswai user's workspaces. To start your own collection, sign up for free.

Sign Up