Article Details
Retrieved on: 2017-12-06 16:15:00
Excerpt
<div><b>AlphaGo Zero</b> tuned the hyper-parameters of its search by Bayesian optimization. AlphaZero reuses the same hyper-parameters for all games without game-specific tuning. The sole exception is the noise that is added to the prior policy to ensure exploration; this is scaled in proportion to the typical ...</div>
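The excerpt does not say how the exploration noise is generated or mixed in. As a rough illustration only, the sketch below assumes Dirichlet noise blended into the prior policy with a fixed weight. The function name `add_root_noise`, the parameters `alpha` and `epsilon`, and the default value of `epsilon` are assumptions made for this example and are not taken from the excerpt.

```python
import random

def add_root_noise(priors, alpha, epsilon=0.25, rng=None):
    """Blend Dirichlet noise into a prior policy over legal moves.

    priors  : list of prior probabilities, one per legal move (sums to 1)
    alpha   : Dirichlet concentration (assumed game-specific knob)
    epsilon : weight given to the noise (assumed value, for illustration)
    """
    rng = rng or random.Random()
    # A symmetric Dirichlet sample is a set of independent Gamma(alpha, 1)
    # draws divided by their sum.
    gammas = [rng.gammavariate(alpha, 1.0) for _ in priors]
    total = sum(gammas)
    if total == 0.0:
        # Extremely small alpha can underflow every draw; fall back to uniform.
        noise = [1.0 / len(priors)] * len(priors)
    else:
        noise = [g / total for g in gammas]
    # Convex combination keeps the result a valid probability distribution.
    return [(1.0 - epsilon) * p + epsilon * n for p, n in zip(priors, noise)]

if __name__ == "__main__":
    print(add_root_noise([0.5, 0.3, 0.2], alpha=0.3, rng=random.Random(0)))
```

Under this sketch, `alpha` would be the only value adjusted per game, while `epsilon` and the rest of the search settings stay fixed, which mirrors the single exception the excerpt describes.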