Alphago explained by Demi sHassabis
Demi explains the policy and value neural networks. Policy is neural network trained to make reasonable moves based upon supervised learning of 100,000 games. Value network is built from tens of millions of games to be able to determine what winning positions are. The value scoring of positions was previously believed to be impossible. Alphago …