One neural network, the “policy network”, selects the next move to play.
These neural networks take a description of the Go board as an input and process it through a number of different network layers containing millions of neuron-like connections. We created AlphaGo, a computer program that combines advanced search tree with deep neural networks.
To capture the intuitive aspect of the game, we needed a new approach.