by Inufu on 12/6/18, 7:01 PM with 89 comments
by gamegoblin on 12/6/18, 9:41 PM
Their results show that they are only just barely stronger than Stockfish 8, but Stockfish 9 and 10 are stronger than 8 as well.
EDIT: Also meant to include a shout-out to http://www.lczero.org/ which is an open source implementation of AlphaZero chess. Here is their forum post for this paper: https://groups.google.com/forum/#!topic/lczero/TfmaNHI99gk
SECOND EDIT: I was wrong! They did play against a newer SF than 8, specifically, SF at this commit: https://github.com/official-stockfish/Stockfish/commit/b508f... , which was about 2 weeks before SF 9 was released, so maybe it is close in strength to SF 9.
by mindgam3 on 12/7/18, 12:11 AM
IMHO as a competitive scholastic chess player (former national U16 champion and top 3 world U10) and software engineer, it would significantly increase credibility of results. Not to mention would be fascinating to see the “ugly” games in addition to the ones handpicked by your team.
by mcphage on 12/6/18, 10:24 PM
> The amount of training the network needs depends on the style and complexity of the game, taking approximately 9 hours for chess, 12 hours for shogi, and 13 days for Go.
How much would that much computing power would cost on something like AWS? That's a lot of hardware, but if you're only renting it for 9 hours... the beefiest EC2+GPU instance Amazon has currently is p3.16xlarge, which has 8 Tesla V100 GPUs, and 64 (virtual) CPUs, for $25/hour on-demand. My understanding is that a V100 is slightly more powerful than a Titan V, so does that mean you could run the Chess training (at least the AlphaZero side) for $225? That seems impossible?
EDIT: pacala below pointed out that the hardware listed was just for running AlphaZero against Stockfish, not for training it. Digging through the preprint itself, they say that for training they used:
> During training only, 5,000 first-generation tensor processing units (TPUs) (19) were used to generate self-play games, and 16 second-generation TPUs were used to train the neural networks.
So that would be... a lot more.
by madisfun on 12/7/18, 8:57 AM
AlphaZero was a great concept and execution, but if we have to judge its relative strength, it should compete fairly. 4 TPUs (~ 4 Titan V) + 44 cores for AlphaZero vs only 44 cores for Stockfish pre-9 may or may not have put Stockfish at a disadvantage.
BTW, current, presumably balanced, TCEC 14 configurations are:
Non-GPU Server: CPUs: 2 x Intel Xeon E5 2699 v4 @ 2.8 GHz, Cores: 44 physical, RAM: 64 GB DDR4 ECC
GPU Server: GPUs: 1 x 2080 ti + 1 x 2080, CPU: Quad Core i5 2600k, RAM: 16GB DDR3-2133
TCEC GPU server looks more modest than what A0 authors used to "beat" SF.
by RivieraKid on 12/6/18, 7:50 PM
Will you be answering questions?
by antirez on 12/7/18, 9:58 AM
"Traditional chess engines – including the world computer chess champion Stockfish and IBM’s ground-breaking Deep Blue – rely on thousands of rules and heuristics handcrafted by strong human players that try to account for every eventuality in a game."
by stabbles on 12/6/18, 8:17 PM
by schaefer on 12/6/18, 11:03 PM
As a Go player, is there any way I could download and review the game records described in this paper?
by Rampoina on 12/7/18, 1:51 PM
I'm excited about their work but it seems that it would be much better for everyone if they just released their work openly.
by kevinwang on 12/7/18, 6:42 PM
by detaro on 12/6/18, 7:04 PM
by YetAnotherNick on 12/6/18, 10:05 PM
by thom on 12/6/18, 9:43 PM
by nuguy on 12/7/18, 4:51 AM
Please stop applying your intillenge to AI.
Edit: substantive counter-arguments would be highly appreciated