Belgian reinvents Leela Zero and upsets Chinese professionals as go gets overhaul
A computer program that plays the board game of go and upsets Chinese go professionals carries the promise beyond remaking AlphaGo Zero, which has outsmarted any human go players.
Leela Zero, developed by 35-year-old Belgian electronics engineer Gian-Carlo Pascutto, beat Chinese professionals on the popular go platform Yike Weiqi.
“It’s beginning to feel like AlphaGo,” says Fu Qixuan, co-founder and CEO of Yike, referring to the program by tech powerhouse Google Deepmind.
But in the case of Leela Zero, it is the work of one man, who barely plays go himself, plus contribution of computing power by volunteers from across the globe, compared with a concerted team effort by a resourceful Internet giant.
Go is a game widely viewed as a grand challenge for artificial intelligence, and the Google Deepmind team’s success was a game-changer and wake-up call in recent years.
Leela Zero is a fairly faithful re-implementation of the system described in the AlphaGo Zero paper “Mastering the Game of Go without Human Knowledge” published in the academic journal Nature, Pascutto says, adding that “for all intents and purposes, it is an open source AlphaGo Zero.”
Revealed in October 2017, AlphaGo Zero was the first computer program that learns to play simply by playing games against itself, starting from completely random play. In doing so, it quickly surpassed human level of play by using a novel form of reinforcement learning, in which AlphaGo Zero becomes its own teacher.
In other words, the system starts off with a neural network that knows nothing about go. It then plays games against itself, by combining this neural network with a powerful search algorithm. As it plays, the neural network is tuned and updated to predict moves, as well as the eventual winner of the games.
Similar to AlphaGo Zero, Leela Zero also used what is known as a Monte Carlo tree search and a deep residual convolutional neural network stack, with no human provided knowledge.
But there is still a catch. Network weights, something crucial to the improvement of the computer program’s strength, has to be computed and Pascutto does not possess the computing powers of Google Deepmind.
Instead of relying solely on his own computing hardwares, which by Pascutto’s first estimation would take 1,700 years to reach a level similar to AlphaGo Zero, he made Leela Zero, together with the source code, freely available online.
From November 10, 2017, anonymous Internet users have contributed their own computing capacity to the cause: In general, about 500 people connect every day, letting their own hardwares do Leela Zero’s self-training. At any given moment, the number of people connected has never been under 200.
Pascutto says that lagged behind Google Deepmind’s army of chips. “To give some comparison, Alpha Zero used about 5,000 special purpose chips each worth about four high end graphics cards. So the distributed project is probably several hundred times slower.”
But four months into the distributed computing, Leela Zero already began to show good results.
“I did not have any idea what to expect, but I am very happy with the current results: It has beaten strong professionals,” Pascutto says.
Fu, of go platform Yike Weiqi, agrees.
“It’s a grand experiment, with Gian-Carlo Pascutto laying the foundation and Internet users’ passion helping Leela Zero grow. Its openness and public participation is amazing,” he says.
Fu’s platform, which enables humans to play against each other, accommodated Leela Zero via what is known as the “Go Text Protocol” and has arranged a number of professional players against the program.
While Leela Zero has crushed many of them, including national champions of China, it also made some very amateurish mistakes in the games.
“This also happens without self-training, but obviously as the program has no human knowledge programmed in to ‘fix the holes’ in its knowledge, they are more pronounced,” Pascutto says.
He says he explicitly chose to exclude human knowledge “because this allows seeing where the move choices of the program match those of the best humans. Any differences should be very interesting for go players.”
Working thousands of kilometers away from East Asia, where go was invented and considered as one of the four essential arts of the cultured aristocratic Chinese scholars in antiquity, Pascutto is a native of Ninove, roughly halfway between Brussels and Ghent.
He was actually not a go enthusiast and last played the game more than a decade ago. But he did want to shed light on the ancient game, perhaps more than other computer programmers do.
After Google Deepmind retired AlphaGo from playing with humans, there are other computer programs available, but they haven’t revealed much about their coding.
Pascutto believes his work has meaning not only in remaking AlphaGo, which he described as “maybe close enough.”
“As these (other) teams don’t publish what they do, we do not know about their methods,” Pascutto says.
“In the end, if the result is not widely available, what purpose does it serve? Deepmind’s program beat the best humans, but what use is it to the go players? They have gotten only a few games to study. I have made everything open so others can carry on the work.”
- About Us
- |
- Terms of Use
- |
- RSS
- |
- Privacy Policy
- |
- Contact Us
- |
- Shanghai Call Center: 962288
- |
- Tip-off hotline: 52920043
- 沪ICP证:沪ICP备05050403号-1
- |
- 互联网新闻信息服务许可证:31120180004
- |
- 网络视听许可证:0909346
- |
- 广播电视节目制作许可证:沪字第354号
- |
- 增值电信业务经营许可证:沪B2-20120012
Copyright © 1999- Shanghai Daily. All rights reserved.Preferably viewed with Internet Explorer 8 or newer browsers.