AlphaZero - whatʹs missing?

Abstract

In December 2017, the game playing program AlphaZero was reported to have learned in less than 24 hours to play each of the games of chess, Go and shogi better than any human, and better than any other existing specialised computer program for these games. This was achieved just by self-play, without access to any knowledge of these games other than the rules of the game. In this paper we consider some limitations to this spectacular success. The program was trained in well-defined and relatively small domains (admittedly with enormous combinatorial complexity) compared to many real world problems, and it was possible to generate large amounts of learning data through simulated games which is typically not possible in real life domains. When it comes to understanding the games played by AlphaZero, the program’s inability to explain its games and the knowledge acquired in human-understandable terms is a serious limitation.

Keywords

No keyword data available

Data

Language:	English
Year of publishing:	2018
Typology:	1.01 - Original Scientific Article
Organization:	UL FRI - Faculty of Computer and Information Science
UDC:	004.8:004.5:794.1
COBISS:	304025600
ISSN:	0350-5596
Views:	76
Downloads:	24
Average score:	0 (0 votes)
Metadata:

Other data