Ivan Bratko (Author)

Abstract

In December 2017, the game playing program AlphaZero was reported to have learned in less than 24 hours to play each of the games of chess, Go and shogi better than any human, and better than any other existing specialised computer program for these games. This was achieved just by self-play, without access to any knowledge of these games other than the rules of the game. In this paper we consider some limitations to this spectacular success. The program was trained in well-defined and relatively small domains (admittedly with enormous combinatorial complexity) compared to many real world problems, and it was possible to generate large amounts of learning data through simulated games which is typically not possible in real life domains. When it comes to understanding the games played by AlphaZero, the program’s inability to explain its games and the knowledge acquired in human-understandable terms is a serious limitation.

Keywords

No keyword data available

Data

Language: English
Year of publishing:
Typology: 1.01 - Original Scientific Article
Organization: UL FRI - Faculty of Computer and Information Science
UDC: 004.8:004.5:794.1
COBISS: 304025600 Link will open in a new window
ISSN: 0350-5596
Views: 76
Downloads: 24
Average score: 0 (0 votes)
Metadata: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Other data

Secondary language: Slovenian
Secondary keywords: računalniško igranje iger;umetna inteligenca;strojno učenje;računalniške igre;računalniški šah;računalniški programi;
Type (COBISS): Article
Pages: str. 7-11
Volume: ǂVol. ǂ42
Issue: ǂno. ǂ1
Chronology: Mar. 2018
ID: 26070792
Recommended works:
, no subtitle data available
, no subtitle data available
, magistrsko delo
, (magistrsko diplomsko delo)