Iskalni niz:
išči po
išči po
išči po
išči po
Vrsta gradiva:
Jezik:
Št. zadetkov: 1
Video in druga učna gradiva
Oznake: computer science;machine learning
We consider the problem of undiscounted reinforcement learning in continuous state space. Regret bounds in this setting usually hold under various assumptions on the structure of the reward and transition function. Under the assumption that the rewards and transition probabilities are Lipschitz, for ...
Leto: 2015 Vir: videolectures.net
Št. zadetkov: 1
Ključne besede:
Leto izdaje:
Repozitorij:
Tipologija:
Jezik: