Number of results: 10
Video and other learning materials
Tags:
mathematics;game theory
Partial monitoring games form a common ground for problems such as learning with expert advice, the multi-armed bandit problem, dynamic pricing, the dark pool problem, label-efficient prediction, channel allocation in cognitive radio, and linear and convex optimization with various kinds of feedback. How hard i ...
Year:
2011
Source:
videolectures.net
Video and other learning materials
Tags:
computer science;machine learning;reinforcement learning
The tutorial will introduce Reinforcement Learning, that is, learning what actions to take, and when to take them, so as to optimize long-term performance. This may involve sacrificing immediate reward to obtain greater reward in the long term or just to obtain more information about the environ ...
Year:
2008
Source:
videolectures.net
Video and other learning materials
Tags:
computer science;machine learning;on-line learning
How should an online learner choose its actions to trade off between exploration and exploitation to maximize the accuracy of predictions when the choice of actions directly influences what information the learner receives? First, using the abstract framework of partial monitoring, we provide a full ...
Year:
2012
Source:
videolectures.net
Video and other learning materials
Tags:
computer science;machine learning;active learning;exploration vs. exploitation
Year:
2006
Source:
videolectures.net
Video and other learning materials
Tags:
computer science;machine learning
Year:
2008
Source:
videolectures.net
Video and other learning materials
Tags:
computer science;machine learning;reinforcement learning
A popular approach in reinforcement learning is to use a model-based algorithm, i.e., an algorithm that uses a model learner to learn an approximate model of the environment. It has been shown that such a model-based learner is efficient if the model learner is efficient in the so-called "knows what ...
Year:
2011
Source:
videolectures.net
Video and other learning materials
Tags:
technology;engineering;electrical engineering
We study the average cost Linear Quadratic (LQ) problem with unknown model parameters, also known as the adaptive control problem in the control community. We design an algorithm and prove that its regret up to time T is O(√T) apart from logarithmic factors. Unlike many classical approaches that ...
Year:
2011
Source:
videolectures.net
Video and other learning materials
Tags:
computer science;machine learning
We study independent component analysis with noisy observations. We present, for the first time in the literature, consistent, polynomial-time algorithms to recover non-Gaussian source signals and the mixing matrix with a reconstruction error that vanishes at a 1/√T rate using T observations and sca ...
Year:
2015
Source:
videolectures.net
Video and other learning materials
Tags:
computer science;machine learning;deep learning;reinforcement learning;unsupervised learning
Year:
2017
Source:
videolectures.net
Video and other learning materials
Tags:
social sciences;education;computer science;robotics
In this video we demonstrate how robots are able to find their location on a map. The video is based on the project work of fourth-year students at the University of Alberta.
Year:
2009
Source:
videolectures.net