National Open Science Portal (Nacionalni portal odprte znanosti)

Toward the understanding of partial-monitoring games

Csaba Szepesvári

Video and other learning materials

Tags: mathematics; game theory

Partial monitoring games form a common ground for problems such as learning with expert advice, the multi-armed bandit problem, dynamic pricing, the dark pool problem, label efficient prediction, channel allocation in cognitive radio, linear and convex optimization with various feedbacks. How hard i ...

Year: 2011 Source: videolectures.net

Introduction to Reinforcement Learning

Csaba Szepesvári

Video and other learning materials

Tags: computer science; machine learning; reinforcement learning

The tutorial will introduce Reinforcement Learning, that is, learning what actions to take, and when to take them, so as to optimize long-term performance. This may involve sacrificing immediate reward to obtain greater reward in the long term or just to obtain more information about the environ ...

Year: 2008 Source: videolectures.net

Tradeoffs in online learning under partial information feedback

Csaba Szepesvári

Video and other learning materials

Tags: computer science; machine learning; on-line learning

How should an online learner choose its actions to trade off between exploration and exploitation to maximize the accuracy of predictions where the choice of actions directly influences what information the learner receives? First, using the abstract framework of partial monitoring, we provide a full ...

Year: 2012 Source: videolectures.net

Using upper confidence bounds to control exploration and exploitation

Csaba Szepesvári

Video and other learning materials

Tags: computer science; machine learning; active learning; exploration vs. exploitation

Year: 2006 Source: videolectures.net

The use of Unlabeled Data in Supervised Learning: the Manifold Dossier

Csaba Szepesvári

Video and other learning materials

Tags: computer science; machine learning

Year: 2008 Source: videolectures.net

Agnostic KWIK learning and efficient approximate reinforcement learning

Csaba Szepesvári

Video and other learning materials

Tags: computer science; machine learning; reinforcement learning

A popular approach in reinforcement learning is to use a model-based algorithm, i.e., an algorithm that utilizes a model learner to learn an approximate model of the environment. It has been shown that such a model-based learner is efficient if the model learner is efficient in the so-called "knows what ...

Year: 2011 Source: videolectures.net

Regret Bounds for the Adaptive Control of Linear Quadratic Systems

Csaba Szepesvári

Video and other learning materials

Tags: technology; engineering; electrical engineering

We study the average cost Linear Quadratic (LQ) problem with unknown model parameters, also known as the adaptive control problem in the control community. We design an algorithm and prove that its regret up to time T is O(√T) apart from logarithmic factors. Unlike many classical approaches that ...

Year: 2011 Source: videolectures.net

Deterministic Independent Component Analysis

Csaba Szepesvári

Video and other learning materials

Tags: computer science; machine learning

We study independent component analysis with noisy observations. We present, for the first time in the literature, consistent, polynomial-time algorithms to recover non-Gaussian source signals and the mixing matrix with a reconstruction error that vanishes at a 1/√T rate using T observations and sca ...

Year: 2015 Source: videolectures.net

Theory of RL

Csaba Szepesvári

Video and other learning materials

Tags: computer science; machine learning; deep learning; reinforcement learning; unsupervised learning

Year: 2017 Source: videolectures.net

36. Little Robot Goes Missing

Csaba Szepesvári, Reka Szepesvari, Eszter Szepesvari

Video and other learning materials

Tags: social sciences; education; computer science; robotics

In this video we demonstrate how robots are able to find their location on a map. The video is based on the project work of fourth-year students at the University of Alberta.

Year: 2009 Source: videolectures.net

Access to the knowledge of Slovenian research organisations