MLCS: José Miguel Hernández-Lobato, University of Cambridge "Advances in Compression and Exploration via Probabilistic Machine Learning"

Machine Learning Coffee seminars are weekly seminars held jointly by the Aalto University and the University of Helsinki. The seminars aim to gather people from different fields of science with interest in machine learning. Seminars will be held weekly on Mondays at 9 am – 10 am.
MLCS seminar

Advances in Compression and Exploration via Probabilistic Machine Learning

José Miguel Hernández-Lobato
Professor of Computer Science
University of Cambridge


In this talk, I will describe two recent contributions in the area of probabilistic machine learning. The first one is MIRACLE, a method for finding compressed representations of neural network weights which can be very useful for the design of mobile apps and energy-efficient hardware. We encode the network weights using a random sample, requiring only a number of bits corresponding to the Kullback-Leibler divergence between the sampled variational distribution and the encoding distribution. Unlike other methods, we can explicitly control the compression rate while optimizing the expected loss on the training set. The employed encoding scheme can be shown to be close to the optimal information-theoretical lower bound. The second contribution is Successor Uncertainties (SU), a probabilistic Q-learning method for balancing exploration and exploitation in reinforcement learning. SU can incorporate uncertainty about long-term consequences of actions accounting for the fundamental dependencies in state-action values implied by the Bellman equation. SU outperforms existing algorithms on several tabular benchmarks and attains strong performance on the Atari benchmark suite.

See the next talks at the seminar webpage

Please spread the news and join us for our weekly habit of beginning the week by an interesting machine learning talk!


  • Julkaistu:
  • Päivitetty:
URL kopioitu