COMPUTATIONAL RESEARCH in BOSTON and BEYOND (CRIBB)

Date November 2, 2018
Speaker Stefanie Jegelka Massachusetts Institute of Technology
Topic Aspects of Robustness and Representation in Machine Learning
Abstract

Reliability of machine learning methods includes many facets. One aspect are robust, stable algorithms. Another one is a better theoretical understanding of properties of currently popular models. In this talk, I will show recent work on both these directions.

First, we address robustness for black-box optimization with Gaussian Processes (GPs). GP-based methods have become popular tools for sequentially optimizing an unknown function that is expensive to evaluate, with applications in robotics, hyperparameter tuning, recommender systems and environmental monitoring. In such applications, robust, stable solutions are of interest for several reasons:

the underlying functions during optimization and implementation stages are different, or one seeks an entire region of good inputs rather than only a single point. We formalize this by allowing the query point to be adversarially perturbed, and require the function value to remain as high as possible even after this perturbation. Standard GP optimization approaches can fail in this setting. We provide a new, confidence-bound based algorithm, and establish lower and upper bounds on the required number of samples to find a near-optimal point.

Second, we explore the representational power of ResNet, a popular recent neural network architecture that augments the network with a parallel identity mapping. While classical results address wide, shallow networks, we ask how narrow a deep ResNet can be to still allow universal approximation. Our results show that one hidden unit is enough, in sharp contrast to fully connected networks.

This talk is based on joint work with Ilija Bogunovic, Jonathan Scarlett, Volkan Cevher and Hongzhou Lin.

Archives

Acknowledgements

We thank the generous support of MIT IS&T, CSAIL, and the Department of Mathematics for their support of this series.

MIT Math CSAIL EAPS Lincoln Lab Harvard Astronomy

Accessibility