Learning Anytime Predictions in Neural Networks via Adaptive Loss Balancing

Hanzhang Hu, Debadeepta Dey, Martial Hebert, and J. Andrew Bagnell

Conference Paper, Proceedings of 33rd National Conference on Artificial Intelligence (AAAI '19), pp. 3812 - 3821, January, 2019


Abstract

This work considers the trade-off between accuracy and test-time computational cost of deep neural networks (DNNs) via anytime predictions from auxiliary predictions. Specifically, we optimize auxiliary losses jointly in an adaptive weighted sum, where the weights are inversely proportional to the average of each loss.
Intuitively, this balances the losses to have the same scale.
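The weighting scheme described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the class name `AdaptiveLossWeights`, the use of an exponential moving average to estimate each loss's average, the `momentum` and `eps` parameters, and the normalization of the weights are all assumptions made for the example.

```python
import numpy as np


class AdaptiveLossWeights:
    """Weights each auxiliary loss by the inverse of its running average.

    Hypothetical helper: the moving-average estimator and the normalization
    are illustrative choices, not details taken from the paper.
    """

    def __init__(self, momentum=0.99, eps=1e-8):
        self.momentum = momentum  # smoothing factor for the running average
        self.eps = eps            # guards against division by zero
        self.avg = None           # running average of each loss

    def update(self, losses):
        """Fold the latest loss values into the running averages."""
        losses = np.asarray(losses, dtype=float)
        if self.avg is None:
            self.avg = losses.copy()
        else:
            self.avg = self.momentum * self.avg + (1.0 - self.momentum) * losses
        return self.weights()

    def weights(self):
        """Weights proportional to 1 / average loss, rescaled to sum to N."""
        w = 1.0 / (self.avg + self.eps)
        return w * len(w) / w.sum()

    def combined(self, losses):
        """Adaptive weighted sum of the current loss values."""
        return float(np.dot(self.weights(), np.asarray(losses, dtype=float)))


# Three auxiliary losses on very different scales.
balancer = AdaptiveLossWeights()
losses = np.array([4.0, 2.0, 1.0])
w = balancer.update(losses)
# After weighting, every term contributes the same amount to the sum.
balanced_terms = w * losses
```

In a training loop the weights would typically be treated as constants when differentiating, so that gradients flow only through the losses themselves; that choice is also an assumption of this sketch.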

We demonstrate theoretical considerations that motivate this approach from multiple viewpoints, including connecting it to optimizing the geometric mean of the expectation of each loss, an objective that ignores the scale of losses.
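The connection to the geometric mean can be illustrated with a short derivation. The notation here is assumed for the example rather than taken from the paper: $\ell_i(\theta)$ denotes the $i$-th of $N$ auxiliary losses, each with positive expectation.

```latex
% Geometric mean of the expected losses (assumed notation):
G(\theta) = \Bigl( \prod_{i=1}^{N} \mathbb{E}[\ell_i(\theta)] \Bigr)^{1/N}

% Taking the logarithm turns the product into a sum:
\log G(\theta) = \frac{1}{N} \sum_{i=1}^{N} \log \mathbb{E}[\ell_i(\theta)]

% Its gradient is a weighted sum of the loss gradients,
% with weights inversely proportional to each expected loss:
\nabla_\theta \log G(\theta)
  = \frac{1}{N} \sum_{i=1}^{N}
    \frac{\nabla_\theta \, \mathbb{E}[\ell_i(\theta)]}{\mathbb{E}[\ell_i(\theta)]}

% Rescaling \ell_i by a constant c > 0 adds only (\log c)/N to \log G,
% so the gradient, and hence the optimization direction, is unchanged.
```

Under these assumptions, a gradient step on the adaptively weighted sum points in the same direction as a gradient step on $\log G$, which is one way to see why the objective ignores the scale of the individual losses.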

Experimentally, the adaptive weights induce more competitive anytime predictions on multiple recognition datasets and models than non-adaptive approaches, including weighting all losses equally. In particular, anytime neural networks (ANNs) can achieve the same accuracy faster using adaptive weights on a small network than using static constant weights on a large one. For problems with high performance saturation, we also show a sequence of exponentially deepening ANNs can achieve near-optimal anytime results at any budget, at the cost of a constant fraction of extra computation.
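The constant-overhead claim for an exponentially growing sequence can be checked with simple arithmetic. The sketch below assumes, purely for illustration, that the networks are run one after another and that network i costs base**i units of computation with base = 2; the abstract does not state the growth factor, and the function names are hypothetical.

```python
def cumulative_cost(k, base=2.0):
    """Total cost of running networks 0..k in sequence, where network i costs base**i."""
    return sum(base ** i for i in range(k + 1))


def overhead_ratio(k, base=2.0):
    """Cost of the whole sequence up to network k, relative to running network k alone."""
    return cumulative_cost(k, base) / base ** k


# With doubling costs, the ratio is 2 - 2**(-k): it approaches 2 but never reaches it.
ratios = [overhead_ratio(k) for k in range(6)]
```

Under this assumption, finishing any network in the sequence costs less than twice what that network would cost on its own, which is the sense in which the extra computation is a constant fraction.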

BibTeX

@conference{Hu-2019-110314,
author = {Hanzhang Hu and Debadeepta Dey and Martial Hebert and J. Andrew Bagnell},
title = {Learning Anytime Predictions in Neural Networks via Adaptive Loss Balancing},
booktitle = {Proceedings of 33rd National Conference on Artificial Intelligence (AAAI '19)},
year = {2019},
month = {January},
pages = {3812--3821},
keywords = {anytime prediction, neural network},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.