Neural Batch Sampling with Reinforcement Learning for Semi-Supervised Anomaly Detection

Master's Thesis, Tech. Report, CMU-RI-TR-20-09, Robotics Institute, Carnegie Mellon University, May, 2020

View Publication

Abstract

We are interested in the detection and segmentation of anomalies in images where the anomalies are typically small (i.e., a small tear in woven fabric, broken pin of an IC chip). From a statistical learning point of view, anomalies have low occurrence probability and are not from the main modes of a data distribution. Learning a generative model of anomalous data from a natural distribution of data can be diÿcult because the data distribution is heavily skewed towards a large amount of non-anomalous data. When training a generative model on such imbalanced data using an iterative learning algorithm like stochastic gradient descent (SGD), we observe an expected yet interesting trend in the loss values (a measure of the learned models performance) after each gradient update across data samples. Naturally, as the model sees more non-anomalous data during training, the loss values over a nonanomalous data sample decreases, while the loss values on an anomalous data sample ﬂuctuates. In this work, our key hypothesis is that this change in loss values during training can be used as a feature to identify anomalous data. In particular, we propose a novel semi-supervised learning algorithm for anomaly detection and segmentation using an anomaly classiﬁer that uses as input the loss proﬁle of a data sample processed through an autoencoder. The loss proﬁle is deﬁned as a sequence of reconstruction loss values produced during iterative training. To amplify the difference in loss proﬁles between anomalous and nonanomalous data, we also introduce a Reinforcement Learning based metaalgorithm, which we call the neural batch sampler, to strategically sample training batches during autoencoder training. Experimental results on multiple datasets with a high diversity of textures and objects, often with multiple modes of defects within them, demonstrate the capabilities and effectiveness of our method when compared with existing state-of-the-art baselines.

BibTeX

@mastersthesis{Chu-2020-121379,
author = {Wen-Hsuan Chu},
title = {Neural Batch Sampling with Reinforcement Learning for Semi-Supervised Anomaly Detection},
year = {2020},
month = {May},
school = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-20-09},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.