D-PAttNet: Dynamic patch-attentive deep network for action unit detection

Itir Onal Ertugrul, Le Yang, Laszlo A. Jeni, and Jeffrey F. Cohn
Journal Article, Frontiers in Computer Science, Vol. 1, No. 11, November, 2019

Abstract

Facial action units (AUs) relate to specific local facial regions. Recent efforts in automated AU detection have focused on learning facial patch representations to detect specific AUs. These efforts have encountered three hurdles. First, they implicitly assume that facial patches are robust to head rotation, yet non-frontal rotation is common. Second, mappings between AUs and patches are defined a priori, which ignores co-occurrences among AUs. Third, the dynamics of AUs are either ignored or modeled sequentially rather than simultaneously, as in human perception. Inspired by recent advances in human perception, we propose a dynamic patch-attentive deep network, called D-PAttNet, for AU detection that (i) controls for 3D head and face rotation, (ii) learns mappings of patches to AUs, and (iii) models spatiotemporal dynamics. The D-PAttNet approach significantly improves upon the existing state of the art.
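The abstract names three ideas (pose-normalized facial patches, learned patch-to-AU attention, and spatiotemporal modeling) without implementation detail. The PyTorch sketch below is only an illustration of that general design, not the authors' architecture: the `PatchAttention` module name, the patch count, the feature dimensions, the shared CNN encoder, and the GRU standing in for the paper's temporal model are all assumptions.

```python
import torch
import torch.nn as nn

class PatchAttention(nn.Module):
    """Illustrative patch-attentive block (hypothetical, not the paper's model):
    encodes facial patches, learns attention weights over them, and models
    dynamics across frames for per-frame AU detection."""

    def __init__(self, num_patches=9, feat_dim=64, num_aus=12):
        super().__init__()
        # Small CNN encoder shared across patches (an assumption; the paper
        # may use separate per-patch encoders).
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, feat_dim),
        )
        # Attention scores over patches: learned from data rather than
        # fixed a priori, matching point (ii) of the abstract.
        self.attn = nn.Linear(feat_dim, 1)
        # Temporal model over per-frame descriptors; a GRU is used here as
        # a generic stand-in for the paper's spatiotemporal module.
        self.temporal = nn.GRU(feat_dim, feat_dim, batch_first=True)
        self.classifier = nn.Linear(feat_dim, num_aus)

    def forward(self, patches):
        # patches: (batch, time, num_patches, 1, H, W), assumed already
        # registered to a canonical frontal pose, i.e. the 3D head/face
        # normalization of point (i) happens upstream.
        b, t, p = patches.shape[:3]
        feats = self.encoder(patches.flatten(0, 2))       # (b*t*p, feat_dim)
        feats = feats.view(b, t, p, -1)
        weights = torch.softmax(self.attn(feats), dim=2)  # attention over patches
        frame_desc = (weights * feats).sum(dim=2)         # (b, t, feat_dim)
        out, _ = self.temporal(frame_desc)                # point (iii): dynamics
        return torch.sigmoid(self.classifier(out))        # per-frame AU probabilities

# Example usage with made-up shapes: 2 clips, 8 frames, 9 patches of 24x24.
model = PatchAttention()
clip = torch.randn(2, 8, 9, 1, 24, 24)
probs = model(clip)  # (2, 8, 12): one probability per AU per frame
```

Because AU detection is multi-label (several AUs can co-occur in a frame), the sketch ends in independent sigmoids rather than a softmax over AUs.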

BibTeX

@article{Ertugrul-2019-119647,
  author  = {Itir Onal Ertugrul and Le Yang and Laszlo A. Jeni and Jeffrey F. Cohn},
  title   = {D-PAttNet: Dynamic patch-attentive deep network for action unit detection},
  journal = {Frontiers in Computer Science},
  year    = {2019},
  month   = {November},
  volume  = {1},
  number  = {11},
}