Automatic recognition of facial displays of unfelt emotions

Humans modify their facial expressions in order to communicate their internal states and sometimes to mislead observers regarding their true emotional states. Evidence in experimental psychology shows that discriminative facial responses are short and subtle. This suggests that such behavior would b...

Descripción completa

Detalles Bibliográficos
Autores: Kulkarni, Kaustubh, Corneanu, Ciprian, Ofodile, Ikechukwu, Escalera, Sergio, Baró, Xavier, Hyniewska, Sylwia, Allik, Jüri, Anbarjafari, Gholamreza
Tipo de recurso: artículo
Estado:Versión enviada para evaluación y publicación
Fecha de publicación:2018
País:España
Institución:Universitat Oberta de Catalunya (UOC)
Repositorio:O2, repositorio institucional de la UOC
OAI Identifier:oai:openaccess.uoc.edu:10609/93201
Acceso en línea:https://hdl.handle.net/10609/93201
Access Level:acceso abierto
Palabra clave:affective computing
facial expression recognition
unfelt facial expression of emotion
human behaviour analysis
computación afectiva
reconocimiento de la expresión facial
expresión facial sin emoción
análisis del comportamiento humano
computació afectiva
reconeixement d'expressió facial
expressió facial sense emoció
anàlisi del comportament humà
Human face recognition (Computer science)
Reconeixement facial (Informàtica)
Reconocimiento facial (Informática)
Descripción
Sumario:Humans modify their facial expressions in order to communicate their internal states and sometimes to mislead observers regarding their true emotional states. Evidence in experimental psychology shows that discriminative facial responses are short and subtle. This suggests that such behavior would be easier to distinguish when captured in high resolution at an increased frame rate. We are proposing SASE-FE, the first dataset of facial expressions that are either congruent or incongruent with underlying emotion states. We show that overall the problem of recognizing whether facial movements are expressions of authentic emotions or not can be successfully addressed by learning spatio-temporal representations of the data. For this purpose, we propose a method that aggregates features along fiducial trajectories in a deeply learnt space. Performance of the proposed model shows that on average it is easier to distinguish among genuine facial expressions of emotion than among unfelt facial expressions of emotion and that certain emotion pairs such as contempt and disgust are more difficult to distinguish than the rest. Furthermore, the proposed methodology improves state of the art results on CK+ and OULU-CASIA datasets for video emotion recognition, and achieves competitive results when classifying facial action units on BP4D datase.