Time dependent ARMA for automatic recognition of fear-type emotions in speech

Juan Camilo Vásquez-Correa, Juan Rafael Orozco-Arroyave, Julián D Arias-Londoño, Jesús Francisco Vargas-Bonilla, LD Avendaño, Elmar Nöth

January 2015

Abstract

Speech signals are non-stationary processes whose properties change in time and frequency. The structure of a speech signal is also affected by paralinguistic phenomena such as emotions, pathologies, and cognitive impairments, among others. Non-stationarity can be modeled using several parametric techniques. A novel approach based on time-dependent auto-regressive moving average (TARMA) models is proposed here to model the non-stationarity of speech signals. The model is tested in the recognition of “fear-type” emotions in speech. The proposed approach is applied to model syllables and unvoiced segments extracted from recordings of the Berlin and enterface05 databases. The results indicate that TARMA models can be used for the automatic recognition of emotions in speech.
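
To make the idea of a time-dependent parametric model concrete, the following is a minimal sketch and not the authors' implementation. It assumes a simplified AR-only model (the moving-average part of TARMA is omitted), in which each coefficient a_i[t] is expanded on a small cosine basis and the expansion weights are estimated by ordinary least squares. The function names `cosine_basis`, `simulate_tar`, and `fit_tar`, the choice of basis, and the synthetic signal are all illustrative assumptions.

```python
import numpy as np

def cosine_basis(n_samples, n_basis):
    """Cosine functions g_k[t] on a normalized time grid; shape (n_samples, n_basis)."""
    t = (np.arange(n_samples) + 0.5) / n_samples
    return np.cos(np.pi * np.outer(t, np.arange(n_basis)))

def simulate_tar(a, rng):
    """Simulate x[t] + sum_i a[t, i-1] * x[t-i] = e[t] with unit-variance white noise e."""
    n_samples, order = a.shape
    x = np.zeros(n_samples)
    e = rng.standard_normal(n_samples)
    for t in range(n_samples):
        past = sum(a[t, i - 1] * x[t - i] for i in range(1, order + 1) if t >= i)
        x[t] = e[t] - past
    return x

def fit_tar(x, order, n_basis):
    """Estimate coefficient trajectories a_i[t] = sum_k theta[i, k] * g_k[t] by least squares."""
    n_samples = len(x)
    g = cosine_basis(n_samples, n_basis)
    rows = np.arange(order, n_samples)  # samples with a full set of past values
    # One block of regressors per lag: -x[t-i] * g_k[t] for every basis function k.
    blocks = [-x[rows - i][:, None] * g[rows] for i in range(1, order + 1)]
    phi = np.hstack(blocks)
    theta, *_ = np.linalg.lstsq(phi, x[rows], rcond=None)
    # Map the weights back to trajectories; result has shape (n_samples, order).
    return g @ theta.reshape(order, n_basis).T

# Synthetic example (assumed values): a1[t] drifts over time, a2[t] stays constant.
rng = np.random.default_rng(0)
n_samples = 4000
g = cosine_basis(n_samples, 2)
a_true = np.column_stack([-1.2 * g[:, 0] + 0.4 * g[:, 1], 0.7 * g[:, 0]])
x = simulate_tar(a_true, rng)
a_hat = fit_tar(x, order=2, n_basis=3)
```

In this sketch `a_hat` holds one estimated trajectory per AR coefficient, so the model tracks how the spectral content of the signal evolves within a segment. For a recognition task, the expansion weights or summaries of the trajectories could plausibly serve as features, although the features actually used in the paper are not stated in this abstract.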

Type

Conference paper

Publication

International Conference on Text, Speech, and Dialogue