Deep learning

On Modeling Glottal Source Information for Phonation Assessment in Parkinson’s Disease

Parkinson’s disease produces several motor symptoms, including different speech impairments that are known as hypokinetic dysarthria. Symptoms associated to dysarthria affect different dimensions of speech such as phonation, articulation, prosody, …

Parallel Representation Learning for the Classification of Pathological Speech: Studies on Parkinson’s Disease and Cleft Lip and Palate

Speech signals may contain different paralinguistic aspects such as the presence of pathologies that affect the proper communication capabilities of a speaker. Those speech disorders have different origin depending on the type of the disease. For …

Automatic detection of Voice Onset Time in voiceless plosives using gated recurrent units

Voice Onset Time (VOT) has been used by researchers as an acoustic measure in order to gain some understanding about the impact of different motor speech disorders in speech production. However, VOT values are usually obtained manually, which is …

Multi-channel spectrograms for speech processing applications using deep learning methods

Time–frequency representations of the speech signals provide dynamic information about how the frequency component changes with time. In order to process this information, deep learning models with convolution layers can be used to obtain feature …

Convolutional Neural Networks and a Transfer Learning Strategy to Classify Parkinson’s Disease from Speech in Three Different Languages

Parkinson’s disease patients develop different speech impairments that affect their communication capabilities. The automatic assessment of the speech of the patients allows the development of computer aided tools to support the diagnosis and the …

Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users

This paper proposes a methodology for automatic detection of speech disorders in Cochlear Implant users by implementing a multi-channel Convolutional Neural Network. The model is fed with a 2-channel input which consists of two spectrograms computed …

Aprendizaje por transferencia en redes neuronales convolucionales para el diagnóstico y monitoreo de la enfermedad de Parkinson usando señales de voz en tres idiomas diferentes

Ingeniería electrónica, Universidad de Antioquia, 2019

Modelling of Speech Aspects in Parkinson’s Disease by Multitask Deep Learning

Master in medical engineering, Friedrich Alexander Universität (FAU), 2019.

Phonet

a Keras Python Tool Based on Gated Recurrent Neural Networks to Extract Phonological Posteriors from Speech

Verificación biométrica de identidad usando reconocimiento de rostros y patrones de tecleo

Ingeniería en telecomunicaciones, Universidad de Antioquia, 2018