Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Acoustic Modeling for Emotion RecognitionEmotion Recognition Using Prosodic Features

Acoustic Modeling for Emotion Recognition: Emotion Recognition Using Prosodic Features [In computer vision, a feature is a set of measurements. Each measurement contains a piece of information and specifies the property or characteristics of the object. In speech recognition techniques, how the speech signals are produced and perceived by the human is starting point of the research. Human speech communication produces ideas (word sequence) which are made within the speaker brain. These word sequence are delivered by his/her text generator. The general human vocal system is modeled by the speech generator. The speech generator converts the word sequence into speech signal and is transferred to listener through air. At the listener side, the human auditory system receives these acoustic signal and listeners brain starts the processing of signal to understand its content. The speech recognizer modeled by the speech decoder, it decodes the acoustic signal into word sequence. So speech production and speech perception are in inverse processes in the speech recognition application.] http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png

Loading next page...
 
/lp/springer-journals/acoustic-modeling-for-emotion-recognition-emotion-recognition-using-cc50KqrXj5
Publisher
Springer International Publishing
Copyright
© The Author(s) - SpringerBriefs 2015
ISBN
978-3-319-15529-6
Pages
7 –15
DOI
10.1007/978-3-319-15530-2_2
Publisher site
See Chapter on Publisher Site

Abstract

[In computer vision, a feature is a set of measurements. Each measurement contains a piece of information and specifies the property or characteristics of the object. In speech recognition techniques, how the speech signals are produced and perceived by the human is starting point of the research. Human speech communication produces ideas (word sequence) which are made within the speaker brain. These word sequence are delivered by his/her text generator. The general human vocal system is modeled by the speech generator. The speech generator converts the word sequence into speech signal and is transferred to listener through air. At the listener side, the human auditory system receives these acoustic signal and listeners brain starts the processing of signal to understand its content. The speech recognizer modeled by the speech decoder, it decodes the acoustic signal into word sequence. So speech production and speech perception are in inverse processes in the speech recognition application.]

Published: Mar 15, 2015

Keywords: Speech Signal; Audio Signal; Word Sequence; Prosodic Feature; Speech Generator

There are no references for this article.