Danish emotional speech database

9/1/2023

Recognizing emotion in speech is difficult because of speaker factors such as gender, age, and cultural background (nationality, ethnicity, and region), as well as the acoustic environment. Among these factors, the cultural background of the speaker has a strong influence on how emotion is expressed. This explains the unsatisfactory performance of emotion recognition engines built from mixed-cultural samples.

To address this issue, a two-level hierarchical engine has been designed to identify emotion in speech from different cultural backgrounds. The first level is a culture identification system that identifies the corpus to which an input utterance belongs. Because most speakers involved in constructing a given corpus come from the same locality and cultural background, we assume that a corpus represents the cultural background of its speakers. Based on the response of the first-level classifier, the input utterance is forwarded to the appropriate corpus-specific emotion recognition engine in the second level.
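
The routing performed by the two-level engine described above can be sketched in a few lines of Python. This is only an illustration of the dispatch logic, not the actual system: the class name `HierarchicalEmotionEngine`, the corpus labels `corpus_a` and `corpus_b`, the two-element feature vectors, and the threshold-based toy classifiers are all assumptions made for the example. In practice each classifier would be a trained model operating on acoustic features.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Sequence, Tuple

Features = Sequence[float]
Classifier = Callable[[Features], str]


@dataclass
class HierarchicalEmotionEngine:
    """Hypothetical two-level engine: corpus identification, then emotion."""

    identify_corpus: Classifier              # level 1: which corpus (culture)?
    emotion_engines: Dict[str, Classifier]   # level 2: one engine per corpus

    def predict(self, features: Features) -> Tuple[str, str]:
        # Level 1: decide which corpus the utterance most likely belongs to.
        corpus = self.identify_corpus(features)
        # Level 2: forward the utterance to that corpus's emotion engine.
        emotion = self.emotion_engines[corpus](features)
        return corpus, emotion


# Toy stand-ins for trained classifiers (assumed, for illustration only).
def toy_corpus_identifier(features: Features) -> str:
    return "corpus_a" if features[0] < 0.5 else "corpus_b"


engine = HierarchicalEmotionEngine(
    identify_corpus=toy_corpus_identifier,
    emotion_engines={
        "corpus_a": lambda f: "angry" if f[1] > 0.5 else "neutral",
        "corpus_b": lambda f: "happy" if f[1] > 0.5 else "sad",
    },
)

print(engine.predict([0.2, 0.9]))
```

Keeping the second-level engines in a dictionary keyed by corpus label means a new corpus can be supported by adding one entry, provided the first-level classifier is retrained to emit that label.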
