Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SPEECH EMOTION RECOGNITION METHOD AND APPARATUS
Document Type and Number:
WIPO Patent Application WO/2024/008215
Kind Code:
A2
Abstract:
The present application provides a speech emotion recognition method and apparatus. The method comprises: obtaining a first audio feature encoding of a current audio frame and text feature information of a historical audio frame, wherein the historical audio frame precedes the current audio frame; predicting a text feature encoding of the current audio frame on the basis of the text feature information of the historical audio frame; performing fusion on the first audio feature encoding and the text feature encoding of the current audio frame to obtain a fused feature vector; and performing speech emotion recognition on the basis of the fused feature vector to obtain a speech emotion recognition result of the current audio frame. The present application utilizes the text feature information of the historical audio frame to predict the text feature encoding of the current audio frame, and after performing fusion on the first audio feature encoding and the text feature encoding of the current audio frame, same performs speech emotion recognition; deep fusion is carried out on audio information and text information, and the accuracy of speech emotion recognition is able to be improved.

Inventors:
LIU RUZHOU (CN)
Application Number:
PCT/CN2023/117475
Publication Date:
January 11, 2024
Filing Date:
September 07, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SF TECH CO LTD (CN)
International Classes:
G10L25/63
Attorney, Agent or Firm:
BEIJING BRIGHT IP AGENCY CO., LTD. (CN)
Download PDF: