Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
小さいフットプリントのマルチチャネルキーワードスポッティング
Document Type and Number:
Japanese Patent JP7345667
Kind Code:
B2
Abstract:
A method (800) to detect a hotword in a spoken utterance (120) includes receiving a sequence of input frames (210) characterizing streaming multi-channel audio (118). Each channel (119) of the streaming multi-channel audio includes respective audio features (510) captured by a separate dedicated microphone (107). For each input frame, the method includes processing, using a three-dimensional (3D) single value decomposition filter (SVDF) input layer (302) of a memorized neural network (300), the respective audio features of each channel in parallel and generating a corresponding multi-channel audio feature representation (420) based on a concatenation of the respective audio features (344). The method also includes generating, using sequentially-stacked SVDF layers (350), a probability score (360) indicating a presence of a hotword in the audio. The method also includes determining whether the probability score satisfies a threshold and, when satisfied, initiating a wake-up process on a user device (102).

Inventors:
Jiron Wu
Iten Fan
Application Number:
JP2022543118A
Publication Date:
September 15, 2023
Filing Date:
January 15, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
Google LLC
International Classes:
G10L15/28; G10L15/10; G10L15/16
Domestic Patent References:
JP2019133156A
JP2019020598A
Foreign References:
WO2017187516A1
Attorney, Agent or Firm:
Yasuhiko Murayama
Shinya Mihiro
Tatsuhiko Abe