Patent search ap:("Google LLC") AND inv:"Yiteng Huang" Page 2

11.

发明授权
Selective adaptation and utilization of noise reduction technique in invocation phrase detection 有权

公开(公告)号：US12260857B2

公开(公告)日：2025-03-25

申请号：US18662334

申请日：2024-05-13

Applicant: GOOGLE LLC

Inventor： Christopher Hughes , Yiteng Huang , Turaj Zakizadeh Shabestary , Taylor Applebaum

IPC: G10L15/22 , G10L15/02 , G10L15/08 , G10L15/20 , G10L21/0232 , G10L25/84 , G10L21/0216

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

12.

发明申请
Cascade Architecture for Noise-Robust Keyword Spotting 有权

公开(公告)号：US20230097197A1

公开(公告)日：2023-03-30

申请号：US17905137

申请日：2020-04-08

Applicant: Google LLC

Inventor： Yiteng Huang , Alexander H. Gruenstein

IPC: G10L21/0216 , G10L15/08 , G10L15/22

Abstract: A method (400) includes receiving, at a first processor (110) of a user device (102), streaming multi-channel audio (118) captured by an array of microphones (107), each channel (119) including respective audio features. For each channel, the method also includes processing, by the first processor, using a first stage hotword detector (210), the respective audio features to determine whether a hotword is detected. When the first stage hotword detector detects the hotword, the method also includes the first processor providing chomped raw audio data (212) to a second processor that processes, using a first noise cleaning algorithm (250), the chomped raw audio data to generate a clean monophonic audio chomp (260). The method also includes processing, by the second processor using a second stage hotword detector (220), the clean monophonic audio chomp to detect the hotword.

13.

发明申请
Small Footprint Multi-Channel Keyword Spotting 有权

公开(公告)号：US20230022800A1

公开(公告)日：2023-01-26

申请号：US17757260

申请日：2020-01-15

Applicant: Google LLC

Inventor： Jilong Wu , Yiteng Huang

IPC: G10L15/16 , H04R3/00 , G10L15/28

Abstract: A method (800) to detect a hotword in a spoken utterance (120) includes receiving a sequence of input frames (210) characterizing streaming multi-channel audio (118). Each channel (119) of the streaming multi-channel audio includes respective audio features (510) captured by a separate dedicated microphone (107). For each input frame, the method includes processing, using a three-dimensional (3D) single value decomposition filter (SVDF) input layer (302) of a memorized neural network (300), the respective audio features of each channel in parallel and generating a corresponding multi-channel audio feature representation (420) based on a concatenation of the respective audio features (344). The method also includes generating, using sequentially-stacked SVDF layers (350), a probability score (360) indicating a presence of a hotword in the audio. The method also includes determining whether the probability score satisfies a threshold and, when satisfied, initiating a wake-up process on a user device (102).

14.

发明申请
SELECTIVE ADAPTATION AND UTILIZATION OF NOISE REDUCTION TECHNIQUE IN INVOCATION PHRASE DETECTION 有权

公开(公告)号：US20220392441A1

公开(公告)日：2022-12-08

申请号：US17886726

申请日：2022-08-12

Applicant: Google LLC

Inventor： Christopher Hughes , Yiteng Huang , Turaj Zakizadeh Shabestary , Taylor Applebaum

IPC: G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0232 , G10L25/84

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

15.

发明申请
SELECTIVE ADAPTATION AND UTILIZATION OF NOISE REDUCTION TECHNIQUE IN INVOCATION PHRASE DETECTION 审中-公开

公开(公告)号：US20200294496A1

公开(公告)日：2020-09-17

申请号：US16886139

申请日：2020-05-28

Applicant: Google LLC

Inventor： Christopher Hughes , Yiteng Huang , Turaj Zakizadeh Shabestary , Taylor Applebaum

IPC: G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0232 , G10L25/84

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

Patent Agency Ranking