Patent search ap:("Google LLC") AND inv:"Aleksandar Kracun" Page 3

21.

发明授权
Adapting automated speech recognition parameters based on hotword properties 有权

公开(公告)号：US12080276B2

公开(公告)日：2024-09-03

申请号：US18188238

申请日：2023-03-22

Applicant: Google LLC

Inventor： Matthew Sharifi , Aleksandar Kracun

IPC: G10L15/06 , G10L15/16 , G10L15/22 , G10L15/28 , G10L25/90 , G10L15/08 , G10L25/78

CPC classification number: G10L15/06 , G10L15/16 , G10L15/22 , G10L15/28 , G10L25/90 , G10L2015/088 , G10L2025/783

Abstract: A method for optimizing speech recognition includes receiving a first acoustic segment characterizing a hotword detected by a hotword detector in streaming audio captured by a user device, extracting one or more hotword attributes from the first acoustic segment, and adjusting, based on the one or more hotword attributes extracted from the first acoustic segment, one or more speech recognition parameters of an automated speech recognition (ASR) model. After adjusting the speech recognition parameters of the ASR model, the method also includes processing, using the ASR model, a second acoustic segment to generate a speech recognition result. The second acoustic segment characterizes a spoken query/command that follows the first acoustic segment in the streaming audio captured by the user device.

22.

发明授权
Freeze words 有权

公开(公告)号：US12073826B2

公开(公告)日：2024-08-27

申请号：US18322149

申请日：2023-05-23

Applicant: Google LLC

Inventor： Matthew Sharifi , Aleksandar Kracun

IPC: G10L15/16 , G10L15/05 , G10L15/08

CPC classification number: G10L15/16 , G10L15/05 , G10L2015/088

Abstract: A method for detecting freeze words includes receiving audio data that corresponds to an utterance spoken by a user and captured by a user device associated with the user. The method also includes processing, using a speech recognizer, the audio data to determine that the utterance includes a query for a digital assistant to perform an operation. The speech recognizer is configured to trigger endpointing of the utterance after a predetermined duration of non-speech in the audio data. Before the predetermined duration of non-speech, the method includes detecting a freeze word in the audio data. In response to detecting the freeze word in the audio data, the method also includes triggering a hard microphone closing event at the user device. The hard microphone closing event prevents the user device from capturing any audio subsequent to the freeze word.

23.

发明公开
Voice Query QoS based on Client-Computed Content Metadata 审中-公开

公开(公告)号：US20240029740A1

公开(公告)日：2024-01-25

申请号：US18480798

申请日：2023-10-04

Applicant: Google LLC

Inventor： Matthew Sharifi , Aleksandar Kracun

IPC: G10L15/30 , G06F16/63 , G10L15/08 , G10L15/22 , H04L67/568

CPC classification number: G10L15/30 , G06F16/63 , G10L15/08 , G10L15/22 , H04L67/568 , G10L2015/088

Abstract: A method includes receiving an automated speech recognition (ASR) request from a user device that includes a speech input captured by the user device and content metadata associated with the speech input. The content metadata is generated by the user device. The method also includes determining a priority score for the ASR request based on the content metadata associated with the speech input and caching the ASR request in a pre-processing backlog of pending ASR requests each having a corresponding priority score. The pending ASR requests in the pre-processing backlog are ranked in order of the priority scores. The method also includes providing, from the pre-processing backlog, one or more of the pending ASR requests to a backend-side ASR module, wherein pending ASR requests associated with higher priority scores are processed before pending ASR requests associated with lower priority scores.

24.

发明公开
Freeze Words 审中-公开

公开(公告)号：US20230298575A1

公开(公告)日：2023-09-21

申请号：US18322149

申请日：2023-05-23

Applicant: Google LLC

Inventor： Matthew Sharifi , Aleksandar Kracun

IPC: G10L15/16 , G10L15/05

CPC classification number: G10L15/16 , G10L15/05 , G10L2015/088

Abstract: A method for detecting freeze words includes receiving audio data that corresponds to an utterance spoken by a user and captured by a user device associated with the user. The method also includes processing, using a speech recognizer, the audio data to determine that the utterance includes a query for a digital assistant to perform an operation. The speech recognizer is configured to trigger endpointing of the utterance after a predetermined duration of non-speech in the audio data. Before the predetermined duration of non-speech, the method includes detecting a freeze word in the audio data. In response to detecting the freeze word in the audio data, the method also includes triggering a hard microphone closing event at the user device. The hard microphone closing event prevents the user device from capturing any audio subsequent to the freeze word.

25.

发明授权
Adapting hotword recognition based on personalized negatives 有权

公开(公告)号：US11749267B2

公开(公告)日：2023-09-05

申请号：US16953510

申请日：2020-11-20

Applicant: Google LLC

Inventor： Aleksandar Kracun , Matthew Sharifi

IPC: G10L15/22 , G10L15/197 , G10L17/06 , G10L17/24 , G10L15/30 , G10L15/08

CPC classification number: G10L15/22 , G10L15/197 , G10L15/30 , G10L17/06 , G10L17/24 , G10L2015/088 , G10L2015/223

Abstract: A method for adapting hotword recognition includes receiving audio data characterizing a hotword event detected by a first stage hotword detector in streaming audio captured by a user device. The method also includes processing, using a second stage hotword detector, the audio data to determine whether a hotword is detected by the second stage hot word detector in a first segment of the audio data. When the hotword is not detected by the second stage hotword detector, the method includes, classifying the first segment of the audio data as containing a negative hotword that caused a false detection of the hotword event in the streaming audio by the first stage hotword detector. Based on the first segment of the audio data classified as containing the negative hotword, the method includes updating the first stage hotword detector to prevent triggering the hotword event in subsequent audio data that contains the negative hotword.

26.

发明授权
Detecting and suppressing voice queries 有权

公开(公告)号：US11341969B2

公开(公告)日：2022-05-24

申请号：US16885072

申请日：2020-05-27

Applicant: Google LLC

Inventor： Alexander H. Gruenstein , Aleksandar Kracun , Matthew Sharifi

IPC: G10L15/22 , G10L15/08 , G10L15/26 , H04L29/06 , G10L17/00 , G10L15/06 , G06F16/432

Abstract: A computing system receives requests from client devices to process voice queries that have been detected in local environments of the client devices. The system identifies that a value that is based on a number of requests to process voice queries received by the system during a specified time interval satisfies one or more criteria. In response, the system triggers analysis of at least some of the requests received during the specified time interval to trigger analysis of at least some received requests to determine a set of requests that each identify a common voice query. The system can generate an electronic fingerprint that indicates a distinctive model of the common voice query. The fingerprint can then be used to detect an illegitimate voice query identified in a request from a client device at a later time.

27.

发明申请
SPEAKER DIARIZATION 有权

公开(公告)号：US20210295824A1

公开(公告)日：2021-09-23

申请号：US17222939

申请日：2021-04-05

Applicant: Google LLC

Inventor： Aleksandar Kracun , Richard Cameron Rose

IPC: G10L15/08 , G10L15/22

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker diarization are disclosed. In one aspect a method includes the actions of receiving audio data corresponding to an utterance. The actions further include determining that the audio data includes an utterance of a predefined hotword spoken by a first speaker. The actions further include identifying a first portion of the audio data that includes speech from the first speaker. The actions further include identifying a second portion of the audio data that includes speech from a second, different speaker. The actions further include transmitting the first portion of the audio data that includes speech from the first speaker and suppressing transmission of the second portion of the audio data that includes speech from the second, different speaker.

28.

发明授权
Detecting and suppressing voice queries 有权

公开(公告)号：US12205588B2

公开(公告)日：2025-01-21

申请号：US17749892

申请日：2022-05-20

Applicant: GOOGLE LLC

Inventor： Alexander H. Gruenstein , Aleksandar Kracun , Matthew Sharifi

IPC: G10L15/22 , G10L15/08 , G10L15/26 , G10L17/00 , H04L9/40 , G06F16/33 , G06F16/432 , G10L15/06 , G10L15/18

Abstract: A computing system receives requests from client devices to process voice queries that have been detected in local environments of the client devices. The system identifies that a value that is based on a number of requests to process voice queries received by the system during a specified time interval satisfies one or more criteria. In response, the system triggers analysis of at least some of the requests received during the specified time interval to trigger analysis of at least some received requests to determine a set of requests that each identify a common voice query. The system can generate an electronic fingerprint that indicates a distinctive model of the common voice query. The fingerprint can then be used to detect an illegitimate voice query identified in a request from a client device at a later time.

29.

发明公开
Adapting Automated Speech Recognition Parameters Based on Hotword Properties 审中-公开

公开(公告)号：US20230223014A1

公开(公告)日：2023-07-13

申请号：US18188238

申请日：2023-03-22

Applicant: Google LLC

Inventor： Matthew Sharifi , Aleksandar Kracun

IPC: G10L15/16 , G10L15/28 , G10L15/22 , G10L25/90 , G10L15/08 , G10L25/78

CPC classification number: G10L15/16 , G10L15/22 , G10L15/28 , G10L25/90 , G10L2015/088 , G10L2025/783

Abstract: A method for optimizing speech recognition includes receiving a first acoustic segment characterizing a hotword detected by a hotword detector in streaming audio captured by a user device, extracting one or more hotword attributes from the first acoustic segment, and adjusting, based on the one or more hotword attributes extracted from the first acoustic segment, one or more speech recognition parameters of an automated speech recognition (ASR) model. After adjusting the speech recognition parameters of the ASR model, the method also includes processing, using the ASR model, a second acoustic segment to generate a speech recognition result. The second acoustic segment characterizes a spoken query/command that follows the first acoustic segment in the streaming audio captured by the user device.

30.

发明授权
Freeze words 有权

公开(公告)号：US11688392B2

公开(公告)日：2023-06-27

申请号：US17115742

申请日：2020-12-08

Applicant: Google LLC

Inventor： Matthew Sharifi , Aleksandar Kracun

IPC: G10L15/16 , G10L15/05 , G10L15/08

CPC classification number: G10L15/16 , G10L15/05 , G10L2015/088

Abstract: A method for detecting freeze words includes receiving audio data that corresponds to an utterance spoken by a user and captured by a user device associated with the user. The method also includes processing, using a speech recognizer, the audio data to determine that the utterance includes a query for a digital assistant to perform an operation. The speech recognizer is configured to trigger endpointing of the utterance after a predetermined duration of non-speech in the audio data. Before the predetermined duration of non-speech, the method includes detecting a freeze word in the audio data. In response to detecting the freeze word in the audio data, the method also includes triggering a hard microphone closing event at the user device. The hard microphone closing event prevents the user device from capturing any audio subsequent to the freeze word.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification