Patent search ap:("GOOGLE LLC") AND inv:"Nathan David Howard" Page 1

1.

Patent application
DETECTING CONTINUING CONVERSATIONS WITH COMPUTING DEVICES [Status: Active]

Publication number: US20220414333A1

Publication date: 2022-12-29

Application number: US17902543

Application date: 2022-09-02

Applicant: GOOGLE LLC

Inventor: Nathan David Howard, Gabor Simko, Andrei Giurgiu, Behshad Behzadi, Marcin M. Nowak-Przygodzki

IPC: G06F40/284, G06F16/903, G06F16/901, G06N5/02, G10L15/22, G10L25/51, G10L15/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting a continued conversation are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance. The actions further include obtaining a first transcription of the first utterance. The actions further include receiving second audio data of a second utterance. The actions further include obtaining a second transcription of the second utterance. The actions further include determining whether the second utterance includes a query directed to a query processing system based on analysis of the second transcription and the first transcription or a response to the first query. The actions further include configuring the data routing component to provide the second transcription of the second utterance to the query processing system as a second query or bypass routing the second transcription.

2.

Granted patent
Detecting conversations with computing devices [Status: Active]

Publication number: US12223950B2

Publication date: 2025-02-11

Application number: US18144694

Application date: 2023-05-08

Applicant: GOOGLE LLC

Inventor: Marcin Nowak-Przygodzki, Nathan David Howard, Gabor Simko, Andrei Giurgiu, Behshad Behzadi

IPC: G10L15/18, G06F16/9032, G10L15/07, G10L15/08, G10L15/22, G10L25/51

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting a continued conversation are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance. The actions further include obtaining a first transcription of the first utterance. The actions further include receiving second audio data of a second utterance. The actions further include obtaining a second transcription of the second utterance. The actions further include determining whether the second utterance includes a query directed to a query processing system based on analysis of the second transcription and the first transcription or a response to the first query. The actions further include configuring the data routing component to provide the second transcription of the second utterance to the query processing system as a second query or bypass routing the second transcription.

3.

Granted patent
Joint acoustic echo cancelation, speech enhancement, and voice separation for automatic speech recognition [Status: Active]

Publication number: US12119014B2

Publication date: 2024-10-15

Application number: US17644108

Application date: 2021-12-14

Applicant: Google LLC

Inventor: Arun Narayanan, Tom O'malley, Quan Wang, Alex Park, James Walker, Nathan David Howard, Yanzhang He, Chung-Cheng Chiu

IPC: G10L21/0216, G06N3/04, G10L15/06, G10L21/0208, H04R3/04

CPC classification number: G10L21/0216, G06N3/04, G10L15/063, H04R3/04, G10L2021/02082

Abstract: A method for automatic speech recognition using joint acoustic echo cancellation, speech enhancement, and voice separation includes receiving, at a contextual frontend processing model, input speech features corresponding to a target utterance. The method also includes receiving, at the contextual frontend processing model, at least one of a reference audio signal, a contextual noise signal including noise prior to the target utterance, or a speaker embedding including voice characteristics of a target speaker that spoke the target utterance. The method further includes processing, using the contextual frontend processing model, the input speech features and the at least one of the reference audio signal, the contextual noise signal, or the speaker embedding vector to generate enhanced speech features.

4.

Granted patent
Utterance classifier [Status: Active]

Publication number: US11545147B2

Publication date: 2023-01-03

Application number: US16401349

Application date: 2019-05-02

Applicant: Google LLC

Inventor: Nathan David Howard, Gabor Simko, Maria Carolina Parada San Martin, Ramkarthik Kalyanasundaram, Guru Prakash Arumugam, Srinivas Vasudevan

IPC: G10L15/08, G10L15/22, G06F3/16, G10L15/16, G10L15/18, G10L15/30, G10L17/00

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for classification using neural networks. One method includes receiving audio data corresponding to an utterance. Obtaining a transcription of the utterance. Generating a representation of the audio data. Generating a representation of the transcription of the utterance. Providing (i) the representation of the audio data and (ii) the representation of the transcription of the utterance to a classifier that, based on a given representation of the audio data and a given representation of the transcription of the utterance, is trained to output an indication of whether the utterance associated with the given representation is likely directed to an automated assistant or is likely not directed to an automated assistant. Receiving, from the classifier, an indication of whether the utterance corresponding to the received audio data is likely directed to the automated assistant or is likely not directed to the automated assistant. Selectively instructing the automated assistant based at least on the indication of whether the utterance corresponding to the received audio data is likely directed to the automated assistant or is likely not directed to the automated assistant.

5.

Granted patent
Detecting continuing conversations with computing devices [Status: Active]

Publication number: US11893350B2

Publication date: 2024-02-06

Application number: US17902543

Application date: 2022-09-02

Applicant: GOOGLE LLC

Inventor: Nathan David Howard, Gabor Simko, Andrei Giurgiu, Behshad Behzadi, Marcin M. Nowak-Przygodzki

IPC: G06F40/284, G06F16/903, G06F16/901, G06N5/02, G10L15/22, G10L25/51, G10L15/08

CPC classification number: G06F40/284, G06F16/9024, G06F16/90335, G06N5/02, G10L15/08, G10L15/22, G10L25/51

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting a continued conversation are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance. The actions further include obtaining a first transcription of the first utterance. The actions further include receiving second audio data of a second utterance. The actions further include obtaining a second transcription of the second utterance. The actions further include determining whether the second utterance includes a query directed to a query processing system based on analysis of the second transcription and the first transcription or a response to the first query. The actions further include configuring the data routing component to provide the second transcription of the second utterance to the query processing system as a second query or bypass routing the second transcription.

6.

Patent application
UTTERANCE CLASSIFIER [Status: Active]

Publication number: US20220293101A1

Publication date: 2022-09-15

Application number: US17804657

Application date: 2022-05-31

Applicant: Google LLC

Inventor: Nathan David Howard, Gabor Simko, Maria Carolina Parada San Martin, Ramkarthik Kalyanasundaram, Guru Prakash Arumugam, Srinivas Vasudevan

IPC: G10L15/22, G06F3/16, G10L15/16, G10L15/18, G10L15/30

Abstract: A method includes receiving a spoken utterance that includes a plurality of words, and generating, using a neural network-based utterance classifier comprising a stack of multiple Long-Short Term Memory (LSTM) layers, a respective textual representation for each word of the plurality of words of the spoken utterance. The neural network-based utterance classifier is trained on negative training examples of spoken utterances not directed toward an automated assistant server. The method further includes determining, using the respective textual representation generated for each word of the plurality of words of the spoken utterance, that the spoken utterance is one of directed toward the automated assistant server or not directed toward the automated assistant server, and when the spoken utterance is directed toward the automated assistant server, generating instructions that cause the automated assistant server to generate a response to the spoken utterance.

7.

Granted patent
Utterance classifier [Status: Active]

Publication number: US10311872B2

Publication date: 2019-06-04

Application number: US15659016

Application date: 2017-07-25

Applicant: Google LLC

Inventor: Nathan David Howard, Gabor Simko, Maria Carolina Parada San Martin, Ramkarthik Kalyanasundaram, Guru Prakash Arumugam, Srinivas Vasudevan

IPC: G10L15/08, G10L15/22, G10L15/16, G10L15/30, G10L15/18

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for classification using neural networks. One method includes receiving audio data corresponding to an utterance. Obtaining a transcription of the utterance. Generating a representation of the audio data. Generating a representation of the transcription of the utterance. Providing (i) the representation of the audio data and (ii) the representation of the transcription of the utterance to a classifier that, based on a given representation of the audio data and a given representation of the transcription of the utterance, is trained to output an indication of whether the utterance associated with the given representation is likely directed to an automated assistant or is likely not directed to an automated assistant. Receiving, from the classifier, an indication of whether the utterance corresponding to the received audio data is likely directed to the automated assistant or is likely not directed to the automated assistant. Selectively instructing the automated assistant based at least on the indication of whether the utterance corresponding to the received audio data is likely directed to the automated assistant or is likely not directed to the automated assistant.

8.

Patent application
Joint Acoustic Echo Cancelation, Speech Enhancement, and Voice Separation for Automatic Speech Recognition [Status: Active]

Publication number: US20250029624A1

Publication date: 2025-01-23

Application number: US18906761

Application date: 2024-10-04

Applicant: Google LLC

Inventor: Arun Narayanan, Tom O'malley, Quan Wang, Alex Park, James Walker, Nathan David Howard, Yanzhang He, Chung-Cheng Chiu

IPC: G10L21/0216, G06N3/04, G10L15/06, G10L21/0208, H04R3/04

Abstract: A method for automatic speech recognition using joint acoustic echo cancellation, speech enhancement, and voice separation includes receiving, at a contextual frontend processing model, input speech features corresponding to a target utterance. The method also includes receiving, at the contextual frontend processing model, at least one of a reference audio signal, a contextual noise signal including noise prior to the target utterance, or a speaker embedding including voice characteristics of a target speaker that spoke the target utterance. The method further includes processing, using the contextual frontend processing model, the input speech features and the at least one of the reference audio signal, the contextual noise signal, or the speaker embedding vector to generate enhanced speech features.

9.

Granted patent
Detecting continuing conversations with computing devices [Status: Active]

Publication number: US11436411B2

Publication date: 2022-09-06

Application number: US16698350

Application date: 2019-11-27

Applicant: Google LLC

Inventor: Nathan David Howard, Gabor Simko, Andrei Giurgiu, Behshad Behzadi, Marcin M. Nowak-Przygodzki

IPC: G06F40/284, G06F16/903, G06F16/901, G06N5/02, G10L15/22, G10L25/51, G10L15/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting a continued conversation are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance. The actions further include obtaining a first transcription of the first utterance. The actions further include receiving second audio data of a second utterance. The actions further include obtaining a second transcription of the second utterance. The actions further include determining whether the second utterance includes a query directed to a query processing system based on analysis of the second transcription and the first transcription or a response to the first query. The actions further include configuring the data routing component to provide the second transcription of the second utterance to the query processing system as a second query or bypass routing the second transcription.

10.

Patent application
DETECTING CONTINUING CONVERSATIONS WITH COMPUTING DEVICES [Status: Under examination, published]

Publication number: US20200272690A1

Publication date: 2020-08-27

Application number: US16698350

Application date: 2019-11-27

Applicant: Google LLC

Inventor: Nathan David Howard, Gabor Simko, Andrei Giurgiu, Behshad Behzadi, Marcin M. Nowak-Przygodzki

IPC: G06F17/27, G06F16/903, G06F16/901, G06N5/02

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting a continued conversation are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance. The actions further include obtaining a first transcription of the first utterance. The actions further include receiving second audio data of a second utterance. The actions further include obtaining a second transcription of the second utterance. The actions further include determining whether the second utterance includes a query directed to a query processing system based on analysis of the second transcription and the first transcription or a response to the first query. The actions further include configuring the data routing component to provide the second transcription of the second utterance to the query processing system as a second query or bypass routing the second transcription.
