Patent search ap:("GOOGLE LLC") AND inv:"Justin Lu" Page 1

1.

发明授权
Arranging and/or clearing speech-to-text content without a user providing express instructions 有权

公开(公告)号：US12033637B2

公开(公告)日：2024-07-09

申请号：US17337804

申请日：2021-06-03

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Krishna Sapkota , Behshad Behzadi , Julia Proskurnia , Jacopo Sannazzaro Natta , Justin Lu , Magali Boizot-Roche , Márius {hacek over (S)}ajgalík , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari

IPC: G10L15/26 , G10L15/22

CPC classification number: G10L15/26 , G10L15/22 , G10L2015/223

Abstract: Implementations described herein relate to an application and/or automated assistant that can identify arrangement operations to perform for arranging text during speech-to-text operations—without a user having to expressly identify the arrangement operations. In some instances, a user that is dictating a document (e.g., an email, a text message, etc.) can provide a spoken utterance to an application in order to incorporate textual content. However, in some of these instances, certain corresponding arrangements are needed for the textual content in the document. The textual content that is derived from the spoken utterance can be arranged by the application based on an intent, vocalization features, and/or contextual features associated with the spoken utterance and/or a type of the application associated with the document, without the user expressly identifying the corresponding arrangements. In this way, the application can infer content arrangement operations from a spoken utterance that only specifies the textual content.

2.

发明授权
Contextual suppression of assistant command(s) 有权

公开(公告)号：US11557293B2

公开(公告)日：2023-01-17

申请号：US17321994

申请日：2021-05-17

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/18 , G10L25/78 , G10L15/05 , G10L15/08

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

3.

发明申请
ARRANGING AND/OR CLEARING SPEECH-TO-TEXT CONTENT WITHOUT A USER PROVIDING EXPRESS INSTRUCTIONS 有权

公开(公告)号：US20220366911A1

公开(公告)日：2022-11-17

申请号：US17337804

申请日：2021-06-03

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Krishna Sapkota , Behshad Behzadi , Julia Proskurnia , Jacopo Sannazzaro Natta , Justin Lu , Magali Boizot-Roche , Márius Sajgalík , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari

IPC: G10L15/26 , G10L15/22

Abstract: Implementations described herein relate to an application and/or automated assistant that can identify arrangement operations to perform for arranging text during speech-to-text operations—without a user having to expressly identify the arrangement operations. In some instances, a user that is dictating a document (e.g., an email, a text message, etc.) can provide a spoken utterance to an application in order to incorporate textual content. However, in some of these instances, certain corresponding arrangements are needed for the textual content in the document. The textual content that is derived from the spoken utterance can be arranged by the application based on an intent, vocalization features, and/or contextual features associated with the spoken utterance and/or a type of the application associated with the document, without the user expressly identifying the corresponding arrangements. In this way, the application can infer content arrangement operations from a spoken utterance that only specifies the textual content.

4.

发明申请
DYNAMICALLY CONFIGURING A WARM WORD BUTTON WITH ASSISTANT COMMANDS 有权

公开(公告)号：US20230061929A1

公开(公告)日：2023-03-02

申请号：US17532315

申请日：2021-11-22

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Antonio Gaetani , Bastiaan Van Eeckhoudt , Daniel Valcarce , Michael Golikov , Justin Lu , Ondrej Skopek , Nicolo D'Ercole , Zaheed Sabur , Behshad Behzadi , Luv Kothari

IPC: G10L17/22

Abstract: Implementations described herein relate to configuring a dynamic warm word button, that is associated with a client device, with particular assistant commands based on detected occurrences of warm word activation events at the client device. In response to detecting an occurrence of a given warm word activation event at the client device, implementations can determine whether user verification is required for a user that actuated the warm word button. Further, in response to determining that the user verification is required for the user that actuated the warm word button, the user verification can be performed. Moreover, in response to determining that the user that actuated the warm word button has been verified, implementations can cause an automated assistant to perform the particular assistant command associated with the warm word activation event. Audio-based and/or non-audio-based techniques can be utilized to perform the user verification.

5.

发明授权
Contextual suppression of assistant command(s) 有权

公开(公告)号：US12057119B2

公开(公告)日：2024-08-06

申请号：US18092883

申请日：2023-01-03

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/05 , G10L15/18 , G10L25/78 , G10L15/08

CPC classification number: G10L15/22 , G10L15/05 , G10L15/1815 , G10L25/78 , G10L2015/088 , G10L2015/223

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

6.

发明公开
CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S) 审中-公开

公开(公告)号：US20230143177A1

公开(公告)日：2023-05-11

申请号：US18092883

申请日：2023-01-03

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/05 , G10L15/18 , G10L25/78

CPC classification number: G10L15/22 , G10L15/05 , G10L15/1815 , G10L25/78 , G10L2015/088

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

7.

发明申请
CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S) 有权

公开(公告)号：US20220366903A1

公开(公告)日：2022-11-17

申请号：US17321994

申请日：2021-05-17

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/18 , G10L15/05 , G10L25/78

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

8.

发明公开
CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S) 审中-公开

公开(公告)号：US20240347060A1

公开(公告)日：2024-10-17

申请号：US18750663

申请日：2024-06-21

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov

IPC: G10L15/22 , G10L15/05 , G10L15/08 , G10L15/18 , G10L25/78

CPC classification number: G10L15/22 , G10L15/05 , G10L15/1815 , G10L25/78 , G10L2015/088 , G10L2015/223

Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.

9.

发明公开
ARRANGING AND/OR CLEARING SPEECH-TO-TEXT CONTENT WITHOUT A USER PROVIDING EXPRESS INSTRUCTIONS 审中-公开

公开(公告)号：US20240321277A1

公开(公告)日：2024-09-26

申请号：US18677629

申请日：2024-05-29

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Krishna Sapkota , Behshad Behzadi , Julia Proskurnia , Jacopo Sannazzaro Natta , Justin Lu , Magali Boizot-Roche , Marius Sajgalik , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari

IPC: G10L15/26 , G10L15/22

CPC classification number: G10L15/26 , G10L15/22 , G10L2015/223

Abstract: Implementations described herein relate to an application and/or automated assistant that can identify arrangement operations to perform for arranging text during speech-to-text operations—without a user having to expressly identify the arrangement operations. In some instances, a user that is dictating a document (e.g., an email, a text message, etc.) can provide a spoken utterance to an application in order to incorporate textual content. However, in some of these instances, certain corresponding arrangements are needed for the textual content in the document. The textual content that is derived from the spoken utterance can be arranged by the application based on an intent, vocalization features, and/or contextual features associated with the spoken utterance and/or a type of the application associated with the document, without the user expressly identifying the corresponding arrangements. In this way, the application can infer content arrangement operations from a spoken utterance that only specifies the textual content.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification