Patent search ap:("GOOGLE LLC") AND inv:"Viesturs Zarins" Page 1

1.

发明授权
Voice commands for an automated assistant utilized in smart dictation 有权

公开(公告)号：US12106758B2

公开(公告)日：2024-10-01

申请号：US17322765

申请日：2021-05-17

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Alvin Abdagic , Behshad Behzadi , Jacopo Sannazzaro Natta , Julia Proskurnia , Krzysztof Andrzej Goj , Srikanth Pandiri , Viesturs Zarins , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari

IPC: G06F3/01 , G06F3/0488 , G06N20/00 , G10L15/18 , G10L15/22 , G10L15/26

CPC classification number: G10L15/26 , G06F3/0488 , G06N20/00 , G10L15/18 , G10L15/22 , G10L2015/223

Abstract: Systems and methods described herein relate to determining whether to incorporate recognized text, that corresponds to a spoken utterance of a user of a client device, into a transcription displayed at the client device, or to cause an assistant command, that is associated with the transcription and that is based on the recognized text, to be performed by an automated assistant implemented by the client device. The spoken utterance is received during a dictation session between the user and the automated assistant. Implementations can process, using automatic speech recognition model(s), audio data that captures the spoken utterance to generate the recognized text. Further, implementations can determine whether to incorporate the recognized text into the transcription or cause the assistant command to be performed based on touch input being directed to the transcription, a state of the transcription, and/or audio-based characteristic(s) of the spoken utterance.

2.

发明申请
VOICE COMMANDS FOR AN AUTOMATED ASSISTANT UTILIZED IN SMART DICTATION 有权

公开(公告)号：US20220366910A1

公开(公告)日：2022-11-17

申请号：US17322765

申请日：2021-05-17

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Alvin Abdagic , Behshad Behzadi , Jacopo Sannazzaro Natta , Julia Proskurnia , Krzysztof Andrzej Goj , Srikanth Pandiri , Viesturs Zarins , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari

IPC: G10L15/26 , G10L15/22 , G10L15/18 , G06F3/0488 , G06N20/00

Abstract: Systems and methods described herein relate to determining whether to incorporate recognized text, that corresponds to a spoken utterance of a user of a client device, into a transcription displayed at the client device, or to cause an assistant command, that is associated with the transcription and that is based on the recognized text, to be performed by an automated assistant implemented by the client device. The spoken utterance is received during a dictation session between the user and the automated assistant. Implementations can process, using automatic speech recognition model(s), audio data that captures the spoken utterance to generate the recognized text. Further, implementations can determine whether to incorporate the recognized text into the transcription or cause the assistant command to be performed based on touch input being directed to the transcription, a state of the transcription, and/or audio-based characteristic(s) of the spoken utterance.

3.

发明授权
System(s) and method(s) to enable modification of an automatically arranged transcription in smart dictation 有权

公开(公告)号：US12266359B2

公开(公告)日：2025-04-01

申请号：US17902560

申请日：2022-09-02

Applicant: GOOGLE LLC

Inventor： Nicolo D'Ercole , Shumin Zhai , Swante Scholz , Mehek Sharma , Adrien Olczak , Akshay Kannan , Alvin Abdagic , Julia Proskurnia , Viesturs Zarins

IPC: G10L15/22 , G06F16/683 , G10L15/08

Abstract: Implementations described herein generally relate to generating a modification selectable element that may be provided for presentation to a user in a smart dictation session with an automated assistant. The modification selectable element may, when selected, cause a transcription, that includes textual data generated based on processing audio data that captures a spoken utterance and that is automatically arranged, to be modified. The transcription may be automatically arranged to include spacing, punctuation, capitalization, indentations, paragraph breaks, and/or other arrangement operations that are not specified by the user in providing the spoken utterance. Accordingly, a subsequent selection of the modification selectable element may cause these automatic arrangement operation(s), and/or the textual data locationally proximate to these automatic arrangement operation(s), to be modified. Implementations described herein also relate to generating the transcription and/or the modification selectable element on behalf of a third-party software application.

4.

发明申请
VOICE COMMANDS FOR AN AUTOMATED ASSISTANT UTILIZED IN SMART DICTATION 有权

公开(公告)号：US20240420699A1

公开(公告)日：2024-12-19

申请号：US18815252

申请日：2024-08-26

Applicant: GOOGLE LLC

Inventor： Victor Carbune , Alvin Abdagic , Behshad Behzadi , Jacopo Sannazzaro Natta , Julia Proskurnia , Krzysztof Andrzej Goj , Srikanth Pandiri , Viesturs Zarins , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari

IPC: G10L15/26 , G06F3/0488 , G06N20/00 , G10L15/18 , G10L15/22

Abstract: Systems and methods described herein relate to determining whether to incorporate recognized text, that corresponds to a spoken utterance of a user of a client device, into a transcription displayed at the client device, or to cause an assistant command, that is associated with the transcription and that is based on the recognized text, to be performed by an automated assistant implemented by the client device. The spoken utterance is received during a dictation session between the user and the automated assistant. Implementations can process, using automatic speech recognition model(s), audio data that captures the spoken utterance to generate the recognized text. Further, implementations can determine whether to incorporate the recognized text into the transcription or cause the assistant command to be performed based on touch input being directed to the transcription, a state of the transcription, and/or audio-based characteristic(s) of the spoken utterance.

5.

发明公开
SYSTEM(S) AND METHOD(S) TO ENABLE MODIFICATION OF AN AUTOMATICALLY ARRANGED TRANSCRIPTION IN SMART DICTATION 审中-公开

公开(公告)号：US20240029728A1

公开(公告)日：2024-01-25

申请号：US17902560

申请日：2022-09-02

Applicant: GOOGLE LLC

Inventor： Nicolo D'Ercole , Shumin Zhai , Swante Scholz , Mehek Sharma , Adrien Olczak , Akshay Kannan , Alvin Abdagic , Julia Proskurnia , Viesturs Zarins

IPC: G10L15/22 , G10L15/08 , G06F16/683

CPC classification number: G10L15/22 , G10L15/08 , G06F16/685

Abstract: Implementations described herein generally relate to generating a modification selectable element that may be provided for presentation to a user in a smart dictation session with an automated assistant. The modification selectable element may, when selected, cause a transcription, that includes textual data generated based on processing audio data that captures a spoken utterance and that is automatically arranged, to be modified. The transcription may be automatically arranged to include spacing, punctuation, capitalization, indentations, paragraph breaks, and/or other arrangement operations that are not specified by the user in providing the spoken utterance. Accordingly, a subsequent selection of the modification selectable element may cause these automatic arrangement operation(s), and/or the textual data locationally proximate to these automatic arrangement operation(s), to be modified. Implementations described herein also relate to generating the transcription and/or the modification selectable element on behalf of a third-party software application.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification