-
公开(公告)号:US20240078083A1
公开(公告)日:2024-03-07
申请号:US18388465
申请日:2023-11-09
Applicant: GOOGLE LLC
Inventor: Srikanth Pandiri , Luv Kolhari , Behshad Behzadi , Zaheed Sabur , Domenico Carbotta , Akshay Kannan , Qi Wang , Gokay Baris Gultekin , Angana Ghosh , Xu Liu , Yang Lu , Steve Cheng
IPC: G06F3/16 , G06F3/0481 , G06F3/0484 , G06F3/04886 , G06F40/117 , G06F40/143 , G06F40/174 , G06F40/30 , G10L15/22 , G10L15/26
CPC classification number: G06F3/167 , G06F3/0481 , G06F3/0484 , G06F3/04886 , G06F40/117 , G06F40/143 , G06F40/174 , G06F40/30 , G10L15/22 , G10L15/26
Abstract: Implementations set forth herein relate to an automated assistant that can selectively determine whether to incorporate a verbatim interpretation of portions spoken utterances into an entry field and/or incorporate synonymous content into the entry field. For instance, a user can be accessing an interface that provides an entry field (e.g., address field) for receiving user input. In order to provide input for entry field, the user can select the entry field and/or access a GUI keyboard to initialize an automated assistant for assisting with filling the entry field. Should the user provide a spoken utterance, the user can elect to provide a spoken utterance that embodies the intended input (e.g., an actual address) or a reference to the intended input (e.g., a name). In response to the spoken utterance, the automated assistant can fill the entry field with the intended input without necessitating further input from the user.
-
公开(公告)号:US20220253277A1
公开(公告)日:2022-08-11
申请号:US17619414
申请日:2019-12-13
Applicant: GOOGLE LLC
Inventor: Srikanth Pandiri , Luv Kothari , Behshad Behzadi , Zaheed Sabur , Domenico Carbotta , Akshay Kannan , Qi Wang , Gokay Baris Gultekin , Angana Ghosh , Xu Liu , Yang Lu , Steve Cheng
IPC: G06F3/16 , G06F3/0481 , G06F40/174 , G10L15/26 , G06F3/0484 , G06F3/04886 , G06F40/30 , G06F40/143 , G06F40/117 , G10L15/22
Abstract: Implementations set forth herein relate to an automated assistant that can selectively determine whether to incorporate a verbatim interpretation of portions spoken utterances into an entry field and/or incorporate synonymous content into the entry field. For instance, a user can be accessing an interface that provides an entry field (e.g., address field) for receiving user input. In order to provide input for entry field, the user can select the entry field and/or access a GUI keyboard to initialize an automated assistant for assisting with filling the entry field. Should the user provide a spoken utterance, the user can elect to provide a spoken utterance that embodies the intended input (e.g., an actual address) or a reference to the intended input (e.g., a name). In response to the spoken utterance, the automated assistant can fill the entry field with the intended input without necessitating further input from the user.
-
公开(公告)号:US12106758B2
公开(公告)日:2024-10-01
申请号:US17322765
申请日:2021-05-17
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Alvin Abdagic , Behshad Behzadi , Jacopo Sannazzaro Natta , Julia Proskurnia , Krzysztof Andrzej Goj , Srikanth Pandiri , Viesturs Zarins , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari
CPC classification number: G10L15/26 , G06F3/0488 , G06N20/00 , G10L15/18 , G10L15/22 , G10L2015/223
Abstract: Systems and methods described herein relate to determining whether to incorporate recognized text, that corresponds to a spoken utterance of a user of a client device, into a transcription displayed at the client device, or to cause an assistant command, that is associated with the transcription and that is based on the recognized text, to be performed by an automated assistant implemented by the client device. The spoken utterance is received during a dictation session between the user and the automated assistant. Implementations can process, using automatic speech recognition model(s), audio data that captures the spoken utterance to generate the recognized text. Further, implementations can determine whether to incorporate the recognized text into the transcription or cause the assistant command to be performed based on touch input being directed to the transcription, a state of the transcription, and/or audio-based characteristic(s) of the spoken utterance.
-
公开(公告)号:US11853649B2
公开(公告)日:2023-12-26
申请号:US17619414
申请日:2019-12-13
Applicant: GOOGLE LLC
Inventor: Srikanth Pandiri , Luv Kothari , Behshad Behzadi , Zaheed Sabur , Domenico Carbotta , Akshay Kannan , Qi Wang , Gokay Baris Gultekin , Angana Ghosh , Xu Liu , Yang Lu , Steve Cheng
IPC: G06F40/117 , G06F3/16 , G06F40/143 , G06F40/30 , G06F40/174 , G06F3/0481 , G06F3/0484 , G06F3/04886 , G10L15/22 , G10L15/26
CPC classification number: G06F3/167 , G06F3/0481 , G06F3/0484 , G06F3/04886 , G06F40/117 , G06F40/143 , G06F40/174 , G06F40/30 , G10L15/22 , G10L15/26
Abstract: Implementations set forth herein relate to an automated assistant that can selectively determine whether to incorporate a verbatim interpretation of portions spoken utterances into an entry field and/or incorporate synonymous content into the entry field. For instance, a user can be accessing an interface that provides an entry field (e.g., address field) for receiving user input. In order to provide input for entry field, the user can select the entry field and/or access a GUI keyboard to initialize an automated assistant for assisting with filling the entry field. Should the user provide a spoken utterance, the user can elect to provide a spoken utterance that embodies the intended input (e.g., an actual address) or a reference to the intended input (e.g., a name). In response to the spoken utterance, the automated assistant can fill the entry field with the intended input without necessitating further input from the user.
-
公开(公告)号:US20220366910A1
公开(公告)日:2022-11-17
申请号:US17322765
申请日:2021-05-17
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Alvin Abdagic , Behshad Behzadi , Jacopo Sannazzaro Natta , Julia Proskurnia , Krzysztof Andrzej Goj , Srikanth Pandiri , Viesturs Zarins , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari
IPC: G10L15/26 , G10L15/22 , G10L15/18 , G06F3/0488 , G06N20/00
Abstract: Systems and methods described herein relate to determining whether to incorporate recognized text, that corresponds to a spoken utterance of a user of a client device, into a transcription displayed at the client device, or to cause an assistant command, that is associated with the transcription and that is based on the recognized text, to be performed by an automated assistant implemented by the client device. The spoken utterance is received during a dictation session between the user and the automated assistant. Implementations can process, using automatic speech recognition model(s), audio data that captures the spoken utterance to generate the recognized text. Further, implementations can determine whether to incorporate the recognized text into the transcription or cause the assistant command to be performed based on touch input being directed to the transcription, a state of the transcription, and/or audio-based characteristic(s) of the spoken utterance.
-
公开(公告)号:US20240420699A1
公开(公告)日:2024-12-19
申请号:US18815252
申请日:2024-08-26
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Alvin Abdagic , Behshad Behzadi , Jacopo Sannazzaro Natta , Julia Proskurnia , Krzysztof Andrzej Goj , Srikanth Pandiri , Viesturs Zarins , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari
IPC: G10L15/26 , G06F3/0488 , G06N20/00 , G10L15/18 , G10L15/22
Abstract: Systems and methods described herein relate to determining whether to incorporate recognized text, that corresponds to a spoken utterance of a user of a client device, into a transcription displayed at the client device, or to cause an assistant command, that is associated with the transcription and that is based on the recognized text, to be performed by an automated assistant implemented by the client device. The spoken utterance is received during a dictation session between the user and the automated assistant. Implementations can process, using automatic speech recognition model(s), audio data that captures the spoken utterance to generate the recognized text. Further, implementations can determine whether to incorporate the recognized text into the transcription or cause the assistant command to be performed based on touch input being directed to the transcription, a state of the transcription, and/or audio-based characteristic(s) of the spoken utterance.
-
公开(公告)号:US20240411513A1
公开(公告)日:2024-12-12
申请号:US18808933
申请日:2024-08-19
Applicant: GOOGLE LLC
Inventor: Srikanth Pandiri , Luv Kothari , Behshad Behzadi , Zaheed Sabur , Domenico Carbotta , Akshay Kannan , Qi Wang , Gokay Baris Gultekin , Angana Ghosh , Xu Liu , Yang Lu , Steve Cheng
IPC: G06F3/16 , G06F3/0481 , G06F3/0484 , G06F3/04886 , G06F40/117 , G06F40/143 , G06F40/174 , G06F40/30 , G10L15/22 , G10L15/26
Abstract: Implementations set forth herein relate to an automated assistant that can selectively determine whether to incorporate a verbatim interpretation of portions spoken utterances into an entry field and/or incorporate synonymous content into the entry field. For instance, a user can be accessing an interface that provides an entry field (e.g., address field) for receiving user input. In order to provide input for entry field, the user can select the entry field and/or access a GUI keyboard to initialize an automated assistant for assisting with filling the entry field. Should the user provide a spoken utterance, the user can elect to provide a spoken utterance that embodies the intended input (e.g., an actual address) or a reference to the intended input (e.g., a name). In response to the spoken utterance, the automated assistant can fill the entry field with the intended input without necessitating further input from the user.
-
公开(公告)号:US12093609B2
公开(公告)日:2024-09-17
申请号:US18388465
申请日:2023-11-09
Applicant: GOOGLE LLC
Inventor: Srikanth Pandiri , Luv Kothari , Behshad Behzadi , Zaheed Sabur , Domenico Carbotta , Akshay Kannan , Qi Wang , Gokay Baris Gultekin , Angana Ghosh , Xu Liu , Yang Lu , Steve Cheng
IPC: G06F3/048 , G06F3/0481 , G06F3/0484 , G06F3/04886 , G06F3/16 , G06F40/117 , G06F40/143 , G06F40/174 , G06F40/30 , G10L15/22 , G10L15/26
CPC classification number: G06F3/167 , G06F3/0481 , G06F3/0484 , G06F3/04886 , G06F40/117 , G06F40/143 , G06F40/174 , G06F40/30 , G10L15/22 , G10L15/26
Abstract: Implementations set forth herein relate to an automated assistant that can selectively determine whether to incorporate a verbatim interpretation of portions spoken utterances into an entry field and/or incorporate synonymous content into the entry field. For instance, a user can be accessing an interface that provides an entry field (e.g., address field) for receiving user input. In order to provide input for entry field, the user can select the entry field and/or access a GUI keyboard to initialize an automated assistant for assisting with filling the entry field. Should the user provide a spoken utterance, the user can elect to provide a spoken utterance that embodies the intended input (e.g., an actual address) or a reference to the intended input (e.g., a name). In response to the spoken utterance, the automated assistant can fill the entry field with the intended input without necessitating further input from the user.
-
-
-
-
-
-
-