-
Publication No.: US20240420696A1
Publication date: 2024-12-19
Application No.: US18814060
Filing date: 2024-08-23
Applicant: GOOGLE LLC
Inventor: Gleb Skobeltsyn , Olga Kapralova , Konstantin Shagin , Vladimir Vuskovic , Yufei Zhao , Bradley Nelson , Alessio Macrì , Abraham Lee
Abstract: Implementations described herein relate to providing suggestions, via a display modality, for completing a spoken utterance for an automated assistant, in order to reduce a frequency and/or a length of time that the user will participate in a current and/or subsequent dialog session with the automated assistant. A user request can be compiled from content of an ongoing spoken utterance and content of any selected suggestion elements. When a currently compiled portion of the user request (from content of a selected suggestion(s) and an incomplete spoken utterance) is capable of being performed via the automated assistant, any actions corresponding to the currently compiled portion of the user request can be performed via the automated assistant. Furthermore, any further content resulting from performance of the actions, along with any discernible context, can be used for providing further suggestions.
-
Publication No.: US11682381B2
Publication date: 2023-06-20
Application No.: US17457421
Filing date: 2021-12-02
Applicant: Google LLC
Inventor: Olga Kapralova , Evgeny A. Cherepanov , Dmitry Osmakov , Martin Baeuml , Gleb Skobeltsyn
CPC classification number: G10L15/063 , G10L15/01 , G10L15/06 , G10L15/10 , G10L15/22 , G10L15/32 , G10L2015/0635 , G10L2015/0638
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.
-
Publication No.: US20230274729A1
Publication date: 2023-08-31
Application No.: US18312587
Filing date: 2023-05-04
Applicant: Google LLC
Inventor: Olga Kapralova , Evgeny A. Cherepanov , Dmitry Osmakov , Martin Baeuml , Gleb Skobeltsyn
CPC classification number: G10L15/063 , G10L15/06 , G10L15/22 , G10L15/32 , G10L15/01 , G10L15/10 , G10L2015/0635 , G10L2015/0638
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.
-
Publication No.: US20210280180A1
Publication date: 2021-09-09
Application No.: US16343683
Filing date: 2019-02-07
Applicant: Google LLC
Inventor: Gleb Skobeltsyn , Olga Kapralova , Konstantin Shagin , Vladimir Vuskovic , Yufei Zhao , Bradley Nelson , Alessio Macrì , Abraham Lee
Abstract: Implementations described herein relate to providing suggestions, via a display modality, for completing a spoken utterance for an automated assistant, in order to reduce a frequency and/or a length of time that the user will participate in a current and/or subsequent dialog session with the automated assistant. A user request can be compiled from content of an ongoing spoken utterance and content of any selected suggestion elements. When a currently compiled portion of the user request (from content of a selected suggestion(s) and an incomplete spoken utterance) is capable of being performed via the automated assistant, any actions corresponding to the currently compiled portion of the user request can be performed via the automated assistant. Furthermore, any further content resulting from performance of the actions, along with any discernible context, can be used for providing further suggestions.
-
Publication No.: US20240061694A1
Publication date: 2024-02-22
Application No.: US18235699
Filing date: 2023-08-18
Applicant: GOOGLE LLC
Inventor: Evgeny Cherepanov , Olga Kapralova , Dan Vallejo , Wendy Look , Mikhail Reutov
IPC: G06F9/451 , G06F3/0482 , G06F3/0484
CPC classification number: G06F9/453 , G06F3/0482 , G06F3/0484
Abstract: Implementations set forth herein relate to an automated assistant that can provide interactive application widgets based on their relevance to content that a user may have expressed interest in. The automated assistant can render the application widgets according to an estimated familiarity of the user with the content they expressed interest in. Each application widget can correspond to an application that can be accessed separately from the automated assistant. An application widget can be rendered at a display interface simultaneous to the user accessing the content that served as the basis for rendering the application widget. When the user interacts with the application widget, the automated assistant can communicate selection data to a corresponding application, which can respond with supplemental data that can be rendered at the display interface.
-
Publication No.: US11238857B2
Publication date: 2022-02-01
Application No.: US16343683
Filing date: 2019-02-07
Applicant: Google LLC
Inventor: Gleb Skobeltsyn , Olga Kapralova , Konstantin Shagin , Vladimir Vuskovic , Yufei Zhao , Bradley Nelson , Alessio Macrì , Abraham Lee
Abstract: Implementations described herein relate to providing suggestions, via a display modality, for completing a spoken utterance for an automated assistant, in order to reduce a frequency and/or a length of time that the user will participate in a current and/or subsequent dialog session with the automated assistant. A user request can be compiled from content of an ongoing spoken utterance and content of any selected suggestion elements. When a currently compiled portion of the user request (from content of a selected suggestion(s) and an incomplete spoken utterance) is capable of being performed via the automated assistant, any actions corresponding to the currently compiled portion of the user request can be performed via the automated assistant. Furthermore, any further content resulting from performance of the actions, along with any discernible context, can be used for providing further suggestions.
-
Publication No.: US20240062757A1
Publication date: 2024-02-22
Application No.: US18235707
Filing date: 2023-08-18
Applicant: GOOGLE LLC
Inventor: Wendy Look , Evgeny Cherepanov , Olga Kapralova , Dan Vallejo , Mikhail Reutov
IPC: G10L15/22 , H04N21/431 , H04N21/472 , G06F3/0482 , G06Q10/1093
CPC classification number: G10L15/22 , H04N21/4316 , H04N21/47217 , G06F3/0482 , G06Q10/1095 , G10L2015/223
Abstract: Implementations set forth herein relate to incorporating automated assistant suggestions into an interface of a video application, when the video application is rendering video content. The video content, as well as any relevant content, can provide a basis for the automated assistant suggestions. The assistant suggestions can optionally link to one or more additional applications, which can be controlled in response to a user selecting one or more of the automated assistant suggestions. In response to a selection of an assistant suggestion, resulting data generated by another application can be rendered over an interface of the video application, while video content is being rendered and/or otherwise played. This can allow the user to control relevant actions of other applications without completely leaving an interface of the video application, thereby preserving memory and other computational resources that may be consumed when switching between application interfaces.
-
Publication No.: US11200887B2
Publication date: 2021-12-14
Application No.: US16837393
Filing date: 2020-04-01
Applicant: Google LLC
Inventor: Olga Kapralova , Evgeny A. Cherepanov , Dmitry Osmakov , Martin Baeuml , Gleb Skobeltsyn
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.
-
Publication No.: US20200243070A1
Publication date: 2020-07-30
Application No.: US16837393
Filing date: 2020-04-01
Applicant: Google LLC
Inventor: Olga Kapralova , Evgeny A. Cherepanov , Dmitry Osmakov , Martin Baeuml , Gleb Skobeltsyn
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.
-
Publication No.: US20180308471A1
Publication date: 2018-10-25
Application No.: US16023658
Filing date: 2018-06-29
Applicant: Google LLC
Inventor: Olga Kapralova , Evgeny A. Cherepanov , Dmitry Osmakov , Martin Baeuml , Gleb Skobeltsyn
CPC classification number: G10L15/063 , G10L15/01 , G10L15/06 , G10L15/10 , G10L15/22 , G10L15/32 , G10L2015/0635 , G10L2015/0638
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.