-
Publication No.: US11113481B2
Publication date: 2021-09-07
Application No.: US16621578
Application date: 2019-05-02
Applicant: GOOGLE LLC
Inventor: Melvin Jose Johnson Premkumar , Vladimir Vuskovic , James Kuczmarski , Hongjie Chai
Abstract: Techniques described herein may serve to increase the language coverage of an automated assistant system, i.e. they may serve to increase the number of queries in one or more non-native languages for which the automated assistant is able to deliver reasonable responses. For example, techniques are described herein for training and utilizing a machine translation model to map a plurality of semantically-related natural language inputs in one language to one or more canonical translations in another language. In various implementations, the canonical translations may be selected and/or optimized for determining an intent of the speaker by the automated assistant, so that one or more responsive actions can be performed based on the speaker's intent. Put another way, the canonical translations may be specifically formatted for indicating the intent of the speaker to the automated assistant.
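As a rough illustration of the many-to-one mapping this abstract describes, the sketch below maps several semantically related inputs in one language to a single canonical translation in another. It swaps the trained machine translation model for a plain lookup table, so it shows only the input/output shape of the idea, not the patented method. The German phrases, the canonical string, and the names `CANONICAL_TRANSLATIONS`, `normalize`, and `canonical_translation` are all hypothetical.

```python
from typing import Optional

# Hypothetical toy data: three German phrasings of the same request share
# one canonical English form that an intent matcher could consume.
# A real system would use a trained translation model, not a table.
CANONICAL_TRANSLATIONS = {
    "stell einen wecker auf 7 uhr": "set alarm 7 am",
    "weck mich um 7 uhr": "set alarm 7 am",
    "ich möchte um 7 uhr geweckt werden": "set alarm 7 am",
}


def normalize(text: str) -> str:
    """Lowercase the text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def canonical_translation(query: str) -> Optional[str]:
    """Return the canonical translation for a query, or None if unknown."""
    return CANONICAL_TRANSLATIONS.get(normalize(query))
```

Because every variant resolves to the same string, downstream intent matching in this sketch only has to recognize one form per intent.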
-
Publication No.: US11915692B2
Publication date: 2024-02-27
Application No.: US17211488
Application date: 2021-03-24
Applicant: Google LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu
IPC: G06F40/00 , G10L15/183 , G10L15/00 , G10L15/22 , G06F16/33 , G06N20/00 , G06F16/332 , G06F40/47 , G06F40/58 , H04L51/02 , G06F18/22
CPC classification number: G10L15/183 , G06F16/3329 , G06F16/3337 , G06F18/22 , G06F40/47 , G06F40/58 , G06N20/00 , G10L15/005 , G10L15/22 , H04L51/02
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
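The two-path flow in this abstract can be sketched as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the translator, intent matcher, fulfillment step, and scoring function are passed in as callables because the abstract does not specify them, and the names `Candidate` and `select_output` and all parameter names are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Candidate:
    language: str
    text: str
    score: float


def select_output(
    recognized_text: str,
    first_lang: str,
    second_lang: str,
    translate: Callable[[str, str, str], str],
    identify_intent: Callable[[str, str], Optional[str]],
    fulfill: Callable[[Optional[str], str], str],
    score: Callable[[str, Optional[str]], float],
) -> Candidate:
    """Build one output candidate per language and return the higher-scoring one."""
    # Path 1: identify and fulfill the intent directly in the first language.
    first_intent = identify_intent(recognized_text, first_lang)
    first_text = fulfill(first_intent, first_lang)
    first = Candidate(first_lang, first_text, score(first_text, first_intent))

    # Path 2: translate the recognized text, then identify and fulfill
    # the intent in the second language.
    translated = translate(recognized_text, first_lang, second_lang)
    second_intent = identify_intent(translated, second_lang)
    second_text = fulfill(second_intent, second_lang)
    second = Candidate(second_lang, second_text, score(second_text, second_intent))

    # Pick the candidate with the higher score; on a tie, the first-language
    # candidate wins because max() returns the first maximal element.
    return max((first, second), key=lambda c: c.score)
```

Passing the components in as callables keeps the sketch independent of any particular speech, translation, or fulfillment backend.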
-
Publication No.: US11354521B2
Publication date: 2022-06-07
Application No.: US16792572
Application date: 2020-02-17
Applicant: Google LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu , Hongjie Chai , Wangqing Yuan
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
-
Publication No.: US11942082B2
Publication date: 2024-03-26
Application No.: US17825778
Application date: 2022-05-26
Applicant: GOOGLE LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu , Hongjie Chai , Wangqing Yuan
IPC: G06F40/47 , G06F16/33 , G06F16/332 , G06F18/22 , G06F40/58 , G06N20/00 , G10L15/00 , G10L15/183 , G10L15/22 , H04L51/02
CPC classification number: G10L15/183 , G06F16/3329 , G06F16/3337 , G06F18/22 , G06F40/47 , G06F40/58 , G06N20/00 , G10L15/005 , G10L15/22 , H04L51/02
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
-
Publication No.: US11875788B2
Publication date: 2024-01-16
Application No.: US17211488
Application date: 2021-03-24
Applicant: Google LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu
IPC: G06F40/00 , G10L15/183 , G10L15/00 , G10L15/22 , G06F16/33 , G06N20/00 , G06F16/332 , G06F40/47 , G06F40/58 , H04L51/02 , G06F18/22
CPC classification number: G10L15/183 , G06F16/3329 , G06F16/3337 , G06F18/22 , G06F40/47 , G06F40/58 , G06N20/00 , G10L15/005 , G10L15/22 , H04L51/02
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
-
Publication No.: US20220284198A1
Publication date: 2022-09-08
Application No.: US17825778
Application date: 2022-05-26
Applicant: GOOGLE LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu , Hongjie Chai , Wangqing Yuan
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
-
Publication No.: US20210210076A1
Publication date: 2021-07-08
Application No.: US17211488
Application date: 2021-03-24
Applicant: Google LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu
IPC: G10L15/183 , G10L15/00 , G10L15/22
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
-
Publication No.: US10984784B2
Publication date: 2021-04-20
Application No.: US16082175
Application date: 2018-04-16
Applicant: Google LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu
IPC: G10L15/00 , G10L15/183 , G10L15/22
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
-
Publication No.: US20210064828A1
Publication date: 2021-03-04
Application No.: US16621578
Application date: 2019-05-02
Applicant: Google LLC
Inventor: Melvin Jose Johnson Premkumar , Vladimir Vuskovic , James Kuczmarski , Hongjie Chai
Abstract: Techniques described herein may serve to increase the language coverage of an automated assistant system, i.e. they may serve to increase the number of queries in one or more non-native languages for which the automated assistant is able to deliver reasonable responses. For example, techniques are described herein for training and utilizing a machine translation model to map a plurality of semantically-related natural language inputs in one language to one or more canonical translations in another language. In various implementations, the canonical translations may be selected and/or optimized for determining an intent of the speaker by the automated assistant, so that one or more responsive actions can be performed based on the speaker's intent. Put another way, the canonical translations may be specifically formatted for indicating the intent of the speaker to the automated assistant.
-
Publication No.: US20200320984A1
Publication date: 2020-10-08
Application No.: US16082175
Application date: 2018-04-16
Applicant: Google LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu
IPC: G10L15/183 , G10L15/22 , G10L15/00
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.