-
公开(公告)号:US10789955B2
公开(公告)日:2020-09-29
申请号:US16192953
申请日:2018-11-16
Applicant: Google LLC
Inventor: Assaf Hurwitz Michaely , Petar Aleksic , Pedro Moreno
Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.
-
公开(公告)号:US20220277749A1
公开(公告)日:2022-09-01
申请号:US17652923
申请日:2022-02-28
Applicant: Google LLC
Inventor: Assaf Hurwitz Michaely , Petar Aleksic , Pedro J. Moreno Mengibar
Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.
-
公开(公告)号:US20200160865A1
公开(公告)日:2020-05-21
申请号:US16192953
申请日:2018-11-16
Applicant: Google LLC
Inventor: Assaf Hurwitz Michaely , Petar Aleksic , Pedro Moreno
Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.
-
公开(公告)号:US10579730B1
公开(公告)日:2020-03-03
申请号:US16258230
申请日:2019-01-25
Applicant: Google LLC
Inventor: Evgeny A. Cherepanov , Gleb Skobeltsyn , Jakob Foerster , Petar Aleksic , Assaf Hurwitz Michaely
IPC: G10L15/22 , G10L15/30 , G10L15/26 , G06F17/27 , G10L15/197 , G10L15/187 , G10L15/32 , G10L15/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice input from a user device; generating a first recognition output; receiving a user selection of one or more terms in the first recognition output; receiving a second voice input spelling a correction of the user selection; determining a corrected recognition output for the selected portion; and providing a second recognition output that merges the first recognition output and the corrected recognition output.
-
公开(公告)号:US11676607B2
公开(公告)日:2023-06-13
申请号:US17652923
申请日:2022-02-28
Applicant: Google LLC
Inventor: Assaf Hurwitz Michaely , Petar Aleksic , Pedro J. Moreno Mengibar
CPC classification number: G10L15/26 , G06F40/56 , G10L15/22 , G10L2015/228
Abstract: A method for denormalizing raw speech recognition results. The method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The context metadata indicates that the speech input includes dictated speech directed to a messaging application that is currently executing on a user device for inclusion in an electronic message. The method further includes generating, using a speech recognizer, a raw speech recognition result including an explicit punctuation term spoken by the user and corresponding to the speech input. Based on the context metadata, the method includes denormalizing the generated raw speech recognition result into denormalized text by applying an explicit punctuation denormalizer to convert the explicit punctuation term in the raw speech recognition result into a corresponding punctuation symbol and displaying the denormalized text including the corresponding punctuation symbol on a display screen of the user device.
-
公开(公告)号:US20220238105A1
公开(公告)日:2022-07-28
申请号:US17157207
申请日:2021-01-25
Applicant: GOOGLE LLC
Inventor: Rafael Goldfarb , Or Guz , Lior Alon , Assaf Hurwitz Michaely , Golan Pundak , Shmuel Leibtag , Tomer Amiaz , Dan Rasin , Asaf Aharoni
Abstract: Implementations are directed to causing a voice bot to utilize a plurality of ML layers in resolving unique personal identifier(s) for a human while the voice bot is engaged in a corresponding conversation with the human. The unique personal identifier(s) can include a unique sequence of alphanumeric characters that is personal to the human. In some implementations, ASR speech hypothes(es) corresponding to spoken utterance(s) that include the unique personal identifier(s) can be processed to generate candidate unique personal identifier(s), given alphanumeric character(s) of the candidate unique personal identifier(s) can be selected, and the voice bot can prompt the human with clarification request(s) to clarify the given alphanumeric character(s) until it is predicted to correspond to the an actual unique personal identifier(s) for the human(s). The unique personal identifier(s) can then be utilized in performance of further action(s) by the voice bot and/or other systems.
-
7.
公开(公告)号:US20230419964A1
公开(公告)日:2023-12-28
申请号:US18462787
申请日:2023-09-07
Applicant: GOOGLE LLC
Inventor: Rafael Goldfarb , Or Guz , Lior Alon , Assaf Hurwitz Michaely , Golan Pundak , Shmuel Leibtag , Tomer Amiaz , Dan Rasin , Asaf Aharoni
CPC classification number: G10L15/22 , G10L15/063 , G10L2015/0635
Abstract: Implementations are directed to causing a voice bot to utilize a plurality of ML layers in resolving unique personal identifier(s) for a human while the voice bot is engaged in a corresponding conversation with the human. The unique personal identifier(s) can include a unique sequence of alphanumeric characters that is personal to the human. In some implementations, ASR speech hypothes(es) corresponding to spoken utterance(s) that include the unique personal identifier(s) can be processed to generate candidate unique personal identifier(s), given alphanumeric character(s) of the candidate unique personal identifier(s) can be selected, and the voice bot can prompt the human with clarification request(s) to clarify the given alphanumeric character(s) until it is predicted to correspond to the an actual unique personal identifier(s) for the human(s). The unique personal identifier(s) can then be utilized in performance of further action(s) by the voice bot and/or other systems.
-
公开(公告)号:US11790906B2
公开(公告)日:2023-10-17
申请号:US17157207
申请日:2021-01-25
Applicant: GOOGLE LLC
Inventor: Rafael Goldfarb , Or Guz , Lior Alon , Assaf Hurwitz Michaely , Golan Pundak , Shmuel Leibtag , Tomer Amiaz , Dan Rasin , Asaf Aharoni
CPC classification number: G10L15/22 , G10L15/063 , G10L2015/0635
Abstract: Implementations are directed to causing a voice bot to utilize a plurality of ML layers in resolving unique personal identifier(s) for a human while the voice bot is engaged in a corresponding conversation with the human. The unique personal identifier(s) can include a unique sequence of alphanumeric characters that is personal to the human. In some implementations, ASR speech hypothes(es) corresponding to spoken utterance(s) that include the unique personal identifier(s) can be processed to generate candidate unique personal identifier(s), given alphanumeric character(s) of the candidate unique personal identifier(s) can be selected, and the voice bot can prompt the human with clarification request(s) to clarify the given alphanumeric character(s) until it is predicted to correspond to the an actual unique personal identifier(s) for the human(s). The unique personal identifier(s) can then be utilized in performance of further action(s) by the voice bot and/or other systems.
-
公开(公告)号:US11282525B2
公开(公告)日:2022-03-22
申请号:US17009494
申请日:2020-09-01
Applicant: Google LLC
Inventor: Assaf Hurwitz Michaely , Petar Aleksic , Pedro J. Moreno Mengibar
Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.
-
-
-
-
-
-
-
-