Patent search ap:("GOOGLE LLC") AND inv:"Assaf Hurwitz Michaely" Page 1

1.

发明授权
Contextual denormalization for automatic speech recognition 有权

公开(公告)号：US10789955B2

公开(公告)日：2020-09-29

申请号：US16192953

申请日：2018-11-16

Applicant: Google LLC

Inventor： Assaf Hurwitz Michaely , Petar Aleksic , Pedro Moreno

IPC: G10L15/26 , G10L15/22 , G06F40/56

Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.

2.

发明申请
Contextual Denormalization For Automatic Speech Recognition 有权

公开(公告)号：US20220277749A1

公开(公告)日：2022-09-01

申请号：US17652923

申请日：2022-02-28

Applicant: Google LLC

Inventor： Assaf Hurwitz Michaely , Petar Aleksic , Pedro J. Moreno Mengibar

IPC: G10L15/26 , G06F40/56 , G10L15/22

Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.

3.

发明申请
Contextual Denormalization for Automatic Speech Recognition 审中-公开

公开(公告)号：US20200160865A1

公开(公告)日：2020-05-21

申请号：US16192953

申请日：2018-11-16

Applicant: Google LLC

Inventor： Assaf Hurwitz Michaely , Petar Aleksic , Pedro Moreno

IPC: G10L15/26 , G10L15/22 , G06F17/28

Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.

4.

发明授权
Allowing spelling of arbitrary words 有权

公开(公告)号：US10579730B1

公开(公告)日：2020-03-03

申请号：US16258230

申请日：2019-01-25

Applicant: Google LLC

Inventor： Evgeny A. Cherepanov , Gleb Skobeltsyn , Jakob Foerster , Petar Aleksic , Assaf Hurwitz Michaely

IPC: G10L15/22 , G10L15/30 , G10L15/26 , G06F17/27 , G10L15/197 , G10L15/187 , G10L15/32 , G10L15/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice input from a user device; generating a first recognition output; receiving a user selection of one or more terms in the first recognition output; receiving a second voice input spelling a correction of the user selection; determining a corrected recognition output for the selected portion; and providing a second recognition output that merges the first recognition output and the corrected recognition output.

5.

发明授权
Contextual denormalization for automatic speech recognition 有权

公开(公告)号：US11676607B2

公开(公告)日：2023-06-13

申请号：US17652923

申请日：2022-02-28

Applicant: Google LLC

Inventor： Assaf Hurwitz Michaely , Petar Aleksic , Pedro J. Moreno Mengibar

IPC: G10L15/26 , G06F40/56 , G10L15/22

CPC classification number: G10L15/26 , G06F40/56 , G10L15/22 , G10L2015/228

Abstract: A method for denormalizing raw speech recognition results. The method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The context metadata indicates that the speech input includes dictated speech directed to a messaging application that is currently executing on a user device for inclusion in an electronic message. The method further includes generating, using a speech recognizer, a raw speech recognition result including an explicit punctuation term spoken by the user and corresponding to the speech input. Based on the context metadata, the method includes denormalizing the generated raw speech recognition result into denormalized text by applying an explicit punctuation denormalizer to convert the explicit punctuation term in the raw speech recognition result into a corresponding punctuation symbol and displaying the denormalized text including the corresponding punctuation symbol on a display screen of the user device.

6.

发明申请
RESOLVING UNIQUE PERSONAL IDENTIFIERS DURING CORRESPONDING CONVERSATIONS BETWEEN A VOICE BOT AND A HUMAN 有权

公开(公告)号：US20220238105A1

公开(公告)日：2022-07-28

申请号：US17157207

申请日：2021-01-25

Applicant: GOOGLE LLC

Inventor： Rafael Goldfarb , Or Guz , Lior Alon , Assaf Hurwitz Michaely , Golan Pundak , Shmuel Leibtag , Tomer Amiaz , Dan Rasin , Asaf Aharoni

IPC: G10L15/22 , G10L15/06

Abstract: Implementations are directed to causing a voice bot to utilize a plurality of ML layers in resolving unique personal identifier(s) for a human while the voice bot is engaged in a corresponding conversation with the human. The unique personal identifier(s) can include a unique sequence of alphanumeric characters that is personal to the human. In some implementations, ASR speech hypothes(es) corresponding to spoken utterance(s) that include the unique personal identifier(s) can be processed to generate candidate unique personal identifier(s), given alphanumeric character(s) of the candidate unique personal identifier(s) can be selected, and the voice bot can prompt the human with clarification request(s) to clarify the given alphanumeric character(s) until it is predicted to correspond to the an actual unique personal identifier(s) for the human(s). The unique personal identifier(s) can then be utilized in performance of further action(s) by the voice bot and/or other systems.

7.

发明公开
RESOLVING UNIQUE PERSONAL IDENTIFIERS DURING CORRESPONDING CONVERSATIONS BETWEEN A VOICE BOT AND A HUMAN 审中-公开

公开(公告)号：US20230419964A1

公开(公告)日：2023-12-28

申请号：US18462787

申请日：2023-09-07

Applicant: GOOGLE LLC

Inventor： Rafael Goldfarb , Or Guz , Lior Alon , Assaf Hurwitz Michaely , Golan Pundak , Shmuel Leibtag , Tomer Amiaz , Dan Rasin , Asaf Aharoni

IPC: G10L15/22 , G10L15/06

CPC classification number: G10L15/22 , G10L15/063 , G10L2015/0635

Abstract: Implementations are directed to causing a voice bot to utilize a plurality of ML layers in resolving unique personal identifier(s) for a human while the voice bot is engaged in a corresponding conversation with the human. The unique personal identifier(s) can include a unique sequence of alphanumeric characters that is personal to the human. In some implementations, ASR speech hypothes(es) corresponding to spoken utterance(s) that include the unique personal identifier(s) can be processed to generate candidate unique personal identifier(s), given alphanumeric character(s) of the candidate unique personal identifier(s) can be selected, and the voice bot can prompt the human with clarification request(s) to clarify the given alphanumeric character(s) until it is predicted to correspond to the an actual unique personal identifier(s) for the human(s). The unique personal identifier(s) can then be utilized in performance of further action(s) by the voice bot and/or other systems.

8.

发明授权
Resolving unique personal identifiers during corresponding conversations between a voice bot and a human 有权

公开(公告)号：US11790906B2

公开(公告)日：2023-10-17

申请号：US17157207

申请日：2021-01-25

Applicant: GOOGLE LLC

Inventor： Rafael Goldfarb , Or Guz , Lior Alon , Assaf Hurwitz Michaely , Golan Pundak , Shmuel Leibtag , Tomer Amiaz , Dan Rasin , Asaf Aharoni

IPC: G10L15/22 , G10L15/06

CPC classification number: G10L15/22 , G10L15/063 , G10L2015/0635

Abstract: Implementations are directed to causing a voice bot to utilize a plurality of ML layers in resolving unique personal identifier(s) for a human while the voice bot is engaged in a corresponding conversation with the human. The unique personal identifier(s) can include a unique sequence of alphanumeric characters that is personal to the human. In some implementations, ASR speech hypothes(es) corresponding to spoken utterance(s) that include the unique personal identifier(s) can be processed to generate candidate unique personal identifier(s), given alphanumeric character(s) of the candidate unique personal identifier(s) can be selected, and the voice bot can prompt the human with clarification request(s) to clarify the given alphanumeric character(s) until it is predicted to correspond to the an actual unique personal identifier(s) for the human(s). The unique personal identifier(s) can then be utilized in performance of further action(s) by the voice bot and/or other systems.

9.

发明授权
Contextual denormalization for automatic speech recognition 有权

公开(公告)号：US11282525B2

公开(公告)日：2022-03-22

申请号：US17009494

申请日：2020-09-01

Applicant: Google LLC

Inventor： Assaf Hurwitz Michaely , Petar Aleksic , Pedro J. Moreno Mengibar

IPC: G10L15/26 , G06F40/56 , G10L15/22

Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification