-
Publication No.: US20200160865A1
Publication date: 2020-05-21
Application No.: US16192953
Application date: 2018-11-16
Applicant: Google LLC
Inventor: Assaf Hurwitz Michaely, Petar Aleksic, Pedro Moreno
Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.
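The flow in this abstract (select a list of denormalizers from context metadata, then apply them in sequence to the normalized raw result) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the three toy denormalizers, the `application` metadata key, and the selection rule.

```python
from typing import Callable, Dict, List

# A denormalizer maps text to text. All denormalizers below are hypothetical
# stand-ins; the abstract does not say which denormalizers exist.
Denormalizer = Callable[[str], str]

_DIGITS = {
    "zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
    "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9",
}


def verbalized_digits_to_numerals(text: str) -> str:
    """Replace spelled-out single digits with numerals, word by word."""
    return " ".join(_DIGITS.get(word, word) for word in text.split())


def capitalize_first(text: str) -> str:
    """Upper-case the first character of the text."""
    return text[:1].upper() + text[1:]


def add_final_period(text: str) -> str:
    """Append a period unless the text already ends in terminal punctuation."""
    return text if text.endswith((".", "?", "!")) else text + "."


def select_denormalizers(context: Dict[str, str]) -> List[Denormalizer]:
    """Pick an ordered list of denormalizers from context metadata.

    The "application" key and the rule below are assumptions for this sketch.
    """
    chain: List[Denormalizer] = [verbalized_digits_to_numerals]
    if context.get("application") == "dictation":
        chain += [capitalize_first, add_final_period]
    return chain


def denormalize(raw_result: str, context: Dict[str, str]) -> str:
    """Apply the selected denormalizers in sequence to the raw result."""
    text = raw_result
    for denormalizer in select_denormalizers(context):
        text = denormalizer(text)
    return text


dictation = denormalize("call me at five five five", {"application": "dictation"})
search = denormalize("call me at five five five", {"application": "search"})
```

In this sketch the same raw result is rendered differently depending on the metadata: the dictation context adds capitalization and punctuation, while the search context only converts digits.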
-
Publication No.: US10789955B2
Publication date: 2020-09-29
Application No.: US16192953
Application date: 2018-11-16
Applicant: Google LLC
Inventor: Assaf Hurwitz Michaely, Petar Aleksic, Pedro Moreno
Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.
-
Publication No.: US11797772B2
Publication date: 2023-10-24
Application No.: US17589186
Application date: 2022-01-31
Applicant: GOOGLE LLC
Inventor: Leonid Velikovich, Petar Aleksic, Pedro Moreno
IPC: G10L15/22, G10L15/187, G06F40/295, G06F40/30, G10L15/06
CPC classification number: G06F40/295, G06F40/30, G10L15/063, G10L15/187, G10L15/22
Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.
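The steps named in this abstract (locate a carrier phrase in a word lattice, match a candidate named entity against the portion of the lattice that follows it, and augment the lattice with the match) can be sketched on a toy lattice. The sketch rests on several assumptions that are not in the patent: the lattice is a plain list of word edges, the carrier phrase is a single word, matching uses string similarity from Python's `difflib` as a stand-in for whatever matching the patent actually uses, and the 0.6 threshold is arbitrary.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Edge:
    """One lattice edge: a word hypothesis between two states (toy model)."""
    start: int
    end: int
    word: str


def find_carrier_end(lattice: List[Edge], carrier: str) -> Optional[int]:
    """Return the end state of the first edge labeled with the carrier phrase."""
    for edge in lattice:
        if edge.word == carrier:
            return edge.end
    return None


def words_after(lattice: List[Edge], state: int) -> Tuple[List[str], int]:
    """Follow the first outgoing edge from each state until none is left.

    Returns the words on that path and the last state reached. Assumes the
    lattice is acyclic, as word lattices are.
    """
    words: List[str] = []
    current = state
    while True:
        nxt = next((e for e in lattice if e.start == current), None)
        if nxt is None:
            return words, current
        words.append(nxt.word)
        current = nxt.end


def augment_with_entity(
    lattice: List[Edge],
    carrier: str,
    candidates: List[str],
    threshold: float = 0.6,  # hypothetical cutoff chosen for this sketch
) -> List[Edge]:
    """Add an edge for the best-matching candidate entity after the carrier."""
    carrier_end = find_carrier_end(lattice, carrier)
    if carrier_end is None or not candidates:
        return list(lattice)
    words, final_state = words_after(lattice, carrier_end)
    hypothesis = " ".join(words)

    def score(name: str) -> float:
        # String similarity is a stand-in for the patent's matching step.
        return SequenceMatcher(None, hypothesis.lower(), name.lower()).ratio()

    best = max(candidates, key=score)
    if not words or score(best) < threshold:
        return list(lattice)
    # The new edge spans the same states as the matched lattice portion.
    return list(lattice) + [Edge(carrier_end, final_state, best)]


lattice = [Edge(0, 1, "call"), Edge(1, 2, "susie"), Edge(2, 3, "ann")]
augmented = augment_with_entity(lattice, "call", ["Suzy Ann", "Bob Smith"])
```

Here the misrecognized words after the carrier phrase "call" stay in the lattice, and the contact name is added as an alternative path over the same span, so a later decoding pass could choose between them.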
-
Publication No.: US11238227B2
Publication date: 2022-02-01
Application No.: US16622657
Application date: 2019-06-27
Applicant: Google LLC
Inventor: Leonid Velikovich, Petar Aleksic, Pedro Moreno
IPC: G10L15/22, G10L15/187, G06F40/295, G06F40/30, G10L15/06
Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.
-
Publication No.: US20210064822A1
Publication date: 2021-03-04
Application No.: US16622657
Application date: 2019-06-27
Applicant: Google LLC
Inventor: Leonid Velikovich, Petar Aleksic, Pedro Moreno
IPC: G06F40/295, G06F40/30, G10L15/22, G10L15/06, G10L15/187
Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.
-
Publication No.: US20220229992A1
Publication date: 2022-07-21
Application No.: US17589186
Application date: 2022-01-31
Applicant: GOOGLE LLC
Inventor: Leonid Velikovich, Petar Aleksic, Pedro Moreno
IPC: G06F40/295, G06F40/30, G10L15/06, G10L15/187, G10L15/22
Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.
-