Contextual Denormalization for Automatic Speech Recognition

    公开(公告)号:US20200160865A1

    公开(公告)日:2020-05-21

    申请号:US16192953

    申请日:2018-11-16

    Applicant: Google LLC

    Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.

    Contextual denormalization for automatic speech recognition

    公开(公告)号:US10789955B2

    公开(公告)日:2020-09-29

    申请号:US16192953

    申请日:2018-11-16

    Applicant: Google LLC

    Abstract: A method includes receiving a speech input from a user and obtaining context metadata associated with the speech input. The method also includes generating a raw speech recognition result corresponding to the speech input and selecting a list of one or more denormalizers to apply to the generated raw speech recognition result based on the context metadata associated with the speech input. The generated raw speech recognition result includes normalized text. The method also includes denormalizing the generated raw speech recognition result into denormalized text by applying the list of the one or more denormalizers in sequence to the generated raw speech recognition result.

    Word lattice augmentation for automatic speech recognition

    公开(公告)号:US11238227B2

    公开(公告)日:2022-02-01

    申请号:US16622657

    申请日:2019-06-27

    Applicant: Google LLC

    Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.

    WORD LATTICE AUGMENTATION FOR AUTOMATIC SPEECH RECOGNITION

    公开(公告)号:US20210064822A1

    公开(公告)日:2021-03-04

    申请号:US16622657

    申请日:2019-06-27

    Applicant: Google LLC

    Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.

    WORD LATTICE AUGMENTATION FOR AUTOMATIC SPEECH RECOGNITION

    公开(公告)号:US20220229992A1

    公开(公告)日:2022-07-21

    申请号:US17589186

    申请日:2022-01-31

    Applicant: GOOGLE LLC

    Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.

Patent Agency Ranking