Unsupervised automatic speech recognition

    Publication No.: US11138966B2

    Publication date: 2021-10-05

    Application No.: US16269951

    Application date: 2019-02-07

    Abstract: A method for generating an automatic speech recognition (ASR) model using unsupervised learning includes obtaining, by a device, text information. The method includes determining, by the device, a set of phoneme sequences associated with the text information. The method includes obtaining, by the device, speech waveform data. The method includes determining, by the device, a set of phoneme boundaries associated with the speech waveform data. The method includes generating, by the device, the ASR model using an output distribution matching (ODM) technique based on determining the set of phoneme sequences associated with the text information and based on determining the set of phoneme boundaries associated with the speech waveform data.

    UNSUPERVISED AUTOMATIC SPEECH RECOGNITION
    2.
    Patent application

    Publication No.: US20200258497A1

    Publication date: 2020-08-13

    Application No.: US16269951

    Application date: 2019-02-07

    Abstract: A method for generating an automatic speech recognition (ASR) model using unsupervised learning includes obtaining, by a device, text information. The method includes determining, by the device, a set of phoneme sequences associated with the text information. The method includes obtaining, by the device, speech waveform data. The method includes determining, by the device, a set of phoneme boundaries associated with the speech waveform data. The method includes generating, by the device, the ASR model using an output distribution matching (ODM) technique based on determining the set of phoneme sequences associated with the text information and based on determining the set of phoneme boundaries associated with the speech waveform data.
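
The abstract names output distribution matching (ODM) but does not state the objective, so the following is only an illustrative sketch of the general idea, not the claimed method. It assumes bigram statistics, a cross-entropy style cost, and per-segment phoneme posteriors given as plain dictionaries; the function names (`ngram_distribution`, `expected_bigram_distribution`, `odm_cost`) and the toy phoneme data are hypothetical. The cost is low when the phoneme bigrams predicted for boundary-delimited speech segments occur about as often as the bigrams found in the text-derived phoneme sequences.

```python
from collections import Counter
from math import log


def ngram_distribution(phoneme_sequences, n=2):
    """Empirical n-gram distribution over phoneme sequences derived from text."""
    counts = Counter()
    for seq in phoneme_sequences:
        for i in range(len(seq) - n + 1):
            counts[tuple(seq[i:i + n])] += 1
    total = sum(counts.values())
    return {gram: c / total for gram, c in counts.items()}


def expected_bigram_distribution(segment_posteriors):
    """Expected bigram frequencies of the model's outputs.

    segment_posteriors: one list per utterance; each item is a dict mapping
    phoneme -> probability for one segment between two phoneme boundaries.
    (Assumed input format, chosen for illustration.)
    """
    expected = Counter()
    pairs = 0
    for utterance in segment_posteriors:
        for left, right in zip(utterance, utterance[1:]):
            for p1, q1 in left.items():
                for p2, q2 in right.items():
                    expected[(p1, p2)] += q1 * q2
            pairs += 1
    return {gram: v / pairs for gram, v in expected.items()}


def odm_cost(text_dist, speech_dist, eps=1e-8):
    """Cross-entropy of the speech-side distribution under the text-side one.

    Assumed form of the matching cost; eps avoids log(0) for unseen bigrams.
    """
    return -sum(p * log(speech_dist.get(gram, 0.0) + eps)
                for gram, p in text_dist.items())


# Toy data: phoneme sequences from text, and two candidate models' outputs.
text_dist = ngram_distribution([["k", "ae", "t"], ["k", "ae", "b"]])
matched = expected_bigram_distribution([
    [{"k": 1.0}, {"ae": 1.0}, {"t": 1.0}],
    [{"k": 1.0}, {"ae": 1.0}, {"b": 1.0}],
])
mismatched = expected_bigram_distribution([
    [{"t": 1.0}, {"ae": 1.0}, {"k": 1.0}],
    [{"b": 1.0}, {"ae": 1.0}, {"k": 1.0}],
])
print(odm_cost(text_dist, matched), odm_cost(text_dist, mismatched))
```

In this toy setup the model whose segment labels reproduce the text bigram statistics receives the lower cost. A trainable system would minimize such a cost with respect to the parameters of the model that produces the posteriors, which this sketch leaves out.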