Machine Learning Based Enhancement of Audio for a Voice Call

    公开(公告)号:US20240153514A1

    公开(公告)日:2024-05-09

    申请号:US18548949

    申请日:2021-03-05

    申请人: Google LLC

    摘要: Apparatus and methods related to enhancement of audio content are provided. An example method includes receiving, by a computing device and via a communications network interface, a compressed audio data frame, wherein the compressed audio data frame is received after transmission over a communications network, The method further includes decompressing the compressed audio data frame to extract an audio waveform. The method also includes predicting, by applying a neural network to the audio waveform, an enhanced version of the audio waveform, wherein the neural network has been trained on (i) a ground truth sample comprising unencoded audio waveforms prior to compression by an audio encoder, and (ii) a training dataset comprising decoded audio waveforms after compression of the unencoded audio waveforms by the audio encoder. The method additionally includes providing, by an audio output component of the computing device, the enhanced version of the audio waveform.

    ON-DEVICE GRAMMAR CHECKING
    2.
    发明公开

    公开(公告)号:US20230359818A1

    公开(公告)日:2023-11-09

    申请号:US18246326

    申请日:2020-12-18

    申请人: Google LLC

    IPC分类号: G06F40/253

    CPC分类号: G06F40/253

    摘要: A computing device may receive inputted text and perform, using one or more neural networks, on-device grammar checking of a sequence of words in the inputted text, including determining, using the one or more neural networks, a grammatically correct version of the sequence of words and determining that the sequence of words does not match the grammatically correct version of the sequence of words. The computing device may, in response to determining that the sequence of words does not match the grammatically correct version of the sequence of words, output, for display at a display device, at least a portion of the grammatically correct version of the sequence of words as a suggested replacement for at least a sequence of the sequence of words in the inputted text.