Guided Speech Enhancement Network
    3.
    发明公开

    公开(公告)号:US20240249741A1

    公开(公告)日:2024-07-25

    申请号:US18159679

    申请日:2023-01-25

    Applicant: Google LLC

    Abstract: A method includes receiving, as input, reference audio data representing a reference audio signal captured by an audio input device. The method also includes receiving, as input, from a beamformer, spatially-filtered audio data representing an output of the beamformer, the beamformer configured to spatially filter, based on additional audio data captured by one or more additional audio input devices, the reference audio data to attenuate one or more interfering signals in the spatially-filtered audio data. The method processes, using a trained guided speech-enhancement network, the reference audio data and the spatially-filtered audio data to generate, as output, enhanced audio data, the guided speech-enhancement network processing the reference audio data and the spatially-filtered audio data to further attenuate, in the enhanced audio data, the one or more interfering signals attenuated by the beamformer.

    SPEAKER AWARENESS USING SPEAKER DEPENDENT SPEECH MODEL(S)

    公开(公告)号:US20220157298A1

    公开(公告)日:2022-05-19

    申请号:US17587424

    申请日:2022-01-28

    Applicant: GOOGLE LLC

    Abstract: Techniques disclosed herein enable training and/or utilizing speaker dependent (SD) speech models which are personalizable to any user of a client device. Various implementations include personalizing a SD speech model for a target user by processing, using the SD speech model, a speaker embedding corresponding to the target user along with an instance of audio data. The SD speech model can be personalized for an additional target user by processing, using the SD speech model, an additional speaker embedding, corresponding to the additional target user, along with another instance of audio data. Additional or alternative implementations include training the SD speech model based on a speaker independent speech model using teacher student learning.

    SPEAKER AWARENESS USING SPEAKER DEPENDENT SPEECH MODEL(S)

    公开(公告)号:US20210312907A1

    公开(公告)日:2021-10-07

    申请号:US17251163

    申请日:2019-12-04

    Applicant: GOOGLE LLC

    Abstract: Techniques disclosed herein enable training and/or utilizing speaker dependent (SD) speech models which are personalizable to any user of a client device. Various implementations include personalizing a SD speech model for a target user by processing, using the SD speech model, a speaker embedding corresponding to the target user along with an instance of audio data. The SD speech model can be personalized for an additional target user by processing, using the SD speech model, an additional speaker embedding, corresponding to the additional target user, along with another instance of audio data. Additional or alternative implementations include training the SD speech model based on a speaker independent speech model using teacher student learning.

Patent Agency Ranking