Sound source localization
    1.
    发明授权

    公开(公告)号:US11762052B1

    公开(公告)日:2023-09-19

    申请号:US17475888

    申请日:2021-09-15

    CPC classification number: G01S3/8083 G01S5/20 G06T7/70 G10L15/22 G10L2015/223

    Abstract: Techniques for improving sound source localization (SSL) are provided. A method for probabilistic SSL using a deep neural network (DNN) may include receiving audio data including a representation of audio such as a wakeword from a microphone array. The audio data may be processed by a DNN to output a plurality of values where each value indicates a probability that the audio originated from a direction corresponding to that value. A sensor may provide computer vision or other data which may be used to inform the plurality of values based on detecting presence of a human or obstacle. A probability that the audio originated from one of the directions of the plurality of directions may be determined based at least in part on the DNN output and the computer vision or other data.

Patent Agency Ranking