AURALIZATION FOR MULTI-MICROPHONE DEVICES

    Publication No.: US20230027458A1

    Publication date: 2023-01-26

    Application No.: US17959734

    Filing date: 2022-10-04

    Applicant: Google LLC

    IPC classification: H04R3/00 H04R29/00 H04R5/027

    Abstract: A method for auralizing a multi-microphone device is described. Path information for one or more sound paths is determined for one of a plurality of microphones included in the multi-microphone device, using the dimensions and room reflection coefficients of a simulated room. Array-related transfer functions (ARTFs) for the one of the plurality of microphones are retrieved. An auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
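
The abstract above combines two ingredients: per-path information derived from a simulated room, and device-specific ARTFs. The Python sketch below illustrates one way these could fit together; it is not the patented method. It assumes a shoebox room, first-order reflections only (a basic image-source model), 1/distance attenuation, and ARTFs stored as short FIR filters keyed by azimuth. The function names, the `artfs` table, and all numeric values are hypothetical.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for room-temperature air


def first_order_paths(room, refl, src, mic):
    """Return (distance, gain, azimuth_deg) for the direct path and the six
    first-order wall reflections of a shoebox room (image-source model)."""
    images = [(tuple(src), 1.0)]
    for axis in range(3):
        for wall, coeff in ((0.0, refl[axis][0]), (room[axis], refl[axis][1])):
            img = list(src)
            img[axis] = 2.0 * wall - src[axis]  # mirror the source in the wall
            images.append((tuple(img), coeff))
    paths = []
    for img, coeff in images:
        dist = max(math.dist(img, tuple(mic)), 1e-3)  # avoid division by zero
        azimuth = math.degrees(math.atan2(img[1] - mic[1], img[0] - mic[0])) % 360.0
        paths.append((dist, coeff / dist, azimuth))
    return paths


def nearest_artf(artfs, azimuth):
    """Pick the stored filter whose azimuth is circularly closest."""
    def circular_gap(key):
        gap = abs(key - azimuth) % 360.0
        return min(gap, 360.0 - gap)
    return artfs[min(artfs, key=circular_gap)]


def auralized_ir(paths, artfs, fs, length):
    """Sum delayed, attenuated copies of the direction-matched ARTF filter."""
    ir = [0.0] * length
    for dist, gain, azimuth in paths:
        delay = round(dist / SPEED_OF_SOUND * fs)  # propagation delay in samples
        for k, tap in enumerate(nearest_artf(artfs, azimuth)):
            if delay + k < length:
                ir[delay + k] += gain * tap
    return ir


# Hypothetical example: 5 m x 4 m x 3 m room, one source, one microphone.
room = (5.0, 4.0, 3.0)
refl = ((0.8, 0.8), (0.8, 0.8), (0.7, 0.7))  # (low wall, high wall) per axis
artfs = {0.0: [1.0, 0.2], 90.0: [0.8, 0.3], 180.0: [0.6, 0.4], 270.0: [0.8, 0.3]}
paths = first_order_paths(room, refl, (1.0, 2.0, 1.5), (4.0, 2.0, 1.5))
ir = auralized_ir(paths, artfs, fs=16000, length=512)
```

Repeating the computation for each microphone, each with its own ARTF table, would give one impulse response per channel. The sketch ignores elevation and higher-order reflections, both of which a fuller simulation would need.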

    QUERY ENDPOINTING BASED ON LIP DETECTION
    Patent application

    Publication No.: US20190333507A1

    Publication date: 2019-10-31

    Application No.: US16412677

    Filing date: 2019-05-15

    Applicant: Google LLC

    Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, synchronized video data and audio data are received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.
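
The endpointing step described above maps the first and last lip-movement frames onto the synchronized audio. The Python sketch below shows that mapping only. It assumes a lip-movement detector has already produced one boolean per video frame, and that audio and video start at the same instant; `endpoint_audio` and its parameters are hypothetical names, not taken from the patent.

```python
def endpoint_audio(audio, sample_rate, lip_moving, frame_rate):
    """Trim audio to the span between the first and last lip-movement frames.

    audio       -- list of samples, synchronized with the video
    lip_moving  -- one boolean per video frame (assumed detector output)
    """
    moving = [i for i, flag in enumerate(lip_moving) if flag]
    if not moving:
        return []  # no lip movement detected, so there is nothing to transcribe
    first, last = moving[0], moving[-1]
    samples_per_frame = sample_rate / frame_rate
    start = int(first * samples_per_frame)
    end = int((last + 1) * samples_per_frame)  # include the whole last frame
    return audio[start:min(end, len(audio))]


# Hypothetical example: 1 s of audio at 16 kHz, video at 10 frames per second.
audio = list(range(16000))
lip_moving = [False, False, True, True, False, True, False, False, False, False]
clip = endpoint_audio(audio, 16000, lip_moving, 10)
```

The returned clip would then be passed to a speech recognizer. A brief pause inside the utterance (frame 4 in the example) stays within the clip because only the first and last moving frames set the boundaries.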

    Systems and methods of home-specific sound event detection

    Publication No.: US10395494B2

    Publication date: 2019-08-27

    Application No.: US16050612

    Filing date: 2018-07-31

    Applicant: Google LLC

    Abstract: Systems and methods of a security system are provided, including detecting, by a sensor, a sound event, and selecting, by a processor coupled to the sensor, at least a portion of sound data captured by the sensor that corresponds to at least one sound feature of the detected sound event. The systems and methods include classifying the at least one sound feature into one or more sound categories, and determining, by the processor, based upon a database of home-specific sound data, whether the at least one sound feature is a human-generated sound. A notification can be transmitted to a computing device according to the sound event.

    Auralization for multi-microphone devices

    Publication No.: US10412489B2

    Publication date: 2019-09-10

    Application No.: US15996070

    Filing date: 2018-06-01

    Applicant: Google LLC

    IPC classification: H04R3/00 H04R29/00 H04R5/027

    Abstract: A method for auralizing a multi-microphone device is described. Path information for one or more sound paths is determined for one of a plurality of microphones included in the multi-microphone device, using the dimensions and room reflection coefficients of a simulated room. Array-related transfer functions (ARTFs) for the one of the plurality of microphones are retrieved. An auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.

    Query endpointing based on lip detection

    Publication No.: US10332515B2

    Publication date: 2019-06-25

    Application No.: US15458214

    Filing date: 2017-03-14

    Applicant: Google LLC

    Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, synchronized video data and audio data are received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.

    Directional microphone device and signal processing techniques

    Publication No.: US10237649B2

    Publication date: 2019-03-19

    Application No.: US15844847

    Filing date: 2017-12-18

    Applicant: Google LLC

    Abstract: Methods and apparatus relating to microphone devices and signal processing techniques are provided. In an example, a microphone device can detect sound, as well as enhance an ability to perceive at least a general direction from which the sound arrives at the microphone device. In an example, a case of the microphone device has an external surface which at least partially defines funnel-shaped surfaces. Each funnel-shaped surface is configured to direct the sound to a respective microphone diaphragm to produce an auralized multi-microphone output. The funnel-shaped surfaces are configured to cause direction-dependent variations in spectral notches and frequency response of the sound as received by the microphone diaphragms. A neural network can device-shape the auralized multi-microphone output to create a binaural output. The binaural output can be auralized with respect to a human listener.

    DEVICE SPECIFIC MULTI-CHANNEL DATA COMPRESSION

    Publication No.: US20180108363A1

    Publication date: 2018-04-19

    Application No.: US15845087

    Filing date: 2017-12-18

    Applicant: Google LLC

    Abstract: A sensor device may include a computing device in communication with multiple microphones. A neural network executing on the computing device may receive audio signals from each microphone. One microphone signal may serve as a reference signal. The neural network may extract differences in signal characteristics of the other microphone signals as compared to the reference signal. The neural network may combine these signal differences into a lossy compressed signal. The sensor device may transmit the lossy compressed signal and the lossless reference signal to a remote neural network executing in a cloud computing environment for decompression and sound recognition analysis.
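
The reference-plus-differences idea in the abstract above can be illustrated without a neural network. The Python sketch below is a deliberately simplified stand-in: it keeps one channel losslessly and stores every other channel as coarsely quantized sample-wise differences from that reference. The quantizer replaces the learned encoder described in the abstract, and the function names and the `step` parameter are hypothetical.

```python
def compress(channels, ref_index=0, step=0.05):
    """Keep one channel lossless; store the others as quantized differences.

    Simplifying assumption: sample-wise differences with a uniform quantizer
    stand in for the neural network's learned difference features.
    """
    ref = list(channels[ref_index])
    residuals = []
    for i, channel in enumerate(channels):
        if i == ref_index:
            continue
        residuals.append([round((x - r) / step) for x, r in zip(channel, ref)])
    return ref, residuals


def decompress(ref, residuals, ref_index=0, step=0.05):
    """Rebuild every channel; non-reference channels are approximate."""
    others = [[r + q * step for r, q in zip(ref, res)] for res in residuals]
    return others[:ref_index] + [list(ref)] + others[ref_index:]


# Hypothetical three-microphone capture of nearly identical signals.
channels = [
    [0.00, 0.50, 1.00, 0.50],
    [0.02, 0.47, 0.96, 0.55],
    [-0.01, 0.52, 1.03, 0.49],
]
ref, residuals = compress(channels)
restored = decompress(ref, residuals)
```

Because closely spaced microphones record similar signals, the quantized differences are small integers that compress well, while the reference channel is reconstructed exactly. Each reconstructed sample in the other channels is off by at most half of `step`.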

    SOUND EVENT DETECTION
    Patent application

    Publication No.: US20180047415A1

    Publication date: 2018-02-15

    Application No.: US15797991

    Filing date: 2017-10-30

    Applicant: Google LLC

    IPC classification: G10L25/51 G10L25/18 G08B13/16

    Abstract: A system and method are described that use the sensors and processors of existing, distributed systems, operating individually or in cooperation with other systems, networks, or cloud-based services, to enhance the detection and classification of sound events in an environment (e.g., a home) with low computational complexity. The system and method provide functions in which the features most relevant to discriminating sounds are extracted from an audio signal and then classified according to whether they correspond to a sound event that should result in a communication to a user. Threshold values and other variables can be determined by training on audio signals of known sounds in defined environments, and implemented to distinguish human and pet sounds from other sounds and to compensate for variations in the magnitude of the audio signal, different sizes and reverberation characteristics of the environment, and variations in microphone responses.
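
The abstract above describes extracting discriminative features, learning thresholds from known sounds, and compensating for signal magnitude. The Python sketch below shows a minimal version of that pattern under strong assumptions: a single feature (crest factor, the ratio of peak to RMS level, which does not change with gain) and a threshold set midway between the class means. The feature choice, the function names, and the example signals are hypothetical and are not taken from the patent.

```python
import math


def crest_factor(signal):
    """Peak-to-RMS ratio; unchanged when the signal is scaled by a gain."""
    rms = math.sqrt(sum(x * x for x in signal) / len(signal)) or 1.0
    return max(abs(x) for x in signal) / rms


def train_threshold(event_clips, other_clips):
    """Place the threshold midway between the mean feature of each class.

    Assumes events of interest have the higher crest factor.
    """
    event_mean = sum(map(crest_factor, event_clips)) / len(event_clips)
    other_mean = sum(map(crest_factor, other_clips)) / len(other_clips)
    return (event_mean + other_mean) / 2.0


def is_event(signal, threshold):
    """Classify a clip as an event that should trigger a notification."""
    return crest_factor(signal) > threshold


# Hypothetical training data: an impulsive click versus a steady hum.
click = [0.0] * 199 + [1.0]
hum = [math.sin(2.0 * math.pi * k / 20.0) for k in range(200)]
threshold = train_threshold([click], [hum])

quiet_click = [0.1 * x for x in click]  # same sound, lower level
loud_hum = [5.0 * x for x in hum]       # same sound, higher level
```

With this data, `is_event(quiet_click, threshold)` is true and `is_event(loud_hum, threshold)` is false, which shows the level compensation at work. A practical detector would use several features and train on recordings from the target environment.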