-
公开(公告)号:US20230027458A1
公开(公告)日:2023-01-26
申请号:US17959734
申请日:2022-10-04
申请人: Google LLC
摘要: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
公开(公告)号:US20190333507A1
公开(公告)日:2019-10-31
申请号:US16412677
申请日:2019-05-15
申请人: Google LLC
摘要: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.
-
公开(公告)号:US10395494B2
公开(公告)日:2019-08-27
申请号:US16050612
申请日:2018-07-31
申请人: Google LLC
摘要: Systems and methods of a security system are provided, including detecting, by a sensor, a sound event, and selecting, by a processor coupled to the sensor, at least a portion of sound data captured by the sensor that corresponds to at least one sound feature of the detected sound event. The systems and methods include classifying the at least one sound feature into one or more sound categories, and determining, by a processor, based upon a database of home-specific sound data, whether the at least one sound feature is a human-generated sound. A notification can be transmitted to a computing device according to the sound event.
-
公开(公告)号:US20210377493A1
公开(公告)日:2021-12-02
申请号:US17400887
申请日:2021-08-12
申请人: Google LLC
发明人: Jason Evans Goulden , Rengarajan Aravamudhan , Hae Rim Jeong , Michael Dixon , James Edward Stewart , Sayed Yusef Shafi , Sahana Mysore , Seungho Yang , Yu-An Lien , Christopher Charles Burns , Rajeev Conrad Nongpiur , Jeffrey Boyd
摘要: A method of presenting appropriate actions for responding to a visitor to a smart home environment via an electronic greeting system of the smart home environment, including detecting a visitor of the smart home environment; obtaining context information from the smart home environment regarding the visitor; based on the context information, identifying a plurality of appropriate actions available to a user of a client device for interacting with the visitor via the electronic greeting system; and causing the identified actions to be presented to the user of the client device.
-
公开(公告)号:US10412489B2
公开(公告)日:2019-09-10
申请号:US15996070
申请日:2018-06-01
申请人: Google LLC
摘要: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
公开(公告)号:US10332515B2
公开(公告)日:2019-06-25
申请号:US15458214
申请日:2017-03-14
申请人: Google LLC
摘要: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.
-
公开(公告)号:US10237649B2
公开(公告)日:2019-03-19
申请号:US15844847
申请日:2017-12-18
申请人: Google LLC
IPC分类号: H04R3/00 , H04R1/32 , H04R1/22 , H04R1/02 , H04R1/04 , H04R1/34 , H04R1/38 , H04R1/40 , H04R5/027
摘要: Methods and apparatus relating to microphone devices and signal processing techniques are provided. In an example, a microphone device can detect sound, as well as enhance an ability to perceive at least a general direction from which the sound arrives at the microphone device. In an example, a case of the microphone device has an external surface which at least partially defines funnel-shaped surfaces. Each funnel-shaped surface is configured to direct the sound to a respective microphone diaphragm to produce an auralized multi-microphone output. The funnel-shaped surfaces are configured to cause direction-dependent variations in spectral notches and frequency response of the sound as received by the microphone diaphragms. A neural network can device-shape the auralized multi-microphone output to create a binaural output. The binaural output can be auralized with respect to a human listener.
-
公开(公告)号:US20180108363A1
公开(公告)日:2018-04-19
申请号:US15845087
申请日:2017-12-18
申请人: Google LLC
IPC分类号: G10L19/008 , G10L25/30 , G10L25/72 , G10L19/00
CPC分类号: G10L19/008 , G10L19/0017 , G10L25/30
摘要: A sensor device may include a computing device in communication with multiple microphones. A neural network executing on the computing device may receive audio signals from each microphone. One microphone signal may serve as a reference signal. The neural network may extract differences in signal characteristics of the other microphone signals as compared to the reference signal. The neural network may combine these signal differences into a lossy compressed signal. The sensor device may transmit the lossy compressed signal and the lossless reference signal to a remote neural network executing in a cloud computing environment for decompression and sound recognition analysis.
-
公开(公告)号:US20180047415A1
公开(公告)日:2018-02-15
申请号:US15797991
申请日:2017-10-30
申请人: Google LLC
CPC分类号: G10L25/51 , G08B13/1672 , G10L25/18
摘要: A system and method for the use of sensors and processors of existing, distributed systems, operating individually or in cooperation with other systems, networks or cloud-based services to enhance the detection and classification of sound events in an environment (e.g., a home), while having low computational complexity. The system and method provides functions where the most relevant features that help in discriminating sounds are extracted from an audio signal and then classified depending on whether the extracted features correspond to a sound event that should result in a communication to a user. Threshold values and other variables can be determined by training on audio signals of known sounds in defined environments, and implemented to distinguish human and pet sounds from other sounds, and compensate for variations in the magnitude of the audio signal, different sizes and reverberation characteristics of the environment, and variations in microphone responses.
-
公开(公告)号:US11924618B2
公开(公告)日:2024-03-05
申请号:US17959734
申请日:2022-10-04
申请人: Google LLC
CPC分类号: H04R3/005 , H04R5/027 , H04R29/005 , H04R29/006 , H04R2201/401 , H04R2430/20
摘要: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
-
-
-
-
-
-
-
-