-
公开(公告)号:US10063965B2
公开(公告)日:2018-08-28
申请号:US15170348
申请日:2016-06-01
Applicant: Google Inc.
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Arun Narayanan
CPC classification number: H04R3/005 , G01S5/18 , G10L25/30 , H04R5/027 , H04R2201/401 , H04R2430/20 , H04S2400/11 , H04S2400/15 , H04S2420/01
Abstract: A system for estimating the location of a stationary or moving sound source includes multiple microphones, which need not be physically aligned in a linear array or a regular geometric pattern in a given environment, an auralizer that generates auralized multi-channel signals based at least on array-related transfer functions and room impulse responses of the microphones as well as signal labels corresponding to the auralized multi-channel signals, a feature extractor that extracts features from the auralized multi-channel signals for efficient processing, and a neural network that can be trained to estimate the location of the sound source based at least on the features extracted from the auralized multi-channel signals and the corresponding signal labels.
-
公开(公告)号:US20170353789A1
公开(公告)日:2017-12-07
申请号:US15170348
申请日:2016-06-01
Applicant: Google Inc.
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Arun Narayanan
CPC classification number: H04R3/005 , G10L25/30 , H04R5/027 , H04R2201/401 , H04R2430/20 , H04S2400/11 , H04S2400/15 , H04S2420/01
Abstract: A system for estimating the location of a stationary or moving sound source includes multiple microphones, which need not be physically aligned in a linear array or a regular geometric pattern in a given environment, an auralizer that generates auralized multi-channel signals based at least on array-related transfer functions and room impulse responses of the microphones as well as signal labels corresponding to the auralized multi-channel signals, a feature extractor that extracts features from the auralized multi-channel signals for efficient processing, and a neural network that can be trained to estimate the location of the sound source based at least on the features extracted from the auralized multi-channel signals and the corresponding signal labels.
-
公开(公告)号:US20180268812A1
公开(公告)日:2018-09-20
申请号:US15458214
申请日:2017-03-14
Applicant: Google Inc.
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Michiel A.U. Bacchiani
CPC classification number: G10L15/22 , G06K9/00255 , G10L15/04 , G10L15/25 , G10L15/265 , G10L25/78 , G10L2015/223
Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.
-
公开(公告)号:US09992570B2
公开(公告)日:2018-06-05
申请号:US15170924
申请日:2016-06-01
Applicant: Google Inc.
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Ananya Misra
CPC classification number: H04R3/005 , H04R5/027 , H04R29/005 , H04R29/006 , H04R2201/401 , H04R2430/20
Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
公开(公告)号:US09875747B1
公开(公告)日:2018-01-23
申请号:US15211417
申请日:2016-07-15
Applicant: Google Inc.
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Tara Sainath
IPC: G10L19/00 , G10L19/008 , G10L25/30 , G10L25/72
CPC classification number: G10L19/008 , G10L19/0017 , G10L25/30 , G10L25/72
Abstract: A sensor device may include a computing device in communication with multiple microphones. A neural network executing on the computing device may receive audio signals from each microphone. One microphone signal may serve as a reference signal. The neural network may extract differences in signal characteristics of the other microphone signals as compared to the reference signal. The neural network may combine these signal differences into a lossy compressed signal. The sensor device may transmit the lossy compressed signal and the lossless reference signal to a remote neural network executing in a cloud computing environment for decompression and sound recognition analysis.
-
公开(公告)号:US20170353790A1
公开(公告)日:2017-12-07
申请号:US15170924
申请日:2016-06-01
Applicant: Google Inc.
Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Ananya Misra
CPC classification number: H04R3/005 , H04R5/027 , H04R29/005 , H04R29/006 , H04R2201/401 , H04R2430/20
Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
-
-
-
-
-