-
Publication No.: US12288566B1
Publication date: 2025-04-29
Application No.: US17849823
Filing date: 2022-06-27
Applicant: Amazon Technologies, Inc.
Inventor: Anshuman Ganguly , Srivatsan Kandadai , Trausti Thor Kristjansson , Wontak Kim
IPC: G10L21/0216 , G10L21/0264 , G10L25/51 , G10L25/78
Abstract: A device capable of using data from multiple sensors to determine an estimated position/direction of a user with respect to the device. The device may use estimated position data, along with confidence data, that originated from a plurality of sensors to fuse the data to determine the user's estimated position and comprehensive confidence of the estimated position. The system may use the location information to perform beamforming/beam steering and/or other downstream operations using the comprehensive estimated position.
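The fusion step described in this abstract can be illustrated with a minimal sketch. The abstract does not state how the per-sensor estimates are combined, so the confidence-weighted circular mean below is an assumption, as are the function name `fuse_directions` and the (angle, confidence) tuple layout.

```python
import math

def fuse_directions(estimates):
    """Fuse per-sensor direction estimates into one direction and confidence.

    estimates: list of (angle_degrees, confidence) pairs, one per sensor.
    The weighting scheme is an illustrative assumption, not the patented method.
    """
    total = sum(conf for _, conf in estimates)
    if total <= 0:
        raise ValueError("at least one estimate needs positive confidence")
    # Sum confidence-weighted unit vectors so that 350 and 10 degrees
    # average to roughly 0 degrees rather than 180.
    x = sum(conf * math.cos(math.radians(angle)) for angle, conf in estimates)
    y = sum(conf * math.sin(math.radians(angle)) for angle, conf in estimates)
    fused_angle = math.degrees(math.atan2(y, x)) % 360.0
    # The resultant length, normalized by total confidence, is 1.0 when all
    # sensors agree and falls toward 0.0 as they disagree.
    fused_confidence = math.hypot(x, y) / total
    return fused_angle, fused_confidence
```

Averaging unit vectors instead of raw angles avoids the wrap-around problem at 0/360 degrees, and the resultant length gives a single agreement score that can stand in for the "comprehensive confidence".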
-
Publication No.: US11258478B1
Publication date: 2022-02-22
Application No.: US16573679
Filing date: 2019-09-17
Applicant: Amazon Technologies, Inc.
Inventor: Srivatsan Kandadai , Amit Singh Chhetri
Abstract: A device capable of autonomous motion includes a residual echo suppressor for suppressing echoes caused by an output reference signal. When the device outputs audio while moving with a velocity, it may receive echoes that are Doppler-shifted due to the motion. The residual echo suppressor generates estimated residual error data based on phase-shifted reference data to account for and suppress the Doppler-shifted echoes.
-
Publication No.: US11727912B1
Publication date: 2023-08-15
Application No.: US17707125
Filing date: 2022-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Harsha Inna Kedage Rao , Srivatsan Kandadai , Minje Kim , Tarun Pruthi , Trausti Thor Kristjansson
IPC: G10K11/178 , G10K11/175
CPC classification number: G10K11/17854 , G10K11/1754 , G10K11/17823 , G10K11/17825 , G10K11/17881 , G10K2210/3026 , G10K2210/3027 , G10K2210/3028 , G10K2210/3038 , G10K2210/505
Abstract: A system configured to perform deep adaptive acoustic echo cancellation (AEC) to improve audio processing. Due to mechanical noise and continuous echo path changes caused by movement of a device, echo signals are nonlinear and time-varying and not fully canceled by linear AEC processing alone. To improve echo cancellation, deep adaptive AEC processing integrates a deep neural network (DNN) and linear adaptive filtering to perform echo and/or noise removal. The DNN is configured to generate a nonlinear reference signal and step-size data, which the linear adaptive filtering uses to generate output audio data representing local speech. The DNN may generate the nonlinear reference signal by generating mask data that is applied to a microphone signal, such that the reference signal corresponds to a portion of the microphone signal that does not include near-end speech.
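The linear adaptive filtering stage can be sketched as a normalized LMS (NLMS) filter whose reference signal and per-sample step size are supplied from outside. In the abstract these come from the DNN; here they are plain input lists, and the choice of NLMS, the function name `adaptive_filter`, and the `taps` and `eps` parameters are all assumptions for illustration.

```python
def adaptive_filter(mic, reference, step_sizes, taps=2, eps=1e-8):
    """Remove the echo of `reference` from `mic` with an NLMS filter.

    mic, reference, step_sizes: equal-length lists of floats. In the system
    described above the reference and step sizes would be produced by a DNN;
    this sketch simply takes them as given (assumption).
    Returns the error signal, which serves as the echo-reduced output.
    """
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference samples, newest first
    output = []
    for d, x, mu in zip(mic, reference, step_sizes):
        history = [x] + history[:-1]
        echo_estimate = sum(w * h for w, h in zip(weights, history))
        err = d - echo_estimate
        # Normalize by reference energy so the step is scale-independent.
        norm = sum(h * h for h in history) + eps
        weights = [w + mu * err * h / norm for w, h in zip(weights, history)]
        output.append(err)
    return output
```

Passing the step size per sample lets an external controller slow or freeze adaptation, for example while near-end speech is present, which is the role the abstract assigns to the DNN's step-size data.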
-
Publication No.: US11218802B1
Publication date: 2022-01-04
Application No.: US16141012
Filing date: 2018-09-25
Applicant: Amazon Technologies, Inc.
Inventor: Srivatsan Kandadai , Amit Singh Chhetri , Trausti Thor Kristjansson
Abstract: A mobile device capable of capturing voice commands includes a beamformer for determining audio data corresponding to one or more directions and a beam selector for selecting in which direction a source of target audio lies. The device determines, based on data from one or more sensors, an angle through which the device has rotated. Based on the angle and one or more rotation-compensation functions, the device interpolates audio data corresponding to the one or more directions to compensate for the rotation such that the direction corresponding to the source of target audio remains selected.
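A rough sketch of the rotation compensation follows. The abstract does not define the rotation-compensation functions, so this example assumes evenly spaced beams, one scalar value per beam (for example beam energy), and linear interpolation between neighboring beams; the name `compensate_rotation` is hypothetical.

```python
import math

def compensate_rotation(beam_values, rotation_deg):
    """Re-express per-beam values in the pre-rotation frame.

    beam_values: one value per beam, beams evenly spaced over 360 degrees
    (assumption). rotation_deg: angle the device has rotated, positive in
    the direction of increasing beam index.
    Returns values indexed by the original (fixed) directions.
    """
    n = len(beam_values)
    shift = rotation_deg / (360.0 / n)  # rotation measured in beam spacings
    compensated = []
    for i in range(n):
        # Fixed direction i now falls at this fractional device-beam index.
        pos = (i - shift) % n
        lo = int(math.floor(pos)) % n
        hi = (lo + 1) % n
        frac = pos - math.floor(pos)
        compensated.append((1.0 - frac) * beam_values[lo] + frac * beam_values[hi])
    return compensated
```

With four beams and a source on device beam 1, a 90 degree rotation moves that source to fixed direction 2, so a beam selector working on the compensated values keeps pointing at the same place in the room.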
-
Publication No.: US11437057B2
Publication date: 2022-09-06
Application No.: US16950158
Filing date: 2020-11-17
Applicant: Amazon Technologies, Inc.
Inventor: Trausti Thor Kristjansson , Srivatsan Kandadai , Mark Lawrence , Balsa Laban , Anna Chen Santos , Joseph Pedro Tavares , Miroslav Ristic , Valere Joseph Vanderschaegen
Abstract: A computer-implemented method includes receiving, at a microphone of a voice-controlled device, a speech input, generating an electrical signal having a first gain level that is below a gain threshold for audible detection by a user, transmitting the electrical signal to the speaker and detecting, by the microphone, an audio signal that includes a combination of ambient noise and a probe audio signal, wherein the probe audio signal is output by the speaker based on the electrical signal. The method further includes determining a power level of the probe audio signal and determining a state of the display based on the power level of the probe audio signal.
-
Publication No.: US11158335B1
Publication date: 2021-10-26
Application No.: US16368107
Filing date: 2019-03-28
Applicant: Amazon Technologies, Inc.
Inventor: Anshuman Ganguly , Srivatsan Kandadai , Wontak Kim
Abstract: A voice-controlled device includes a beamformer for determining audio data corresponding to one or more directions and a beam selector for selecting in which direction a source of target audio lies. The device determines magnitude spectrums for each beam and for each frequency bin in each beam for each frame of audio data. The device determines frame-by-frame changes in the magnitude and filters the changes to smooth them. The device selects the beam having the greatest smoothed change in magnitude as corresponding to speech.
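The selection rule in this abstract can be sketched in a few lines. The abstract does not name the smoothing filter or say how per-bin changes are aggregated, so the one-pole exponential smoother, the sum of absolute per-bin changes, and the names `select_beam` and `alpha` are assumptions.

```python
def select_beam(frames, alpha=0.5):
    """Pick the beam with the greatest smoothed frame-to-frame magnitude change.

    frames[t][b] is the list of per-frequency-bin magnitudes for beam b at
    frame t. alpha is the smoothing coefficient (hypothetical parameter).
    Returns the index of the selected beam.
    """
    num_beams = len(frames[0])
    smoothed = [0.0] * num_beams
    for prev, cur in zip(frames, frames[1:]):
        for b in range(num_beams):
            # Total magnitude change across bins between consecutive frames.
            change = sum(abs(c - p) for p, c in zip(prev[b], cur[b]))
            # Exponential smoothing damps single-frame spikes.
            smoothed[b] = alpha * change + (1.0 - alpha) * smoothed[b]
    return max(range(num_beams), key=smoothed.__getitem__)
```

Speech tends to fluctuate from frame to frame while steady noise does not, so a beam with a large smoothed change is a plausible candidate for the talker's direction.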
-
Publication No.: US11789457B1
Publication date: 2023-10-17
Application No.: US16710718
Filing date: 2019-12-11
Applicant: Amazon Technologies, Inc.
Inventor: Byoungsam Woo , Srivatsan Kandadai
IPC: G05D1/02 , G10L21/0208 , H04R3/00 , G05D1/00 , G01S15/931 , H04R3/02
CPC classification number: G05D1/0255 , G01S15/931 , G05D1/0088 , G10L21/0208 , H04R3/005 , H04R3/02 , G10L2021/02082
Abstract: An autonomous mobile device (AMD) moves through a physical space without human intervention. Data from sensors on the AMD is used to determine if the AMD has collided with an obstacle. In one implementation the sensors may include a microphone. A collision may produce in the structure of the AMD a characteristic sound that is detected by the microphone and recognized as indicative of a collision. In another implementation information about velocity of the AMD and output from an inertial measurement unit (IMU) may be used to determine a characteristic change in motion that is indicative of a collision. Data from the microphone and the IMU may be combined to improve the reliability of collision detection.
-
Publication No.: US20200177945A1
Publication date: 2020-06-04
Application No.: US16781855
Filing date: 2020-02-04
Applicant: Amazon Technologies, Inc.
Inventor: Mark Lawrence , Balsa Laban , Anna Chen Santos , Joseph Pedro Tavares , Miroslav Ristic , Valere Joseph Vanderschaegen , Trausti Thor Kristjansson , Srivatsan Kandadai , Donald L. Cantrell, Jr.
IPC: H04N21/422 , G06F3/16 , H04N21/41 , G10L15/22
Abstract: A computer-implemented method includes receiving, at a microphone of a voice-controlled device, a speech input from a user and determining, by the voice-controlled device, that a power state of an AV display device that is coupled to the voice-controlled device is an ON or OFF state. Based on the user intent determined from the speech input and the power state of the AV display, the voice-controlled device sends data to the AV display device to switch the AV display device ON or OFF. The method can further include receiving, by the voice-controlled device, content from a content source location and sending the content to the AV display device via an AV port.
-
Publication No.: US10560737B2
Publication date: 2020-02-11
Application No.: US15919096
Filing date: 2018-03-12
Applicant: Amazon Technologies, Inc.
Inventor: Mark Lawrence , Balsa Laban , Anna Chen Santos , Joseph Pedro Tavares , Miroslav Ristic , Valere Joseph Vanderschaegen , Trausti Thor Kristjansson , Srivatsan Kandadai , Donald L. Cantrell, Jr.
IPC: H04N21/422 , G10L15/22 , H04N21/41
Abstract: A computer-implemented method includes receiving, at a microphone of a voice-controlled device, a speech input from a user and determining, by the voice-controlled device, that a power state of an AV display device that is coupled to the voice-controlled device is an ON or OFF state. Based on the user intent determined from the speech input and the power state of the AV display, the voice-controlled device sends data to the AV display device to switch the AV display device ON or OFF. The method can further include receiving, by the voice-controlled device, content from a content source location and sending the content to the AV display device via an AV port.
-
Publication No.: US11234039B2
Publication date: 2022-01-25
Application No.: US16781855
Filing date: 2020-02-04
Applicant: Amazon Technologies, Inc.
Inventor: Mark Lawrence , Balsa Laban , Anna Chen Santos , Joseph Pedro Tavares , Miroslav Ristic , Valere Joseph Vanderschaegen , Trausti Thor Kristjansson , Srivatsan Kandadai , Donald L. Cantrell, Jr.
IPC: H04N21/422 , H04N21/41 , G10L15/22 , G06F3/16
Abstract: A computer-implemented method includes receiving, at a microphone of a voice-controlled device, a speech input from a user and determining, by the voice-controlled device, that a power state of an AV display device that is coupled to the voice-controlled device is an ON or OFF state. Based on the user intent determined from the speech input and the power state of the AV display, the voice-controlled device sends data to the AV display device to switch the AV display device ON or OFF. The method can further include receiving, by the voice-controlled device, content from a content source location and sending the content to the AV display device via an AV port.