-
公开(公告)号:US12177643B1
公开(公告)日:2024-12-24
申请号:US18503816
申请日:2023-11-07
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Peter D. Callaway , Jae Woo Chang , Martin E. Johnson , Daniel K. Boothe , Kostyantyn Komarov , Patrick Miauton , Christopher M. Garrido , Austin W. Shyu , Karthick Santhanam
IPC: H04S3/00 , G06F3/0487 , H04R3/00 , H04S5/00 , H04S7/00
Abstract: A method performed a local device that is communicatively coupled with several remote devices, the method includes: receiving, from each remote device with which the local device is engaged in a communication session, an input audio stream; receiving, for each remote device, a set parameters; determining, for each input audio stream, whether the input audio stream is to be 1) rendered individually or 2) rendered as a mix of input audio streams based on the set of parameters; for each input audio stream that is determined to be rendered individually, spatially rendering the input audio stream as an individual virtual sound source that contains only that input audio stream; and for input audio streams that are determined to be rendered as the mix of input audio streams, spatially rendering the mix of input audio streams as a single virtual sound source that contains the mix of input audio streams.
-
公开(公告)号:US20230410828A1
公开(公告)日:2023-12-21
申请号:US17845655
申请日:2022-06-21
Applicant: Apple Inc.
Inventor: Ramin Pishehvar , Mehrez Souden , Sean A. Ramprashad , Jason Wung , Ante Jukic , Joshua D. Atkins
IPC: G10L21/0232 , G06V40/16 , G10L25/84 , G10L21/034 , G10L21/0364 , G10L15/25 , G10L15/06 , G10L15/22
CPC classification number: G10L21/0232 , G06V40/161 , G10L25/84 , G10L21/034 , G10L21/0364 , G10L15/25 , G10L15/063 , G10L15/22
Abstract: Disclosed is a reference-less echo mitigation or cancellation technique. The technique enables suppression of echoes from an interference signal when a reference version of the interference signal conventionally used for echo mitigation may not be available. A first stage of the technique may use a machine learning model to model a target audio area surrounding a device so that a target audio signal estimated as originating from within the target audio area may be accepted. In contrast, audio signals such as playback of media content on a TV or other interfering signals estimated as originating from outside the target audio area may be suppressed. A second stage of the technique may be a level-based suppressor that further attenuates the residual echo from the output of the first stage based on an audio level threshold. Side information may be provided to adjust the target audio area or the audio level threshold.
-
公开(公告)号:US20200005830A1
公开(公告)日:2020-01-02
申请号:US16025592
申请日:2018-07-02
Applicant: Apple Inc.
Inventor: Langford M. Wasada , Vijay Sundaram , William M. Bumgarner , Daniel H. Lloyd , Christopher J. Sanders , Sean A. Ramprashad , Sriram Hariharan , Jarrad A. Stallone , Johannes P. Schmidt , David P. Saracino , Gregory R. Chapman
Abstract: In some implementations, a computing device can calibrate media playback channels for presenting media content through a media system by determining the media propagation latency through the media system. For example, the computing device can send calibration content (e.g., audio data, video data, etc.) to various playback devices (e.g., playback channels) of the media system and record a timestamp indicating when the calibration content was sent. When the playback devices present the calibration content, a sensor device (e.g., remote control device, smartphone, etc.) can detect the presentation of the calibration content. The sensor device can send calibration data (e.g., media samples that may include the calibration content and/or a timestamp indicating when the media sample was detected by the sensor device) to the computing device. The computing device can determine the propagation latency (e.g., presentation delay) based on the calibration data received from the sensor device.
-
公开(公告)号:US10187504B1
公开(公告)日:2019-01-22
申请号:US15275311
申请日:2016-09-23
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Aram M. Lindahl , Joseph M. Williams
Abstract: A device and a corresponding method are provided to tune parameters of an echo control process without re-initializing the echo control process and without interrupting a playback process. A state of the device and environment around the device is computed during use of the device given information from sensors. Such sensors can give information on the position of the device, the orientation of the device, the presence of a proximate object, or handling of the device resulting in occlusion of microphones and loudspeakers, among other things. The computed state of the device is mapped to an associated device state code from among a plurality of device state codes. The parameters of the echo control process are tuned either according to the associated device state code, or a change in such a code, during use of the device.
-
公开(公告)号:US20180137864A1
公开(公告)日:2018-05-17
申请号:US15871836
申请日:2018-01-15
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Harvey D. Thornburg , Arvindh Krishnaswamy , Aram M. Lindahl
Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.
-
公开(公告)号:US20160219386A1
公开(公告)日:2016-07-28
申请号:US15019521
申请日:2016-02-09
Applicant: Apple Inc.
Inventor: Arvindh Krishnaswamy , David T. Yeh , Juha O. Merimaa , Sean A. Ramprashad
CPC classification number: H04R29/005 , H04R3/00 , H04R29/004 , H04R2499/11
Abstract: Systems and methods for determining the operating condition of multiple microphones of an electronic device are disclosed. A system can include a plurality of microphones operative to receive signals, a microphone condition detector, and a plurality of microphone condition determination sources. The microphone condition detector can determine a condition for each of the plurality of microphones by using the received signals and accessing at least one microphone condition determination source.
-
公开(公告)号:US20230020542A1
公开(公告)日:2023-01-19
申请号:US17947042
申请日:2022-09-16
Applicant: Apple Inc.
Inventor: Darius A. Satongar , Per Håkan Linus Persson , Sean B. Kelly , Martin E. Johnson , Tony S. Verma , Peter D. Callaway , Jae Woo Chang , Daniel K. Boothe , Sean A. Ramprashad , Patrick Miauton , Christopher M. Garrido , Mitchell R. Lerner , Charles C. Hoyt
IPC: H04S7/00 , G06F3/0486 , G06F3/16 , H04N7/15
Abstract: A computer system outputs audio content via one or more audio output devices. If the audio content includes information that enables spatialization of the audio content, the system outputs the audio content in a simulated three-dimensional environment, including, if the audio content corresponds to a first category of content, causing the one or more audio output devices to simulate production of the audio content in a first virtual space, and if the audio content corresponds to a second category of content, causing the one or more audio output devices to simulate production of the audio content in a second virtual space that has different simulated acoustic properties than simulated acoustic properties of the first virtual space.
-
公开(公告)号:US10861210B2
公开(公告)日:2020-12-08
申请号:US16033111
申请日:2018-07-11
Applicant: Apple Inc.
Inventor: Carlos M. Avendano , Sean A. Ramprashad
IPC: G10L21/013 , G06T13/20 , G06T13/40 , G10L21/003
Abstract: Embodiments of the present disclosure can provide systems, methods, and computer-readable medium for providing audio and/or video effects based at least in part on facial features and/or voice feature characteristics of the user. For example, video and/or an audio signal of the user may be recorded by a device. Voice audio features and facial feature characteristics may be extracted from the voice audio signal and the video, respectively. The facial features of the user may be used to modify features of a virtual avatar to emulate the facial feature characteristics of the user. The extracted voice audio features may modified to generate an adjusted audio signal or an audio signal may be composed from the voice audio features. The adjusted/composed audio signal may simulate the voice of the virtual avatar. A preview of the modified video/audio may be provided at the user's device.
-
公开(公告)号:US20190251974A1
公开(公告)日:2019-08-15
申请号:US16389697
申请日:2019-04-19
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Harvey D. Thornburg , Arvindh Krishnaswamy , Aram M. Lindahl
Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.
-
公开(公告)号:US10313808B1
公开(公告)日:2019-06-04
申请号:US15455760
申请日:2017-03-10
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Adam E. Kriegel , Sylvain J. Choisel , Afrooz Family
Abstract: An electronic device having a device housing includes a loudspeaker and several microphones within the device housing. A control circuit is electrically coupled to the loudspeaker and microphones. The loudspeaker produces speech and/or music. The control circuit determines a statistical measure for a first data set representing individual impulse responses from the plurality of microphones and compares that to a predetermined statistical measure for a second data set representing individual object-free impulse responses from the plurality of microphones to determine if an object is near the device. The statistical measure may be variance and may be computed in the time domain. Variance may be calculated using differences between the individual impulse responses and a mean impulse response that is a linear combination of the impulse responses for the plurality of microphones. The control circuit may include echo cancellers to mitigate common signals and/or other acoustic sources.
-
-
-
-
-
-
-
-
-