Spatial audio controller
    1.
    发明授权

    公开(公告)号:US12177643B1

    公开(公告)日:2024-12-24

    申请号:US18503816

    申请日:2023-11-07

    Applicant: Apple Inc.

    Abstract: A method performed a local device that is communicatively coupled with several remote devices, the method includes: receiving, from each remote device with which the local device is engaged in a communication session, an input audio stream; receiving, for each remote device, a set parameters; determining, for each input audio stream, whether the input audio stream is to be 1) rendered individually or 2) rendered as a mix of input audio streams based on the set of parameters; for each input audio stream that is determined to be rendered individually, spatially rendering the input audio stream as an individual virtual sound source that contains only that input audio stream; and for input audio streams that are determined to be rendered as the mix of input audio streams, spatially rendering the mix of input audio streams as a single virtual sound source that contains the mix of input audio streams.

    Echo control based on state of a device

    公开(公告)号:US10187504B1

    公开(公告)日:2019-01-22

    申请号:US15275311

    申请日:2016-09-23

    Applicant: Apple Inc.

    Abstract: A device and a corresponding method are provided to tune parameters of an echo control process without re-initializing the echo control process and without interrupting a playback process. A state of the device and environment around the device is computed during use of the device given information from sensors. Such sensors can give information on the position of the device, the orientation of the device, the presence of a proximate object, or handling of the device resulting in occlusion of microphones and loudspeakers, among other things. The computed state of the device is mapped to an associated device state code from among a plurality of device state codes. The parameters of the echo control process are tuned either according to the associated device state code, or a change in such a code, during use of the device.

    MULTI-MICROPHONE SPEECH RECOGNITION SYSTEMS AND RELATED TECHNIQUES

    公开(公告)号:US20180137864A1

    公开(公告)日:2018-05-17

    申请号:US15871836

    申请日:2018-01-15

    Applicant: Apple Inc.

    CPC classification number: G10L15/32 G10L15/20

    Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.

    Techniques for providing audio and video effects

    公开(公告)号:US10861210B2

    公开(公告)日:2020-12-08

    申请号:US16033111

    申请日:2018-07-11

    Applicant: Apple Inc.

    Abstract: Embodiments of the present disclosure can provide systems, methods, and computer-readable medium for providing audio and/or video effects based at least in part on facial features and/or voice feature characteristics of the user. For example, video and/or an audio signal of the user may be recorded by a device. Voice audio features and facial feature characteristics may be extracted from the voice audio signal and the video, respectively. The facial features of the user may be used to modify features of a virtual avatar to emulate the facial feature characteristics of the user. The extracted voice audio features may modified to generate an adjusted audio signal or an audio signal may be composed from the voice audio features. The adjusted/composed audio signal may simulate the voice of the virtual avatar. A preview of the modified video/audio may be provided at the user's device.

    MULTI-MICROPHONE SPEECH RECOGNITION SYSTEMS AND RELATED TECHNIQUES

    公开(公告)号:US20190251974A1

    公开(公告)日:2019-08-15

    申请号:US16389697

    申请日:2019-04-19

    Applicant: Apple Inc.

    CPC classification number: G10L15/32 G10L15/20

    Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.

    Method and apparatus to sense the environment using coupled microphones and loudspeakers and nominal playback

    公开(公告)号:US10313808B1

    公开(公告)日:2019-06-04

    申请号:US15455760

    申请日:2017-03-10

    Applicant: Apple Inc.

    Abstract: An electronic device having a device housing includes a loudspeaker and several microphones within the device housing. A control circuit is electrically coupled to the loudspeaker and microphones. The loudspeaker produces speech and/or music. The control circuit determines a statistical measure for a first data set representing individual impulse responses from the plurality of microphones and compares that to a predetermined statistical measure for a second data set representing individual object-free impulse responses from the plurality of microphones to determine if an object is near the device. The statistical measure may be variance and may be computed in the time domain. Variance may be calculated using differences between the individual impulse responses and a mean impulse response that is a linear combination of the impulse responses for the plurality of microphones. The control circuit may include echo cancellers to mitigate common signals and/or other acoustic sources.

Patent Agency Ranking