摘要:
Metadata comprising a set of gain values for creating a dominance effect is automatically generated. Automatically generating the metadata includes receiving multiple audio streams and a dominance criterion for at least one of the audio streams. A set of gains is computed for one or more audio streams based on the dominance criterion for the at least one audio stream and metadata is generated with the set of gains.
摘要:
Metadata comprising a set of gain values for creating a dominance effect is automatically generated. Automatically generating the metadata includes receiving multiple audio streams and a dominance criterion for at least one of the audio streams. A set of gains is computed for one or more audio streams based on the dominance criterion for the at least one audio stream and metadata is generated with the set of gains.
摘要:
The invention relates to audio signal processing. More specifically, the invention relates to enhancing multichannel audio, such as television audio, by applying a gain to the audio that has been smoothed between portions of the audio. The invention relates to methods, apparatus for performing such methods, and to software stored on a computer-readable medium for causing a computer to perform such methods.
摘要:
A personal sound system is described that includes a wireless network supporting an ear-level module, a companion module and a phone. Other audio sources are supported as well. A configuration processor configures the ear-level module and the companion module for private communications, and configures the ear-level module for a plurality of signal processing modes, including a hearing aid mode, for a corresponding plurality of sources of audio data. The ear module is configured to handle variant audio sources, and control switching among them.
摘要:
In one embodiment the present invention includes a method of improving audibility of speech in a multi-channel audio signal. The method includes comparing a first characteristic and a second characteristic of the multi-channel audio signal to generate an attenuation factor. The first characteristic corresponds to a first channel of the multi-channel audio signal that contains speech and non-speech audio, and the second characteristic corresponds to a second channel of the multi-channel audio signal that contains predominantly non-speech audio. The method further includes adjusting the attenuation factor according to a speech likelihood value to generate an adjusted attenuation factor. The method further includes attenuating the second channel using the adjusted attenuation factor.
摘要:
The invention relates to audio signal processing and speech enhancement. In accordance with one aspect, the invention combines a high-quality audio program that is a mix of speech and non-speech audio with a lower-quality copy of the speech components contained in the audio program for the purpose of generating a high-quality audio program with an increased ratio of speech to non-speech audio such as may benefit the elderly, hearing impaired or other listeners. Aspects of the invention are particularly useful for television and home theater sound, although they may be applicable to other audio and sound applications. The invention relates to methods, apparatus for performing such methods, and to software stored on a computer-readable medium for causing a computer to perform such methods.
摘要:
A method and system for filtering a multi-channel audio signal having a speech channel and at least one non-speech channel, to improve intelligibility of speech determined by the signal. In typical embodiments, the method includes steps of determining at least one attenuation control value indicative of a measure of similarity between speech-related content determined by the speech channel and speech-related content determined by the non-speech channel, and attenuating the non-speech channel in response to the at least one attenuation control value. Typically, the attenuating step includes scaling of a raw attenuation control signal (e.g., a ducking gain control signal) for the non-speech channel in response to the at least one attenuation control value. Some embodiments are a general or special purpose processor programmed with software or firmware and/or otherwise configured to perform filtering in accordance the invention.
摘要:
A personal sound system is described that includes a wireless network supporting an ear-level module, a companion module and a phone. Other audio sources are supported as well. A configuration processor configures the ear-level module and the companion module for private communications, and configures the ear-level module for a plurality of signal processing modes, including a hearing aid mode, for a corresponding plurality of sources of audio data. The ear module is configured to handle variant audio sources, and control switching among them.
摘要:
At least one segment is identified in an audio signal. The audio segment is associated with an artifact within the audio signal and has a time duration. At least one stored sound clip is retrieved, which has a time duration that exceeds the time duration associated with the audio segment. The retrieved sound clip is mixed with the audio signal and the retrieved sound clip audibly compensates for the audio artifact.
摘要:
At least one segment is identified in an audio signal. The audio segment is associated with an artifact within the audio signal and has a time duration. At least one stored sound clip is retrieved, which has a time duration that exceeds the time duration associated with the audio segment. The retrieved sound clip is mixed with the audio signal and the retrieved sound clip audibly compensates for the audio artifact.