摘要:
A frequency band of an audio input signal is analyzed to determine if a transient is present. When transients are detected, modifications are made to the intensity levels corresponding to the frequency band for a brief time period.
摘要:
A digital signal is processed by splitting it into at least two frequency subbands and the two subband signals are downsampled. A filter is applied in at least one of the subband signals. At least one of the phase and magnitude of the subband filtered signals is matched in the transition frequency band between the two subbands.
摘要:
The present technology provides a robust noise suppression system that may concurrently reduce noise and echo components in an acoustic signal while limiting the level of speech distortion. A time-domain acoustic signal may be received and be transformed to frequency-domain sub-band signals. Features, such as pitch, may be identified and tracked within the sub-band signals. Initial speech and noise models may be then be estimated at least in part from a probability analysis based on the tracked pitch sources. Speech and noise models may be resolved from the initial speech and noise models and noise reduction may be performed on the sub-band signals. An acoustic signal may be reconstructed from the noise-reduced sub-band signals.
摘要:
An input signal is converted to a feature-space representation. The feature-space representation is projected onto a discriminant subspace using a linear discriminant analysis transform to enhance the separation of feature clusters. Dynamic programming is used to find global changes to derive optimal cluster boundaries. The cluster boundaries are used to identify the segments of the audio signal.
摘要:
A frequency band of an audio input signal is analyzed to determine if a transient is present. When transients are detected, modifications are made to the intensity levels corresponding to the frequency band for a brief time period.
摘要:
The present technology provides a robust noise suppression system that may concurrently reduce noise and echo components in an acoustic signal while limiting the level of speech distortion. An acoustic signal may be received and transformed to cochlear domain sub-band signals. Features, such as pitch, may be identified and tracked within the sub-band signals. Initial speech and noise models may be then be estimated at least in part from a probability analysis based on the tracked pitch sources. Speech and noise models may be resolved from the initial speech and noise models and noise reduction may be performed on the sub-band signals. An acoustic signal may be reconstructed from the noise-reduced sub-band signals.
摘要:
A frequency domain method for phase-amplitude matrixed surround decoding of 2-channel stereo recordings and soundtracks, based on spatial analysis of 2-D or 3-D directional cues in the recording and re-synthesis of these cues for reproduction on any headphone or loudspeaker playback system.
摘要:
The present technology provides a robust noise suppression system which may concurrently reduce noise and echo components in an acoustic signal while limiting the level of speech distortion. An acoustic signal may be received and transformed to cochlear domain sub-band signals. Features such as pitch may be identified and tracked within the sub-band signals. Initial speech and noise models may be then be estimated at least in part from a probability analysis based on the tracked pitch sources. Speech and noise models may be resolved from the initial speech and noise models and noise reduction may be performed on the sub-band signals and an acoustic signal may be reconstructed from the noise-reduced sub-band signals.
摘要:
A method and apparatus for transient detection and time-scaling an audio signal detects transients and scales only intervals located between transients to avoid artifacts. In one embodiment, the transient detection process compares frequency characteristic energy between succeeding windows of the audio signal and calculates values of an energy curve where the energy increases. Transients are detected at maxima of the energy curve.
摘要:
A method and apparatus for removing or amplifying voice or other signals panned to the center of a stereo recording utilizes frequency domain techniques to calculate a frequency dependent gain factor based on the difference between the frequency domain spectra of the stereo channels.