Abstract:
Provided are a method and an apparatus for synthesizing a separated sound source, the method including generating spatial information associated with a sound source included in a frame of a stereo audio signal, and synthesizing a separated frequency-domain sound source from the frame of the stereo audio signal based on the spatial information, wherein the spatial information includes a frequency-azimuth plane representing an energy distribution over the frequencies and azimuths of the frame of the stereo audio signal.
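One way to picture a frequency-azimuth plane is the gain-scaling approach sketched below. It is an illustrative assumption, not the method claimed in the abstract: for each frequency bin, the right channel is scaled by a set of trial gains and subtracted from the left channel, and the gain at which the residual is smallest is taken as the azimuth of the source in that bin. The function name `frequency_azimuth_plane`, the parameter `n_azimuths`, and the gain-to-azimuth mapping are all hypothetical, and the sketch only covers sources whose left-channel level does not exceed the right-channel level.

```python
def frequency_azimuth_plane(left, right, n_azimuths=11):
    """Return plane[k][a]: estimated energy at frequency bin k, azimuth index a.

    left, right: complex spectra of one stereo frame (same length).
    Azimuth index a corresponds to a trial gain g = a / (n_azimuths - 1).
    This is a hypothetical sketch, not the patented procedure.
    """
    plane = []
    for l_bin, r_bin in zip(left, right):
        # Residual magnitude after scaling the right channel by each trial gain.
        residual = [abs(l_bin - (a / (n_azimuths - 1)) * r_bin)
                    for a in range(n_azimuths)]
        null_index = residual.index(min(residual))
        row = [0.0] * n_azimuths
        # The depth of the null serves as the energy estimate at that azimuth.
        row[null_index] = max(residual) - min(residual)
        plane.append(row)
    return plane


# Two bins: the first panned with gain 0.5, the second with gain 0.2.
plane = frequency_azimuth_plane([0.5 + 0j, 0.2 + 0j], [1 + 0j, 1 + 0j])
```

Each row of `plane` holds a single non-zero entry at the azimuth index where the bin's energy is assigned, so summing rows over a range of azimuth indices would select the bins attributed to one source.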
Abstract:
A microphone receives speech data to be transmitted. A business wireless communication unit transmits the received speech data. A processing unit causes a storage unit to store the speech data and time information for identifying blocks produced by dividing the received speech data into a plurality of blocks. The business wireless communication unit receives a request signal for requesting transmission of the speech data stored in the storage unit, the request signal including the time information. The processing unit acquires from the storage unit the speech data for the block identified based on the time information included in the received request signal, and causes the business wireless communication unit to transmit the acquired speech data.
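The store-and-retransmit behaviour described above can be sketched as a small block store. This is a minimal sketch under stated assumptions: the class name `SpeechBlockStore`, fixed-length blocks, and the use of each block's start time as its time information are all hypothetical choices, since the abstract does not say how blocks are sized or identified.

```python
from bisect import bisect_right


class SpeechBlockStore:
    """Keeps speech samples in blocks keyed by start time (hypothetical design)."""

    def __init__(self, block_seconds, sample_rate):
        self.sample_rate = sample_rate
        self.block_len = int(block_seconds * sample_rate)
        self.start_times = []  # start time of each block, in ascending order
        self.blocks = []

    def store(self, samples, start_time):
        """Divide the samples into blocks and record each block's start time."""
        for offset in range(0, len(samples), self.block_len):
            self.start_times.append(start_time + offset / self.sample_rate)
            self.blocks.append(samples[offset:offset + self.block_len])

    def fetch(self, requested_time):
        """Return the block covering requested_time, or None if there is none."""
        i = bisect_right(self.start_times, requested_time) - 1
        if i < 0:
            return None
        end_time = self.start_times[i] + len(self.blocks[i]) / self.sample_rate
        return self.blocks[i] if requested_time < end_time else None


store = SpeechBlockStore(block_seconds=0.5, sample_rate=8)
store.store(list(range(10)), start_time=100.0)
block = store.fetch(100.6)  # the block that covers t = 100.6 s
```

A request signal would carry `requested_time`; the processing unit would then hand the returned block to the transmitter.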
Abstract:
An integrated sensor-array processor and method includes sensor array time-domain input ports to receive sensor signals from time-domain sensors. A sensor transform engine (STE) creates sensor transform data from the sensor signals and applies sensor calibration adjustments. Transducer time-domain input ports receive time-domain transducer signals, and a transducer output transform engine (TTE) generates transducer output transform data from the transducer signals. A spatial filter engine (SFE) applies suppression coefficients to the sensor transform data, to suppress target signals received from noise locations and/or amplification locations. A blocking filter engine (BFE) applies subtraction coefficients to the sensor transform data, to subtract the target signals from the sensor transform data. A noise reduction filter engine (NRE) subtracts noise signals from the BFE output. An inverse transform engine (ITE) generates time-domain data from the NRE output.
Abstract:
A quantization device includes: a trellis-structured vector quantizer which quantizes a first error vector between an N-dimensional subvector (where N is two or more) and a first predictive vector; and an inter-frame predictor which generates the first predictive vector from the quantized N-dimensional subvector, wherein the inter-frame predictor uses a predictive coefficient comprising an N×N matrix and performs an inter-frame prediction using the quantized N-dimensional subvector of a previous stage.
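The prediction loop can be sketched as follows. Two things here are assumptions rather than anything the abstract states: the "previous stage" is read as the previous frame, and a uniform scalar rounding quantizer stands in for the trellis-structured vector quantizer, which is not reproduced. The names `mat_vec`, `predictive_quantize`, and `step` are hypothetical.

```python
def mat_vec(matrix, vector):
    """Multiply an N x N matrix (list of rows) by an N-dimensional vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def predictive_quantize(subvectors, coeff, step=0.25):
    """Quantize each subvector's prediction error and return the reconstructions.

    coeff is the N x N predictive coefficient matrix. The rounding quantizer
    below is only a stand-in for a trellis-structured vector quantizer.
    """
    prev_quantized = [0.0] * len(coeff)
    reconstructed = []
    for x in subvectors:
        # Predict the current subvector from the previous quantized one.
        predicted = mat_vec(coeff, prev_quantized)
        # Error vector between the input subvector and its prediction.
        error = [xi - pi for xi, pi in zip(x, predicted)]
        # Stand-in quantizer: round each error component to a multiple of step.
        q_error = [step * round(e / step) for e in error]
        prev_quantized = [pi + qi for pi, qi in zip(predicted, q_error)]
        reconstructed.append(prev_quantized)
    return reconstructed


frames = [[1.0, 2.0], [1.1, 1.9], [0.7, 2.4]]
coeff = [[0.5, 0.1], [0.0, 0.5]]
decoded = predictive_quantize(frames, coeff)
```

Because the predictor runs on quantized values only, a decoder that receives the quantized errors can rebuild the same predictions without access to the original subvectors.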
Abstract:
An apparatus decodes data segments representing a time-domain data stream, where a data segment is encoded either in the time domain or in the frequency domain, and a data segment encoded in the frequency domain has successive blocks of data representing successive, overlapping blocks of time-domain data samples. The apparatus includes a time-domain decoder for decoding a data segment encoded in the time domain, and a processor for processing the data segment encoded in the frequency domain together with output data of the time-domain decoder to obtain overlapping time-domain data blocks. The apparatus further includes an overlap/add-combiner for combining the overlapping time-domain data blocks to obtain a decoded data segment of the time-domain data stream.
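The overlap/add step on its own can be sketched as below. This is a generic illustration and not the decoder in the abstract: it assumes equal-length blocks with a fixed hop size, and it uses a squared sine window in the example because overlapping halves of that window sum to one. The names `overlap_add` and `sine_window` are hypothetical.

```python
import math


def overlap_add(blocks, hop):
    """Sum equal-length time-domain blocks, each shifted by hop samples."""
    block_len = len(blocks[0])
    out = [0.0] * (hop * (len(blocks) - 1) + block_len)
    for b, block in enumerate(blocks):
        for i, sample in enumerate(block):
            out[b * hop + i] += sample
    return out


def sine_window(n):
    """Sine window; its square sums to 1 across a 50 % overlap."""
    return [math.sin(math.pi * (i + 0.5) / n) for i in range(n)]


# Three blocks of a constant signal, windowed twice (analysis and synthesis).
w = sine_window(8)
blocks = [[wi * wi * 1.0 for wi in w] for _ in range(3)]
signal = overlap_add(blocks, hop=4)
```

In the fully overlapped region of `signal` the windowed blocks add back up to the original constant, which is why the first and last half-blocks need a neighbouring segment to overlap with.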
Abstract:
An audio decoder provides decoded audio information on the basis of encoded audio information. The audio decoder has an error concealment configured to provide error concealment audio information for concealing a loss of an audio frame, wherein the error concealment is configured to modify a time-domain excitation signal obtained for one or more audio frames preceding a lost audio frame, in order to obtain the error concealment audio information.
Abstract:
An audio signal decoder includes a context-based spectral value decoder configured to decode a codeword describing one or more spectral values, or at least a portion of a number representation thereof, in dependence on a context state. The audio signal decoder also includes a context state determinator configured to determine a current context state in dependence on one or more previously decoded spectral values, and a time-warping frequency-domain-to-time-domain converter configured to provide a time-warped time-domain representation of a given audio frame on the basis of a set of decoded spectral values provided by the context-based spectral value decoder and in dependence on time warp information. The context state determinator is configured to adapt the determination of the context state to a change of a fundamental frequency between subsequent audio frames. An audio signal encoder applies a comparable concept.
Abstract:
A method for resampling an audio-frequency signal to an output sampling frequency, for a current signal frame. The method is used when the preceding frame is sampled at a first sampling frequency that differs from a second sampling frequency of the current frame. The method includes: determining first and second segments of the signal by adding zero-valued samples at the end of stored samples of the preceding frame and at the start of samples of the current frame, respectively; obtaining a first resampled segment and a second resampled segment by applying at least one resampling filter to the first segment, resampling it from the first frequency to the output frequency, and to the second segment, resampling it from the second frequency to the output frequency; and combining the overlapping portions of the first and second resampled segments to obtain at least one portion of the resampled current frame.
Abstract:
A method and an apparatus for extracting a signal from an audio signal are provided. The audio signal is converted into a plurality of frames arranged in chronological order, and spectral data is obtained for each frame. The spectral data of N frames is extracted in chronological order, and a spectral connectivity operation is executed on the N frames. Finally, a signal made up of frames that have spectral connectivity with their adjacent frames is determined to be an ideal signal.