摘要:
Methods and systems for using pitch predictors in speech/audio coders are provided. Techniques for optimal pre- and post-filtering are presented, and a general result that post-filtering is more effective than pre-filtering is derived. A practical paired-zero filter design for the low-rate regime is proposed, and this design is extended to handle frequency-dependent periodicity levels. Further, the methods described provide a general performance measure for a post-filter that only uses information available at the decoder, thereby allowing for the optimization or selection of a post-filter without increasing the rate.
摘要:
Methods and devices for encoding and decoding are provided. A source signal value is encoded by a quantization index determined using a partition into quantization cells. Decoding of the quantization index takes place by sampling a reconstruction probability distribution, thereby obtaining a reconstructed signal value, such that the reconstructed signal value lies in the same quantization cell as the source signal value. In one embodiment, encoding and decoding are such that their succession preserves the source signal distribution. In another embodiment, the partition and the reconstruction probability distribution are determined in such manner that the quantization error is minimized subject to a constraint on the relative entropy between the source signal and the reconstructed signal.
摘要:
A system for transmitting data packets representing a source signal across a packet data network is provided. The encoder comprises a first encoder (110) and a redundancy encoder (120). The redundancy encoding is generated with a bit rate continuously scalable, the bit rate being provided by a bit rate controller (142) that uses input from the network (130) and packet-loss rate information. At the decoder, recovery is performed by a parameter estimator based in part on information transmitted from the first encoder using information from previous and/or future blocks and in addition on redundant information. The method may be added to existing lossy source coding systems or may be used to enhance the quality of the reconstructed source signal even in scenarios without packet loss.
摘要:
Methods and devices for encoding and decoding are provided. A source signal value is encoded by a quantization index determined using a partition into quantization cells. Decoding of the quantization index takes place by sampling a reconstruction probability distribution, thereby obtaining a reconstructed signal value, such that the reconstructed signal value lies in the same quantization cell as the source signal value. In one embodiment, encoding and decoding are such that their succession preserves the source signal distribution. In another embodiment, the partition and the reconstruction probability distribution are determined in such manner that the quantization error is minimized subject to a constraint on the relative entropy between the source signal and the reconstructed signal.
摘要:
The invention relates to methods and apparatuses for encoding and decoding of a video sequence. In connection with encoding/decoding a video sequence it is desirable to increase the video quality without having to increase the bit-rate for the encoded video too much, thereby still providing a bit-efficient representation of the video. If multiple descriptions of the video sequence is used the invention improves the video quality without any increase of the bit-rate. According to the invention, this is achieved by using two or more coding units for encoding the same video sequence, wherein the encoding units perform their encoding operations displaced in time in relation to each other. Correspondingly, two or more decoding units are used for decoding the same video sequence, wherein the decoding units perform their decoding operations displaced in time in relation to each other.
摘要:
The invention relates to methods for encoding/decoding a digitized sound signal which is transmitted over a packet switched network in such a way that diversity is provided. Each sound segment of the digitized sound signal is encoded into at least two different segment descriptions, i.e. into different representations of the sound signal segment. The segment descriptions differ from each other in that they provide different quantization values for a digital sound signal sample. The different segment descriptions are transmitted in separate data packets over the network at different points of time.
摘要:
Provided are methods and systems for finding the location of sensors (e.g., microphones) with unknown internal delays based on a set of events (e.g., acoustic events) with unknown event time. A localization algorithm may iteratively run to compute the acoustic event times, the observation delays, and the relative locations of the events and the sensors.
摘要:
The invention relates to a method, device and computer-program product for suppression of undesired temporal variations, notably flicker, in a sequence of video frames. Histogram-based and similar approaches generally do not remove all flicker. Features that are resolved only in portions of the flicker cycle will manifest themselves as residual flicker. This effect is near-universal in bright regions of a scene. The inventive solution is a mapping that aims to resolve in the output only those features that are resolved in all frames of the flicker cycle. Use of time-maximal quantile values may preserve non-resolution of such image features that are unresolved due to intermittent bright saturation. Thus, in one embodiment, a reduction of resolution is attained by means of a pixel-value mapping based on selecting, over a time window, maximal and minimal quantile values, with maximal values being used for bright spatial regions and minimal values for dark spatial regions.
摘要:
Systems and methods for using distributed processing in conjunction with blind source separation techniques for signal processing and acquisition in sensor network environments are provided. In the distributed blind source separation framework, sensors each perform some processing of sensor signals rather than transmitting such signals over long distances, and/or outside of the sensor network, to be processed at a central location. Sensors attempt to own a source signal, which can only be owned by one active sensor. Sensors owning a source signal broadcast the signal directly or indirectly so it is perceived by users. Sensors receive information from other sensors in their sensor neighborhood, including observed signals of the other sensors and the estimated source signals of sources owned by the other sensors. This allows owning sensors to extract the respective source signals of the sources they own and all redundant sensors to check for any non-owned source signals present.
摘要:
A non-intrusive signal quality assessment apparatus includes a feature vector calculator that determines parameters representing frames of a signal and extracts a collection of per-frame feature vectors (φ;(n)) representing structural information of the signal from the parameters. A frame selector preferably selects only frames (Ω\with a feature vector (φ;(n)) lying within a predetermined multi-dimensional window (Θ). Means determine a global feature set (ψ) over the collection of feature vectors (φ;(n)) from statistical moments of selected feature vector components ((1^,02, . . . O11). A quality predictor predicts a signal quality measure (Qj from the global feature set (ψ)).