Abstract:
A method including: receiving at least a first input audio channel and a second input audio channel; and using an inter-channel prediction model to form at least an inter-channel direction of reception parameter.
Abstract:
A method comprising determining a reverberation time estimate for an audio signal from a first part of an encoded audio signal representing the audio signal.
Abstract:
A method and related apparatuses, the method comprising: setting up a network of a plurality of mobile devices (208); obtaining sensory data from at least one mobile device of the network (210); analyzing the obtained sensory data in order to detect at least one predetermined event (212); and in response to detecting at least one predetermined event (214), providing an indication about the detected event to at least one mobile terminal of the network.
Abstract:
Apparatus comprising at least one processor and at least one memory including computer code, the at least one memory and the computer code configured to, with the at least one processor, cause the apparatus to at least perform: determining at least one impulse response model pulse train from at least one audio signal; and determining a presence of an object from the at least one impulse response model pulse train.
Abstract:
An encoder comprising an input for inputting frames of an audio signal in a frequency band, at least a first excitation block for performing a first excitation for a speech-like audio signal, and a second excitation block for performing a second excitation for a non-speech-like audio signal. The encoder further comprises a filter for dividing the frequency band into a plurality of sub-bands, each having a narrower bandwidth than the frequency band. The encoder also comprises an excitation selection block for selecting one excitation block among the at least first excitation block and the second excitation block for performing the excitation for a frame of the audio signal on the basis of the properties of the audio signal in at least one of the sub-bands. The invention also relates to a device, a system, a method and a storage medium for a computer program.
Abstract:
A processor configured to receive respective image data, representative of images, of the same subject scene from two or more image capture sources spaced apart at a particular predetermined distance; identify corresponding features from the respective image data; determine the change in position of the identified features represented in the respective image data; and identify the depth-order of the identified features according to their determined relative change in position to allow for depth-order display of the identified features according to their determined relative change in position.
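The depth-ordering step described above can be illustrated with a minimal sketch. It assumes two horizontally spaced capture sources, features that have already been matched between the two images, and that a larger change in position between the views means the feature is nearer. The function name `depth_order` and the dictionary input format are hypothetical and not taken from the abstract.

```python
def depth_order(left_positions, right_positions):
    """Return feature names ordered nearest-first.

    left_positions / right_positions map a feature name to its (x, y)
    pixel position in each image (hypothetical input format).
    """
    shifts = {}
    for name, (x_left, _y_left) in left_positions.items():
        if name not in right_positions:
            continue  # no corresponding feature in the second image
        x_right, _y_right = right_positions[name]
        # Change in horizontal position of the feature between the two views.
        shifts[name] = abs(x_left - x_right)
    # Assumption: a larger shift means the feature is closer to the sources.
    return sorted(shifts, key=lambda n: shifts[n], reverse=True)


# Illustrative, made-up feature positions.
left = {"tree": (120, 80), "house": (300, 90), "hill": (500, 60)}
right = {"tree": (90, 80), "house": (290, 90), "hill": (498, 60)}
order = depth_order(left, right)
```

With these made-up positions the tree shifts most and the hill least, so the tree is ordered in front.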
Abstract:
A method for supporting an encoding of an audio signal is shown, wherein at least a first and a second coder mode are available for encoding a section of the audio signal. The first coder mode enables a coding based on two different coding models. A selection of a coding model is enabled by a selection rule which is based on signal characteristics which have been determined for a certain analysis window. In order to avoid a misclassification of a section after a switch to the first coder mode, it is proposed that the selection rule is activated only when sufficient sections for the analysis window have been received. The invention relates equally to a module in which this method is implemented, to a device and a system comprising such a module and to a software program product including a software code for realizing the proposed method.
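The gating of the selection rule can be sketched as follows. This is only an illustration under stated assumptions: the class `ModelSelector`, the window length, the scalar per-section feature, the 0.5 threshold, and the fallback to a default model while the window is still filling are all hypothetical; the abstract specifies only that the rule is activated once sufficient sections for the analysis window have been received.

```python
from collections import deque


class ModelSelector:
    """Selects a coding model, activating the rule only on a full window."""

    def __init__(self, window_sections=4, default_model="model_A"):
        # Analysis window holding the most recent per-section features.
        self.window = deque(maxlen=window_sections)
        self.default_model = default_model  # assumed fallback choice

    def on_mode_switch(self):
        # After a switch to the first coder mode, earlier sections are not
        # available for the analysis window, so start from empty.
        self.window.clear()

    def select(self, section_feature):
        self.window.append(section_feature)
        if len(self.window) < self.window.maxlen:
            # Selection rule not yet active: too few sections received.
            return self.default_model
        # Hypothetical rule: compare the window mean against a threshold.
        mean = sum(self.window) / len(self.window)
        return "model_A" if mean > 0.5 else "model_B"


selector = ModelSelector(window_sections=3)
selector.on_mode_switch()
choices = [selector.select(f) for f in [0.1, 0.1, 0.1, 0.1]]
```

In this run the first two sections use the default model because the window is not yet full; the rule takes over from the third section onward.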
Abstract:
A system and method for providing improved scalable error detection and cross-timing synchronization for packet-switched transmission. In one embodiment, checksum error detection is applied for the core layer and for enhancement layers of the scalable payload in such a way that dropping one or several enhancement layers from the payload does not change the value of the checksum. Only one checksum is transmitted, e.g., in the payload or in the header of the lower-layer protocol. The transmitter modifies the encoded bit stream in such a manner that the entity in the network deploying the scalable payload and removing layers from the packet does not need to recalculate the checksum placed in the payload or packet header, even when the payload size is changed. A prefix/tail bit field is added in the beginning/end of each enhancement layer to make the checksum check match with the common checksum. In another embodiment, the receiver may check the correctness of each received layer simultaneously and, if desired, utilize data only from correctly received layers.
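The idea of a checksum that survives layer removal can be shown with a small sketch. It assumes a simple additive checksum over 16-bit words modulo 2^16 and a 2-byte tail field per enhancement layer; the abstract does not specify the checksum algorithm or field size, and the helper names `pad_even`, `checksum` and `seal_layer` are hypothetical.

```python
MOD = 1 << 16


def pad_even(data: bytes) -> bytes:
    """Pad to a whole number of 16-bit words so layers stay word-aligned."""
    return data + b"\x00" if len(data) % 2 else data


def checksum(data: bytes) -> int:
    """Sum of big-endian 16-bit words modulo 2**16 (illustrative only)."""
    data = pad_even(data)
    words = (int.from_bytes(data[i:i + 2], "big")
             for i in range(0, len(data), 2))
    return sum(words) % MOD


def seal_layer(layer: bytes) -> bytes:
    """Append a tail field so the layer contributes zero to the checksum."""
    layer = pad_even(layer)
    tail = (-checksum(layer)) % MOD
    return layer + tail.to_bytes(2, "big")


core = pad_even(b"core layer payload")
enh1 = seal_layer(b"enhancement layer one")
enh2 = seal_layer(b"enhancement layer two")

# Single checksum transmitted with the packet, computed over the core layer.
common = checksum(core)
```

Because each sealed enhancement layer sums to zero, a network entity can drop `enh2`, or both enhancement layers, without recalculating `common`. A receiver can also verify a single layer by checking that its own checksum is zero.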
Abstract:
A system and method for providing improved adaptive multi-rate wideband (AMR-WB) discontinuous transmission (DTX) synchronization. According to various embodiments, an indication on the start of the inactive speech period is signalled to the decoder via a voice activity detection (VAD) flag a predetermined number of frames before the DTX period will start, i.e., before the SID_FIRST frame is received. When the VAD flag indicates active speech, or when the VAD flag has been set to zero less than the predetermined number of frames ago, the received NO_DATA frame can be classified with a high degree of reliability as active speech, i.e., considered as transmitter, network or terminal-initiated signalling, and can be substituted by a SPEECH_LOST frame. When the VAD flag was set to zero eight frames ago or earlier, the NO_DATA frame is classified as DTX.
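The classification rule for a received NO_DATA frame can be written as a short sketch. The function name `classify_no_data` and its arguments are hypothetical; the value of eight frames is taken from the example in the abstract and stands in for the predetermined number of frames.

```python
# Predetermined number of frames between the VAD flag going to zero and the
# start of the DTX period (eight in the abstract's example).
PREDETERMINED_FRAMES = 8


def classify_no_data(vad_flag: int, frames_since_vad_zero: int) -> str:
    """Classify a received NO_DATA frame.

    vad_flag: 1 for active speech, 0 for inactive speech.
    frames_since_vad_zero: frames elapsed since the flag was set to zero
    (ignored while vad_flag is 1).
    """
    if vad_flag == 1 or frames_since_vad_zero < PREDETERMINED_FRAMES:
        # Still within active speech: treat the frame as lost speech.
        return "SPEECH_LOST"
    # Flag was cleared at least the predetermined number of frames ago.
    return "DTX"


during_speech = classify_no_data(vad_flag=1, frames_since_vad_zero=0)
early = classify_no_data(vad_flag=0, frames_since_vad_zero=3)
at_limit = classify_no_data(vad_flag=0, frames_since_vad_zero=8)
```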