Abstract:
In accordance with an embodiment, a method of generating an encoded audio signal includes estimating a time-frequency energy of an input audio signal from a time-frequency filter bank, computing a global variance of the time-frequency energy, determining a post-processing method according to the global variance, and transmitting an encoded representation of the input audio signal along with an indication of the determined post-processing method.
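The selection step can be illustrated with a minimal Python sketch. The abstract does not define the variance formula, the decision rule, or the signalling format, so everything here is an assumption: the global variance is taken as the population variance over all time-frequency bins, a single hypothetical threshold chooses between two placeholder labels (`method_a`, `method_b`), and the indication is carried as a dictionary field.

```python
from statistics import pvariance


def global_variance(tf_energy):
    """Population variance over every time-frequency bin (assumed definition)."""
    flat = [energy for frame in tf_energy for energy in frame]
    return pvariance(flat)


def choose_post_processing(tf_energy, threshold=1.0):
    """Pick a post-processing label from the global variance.

    The threshold value and both labels are hypothetical placeholders.
    """
    if global_variance(tf_energy) < threshold:
        return "method_a"
    return "method_b"


def encode_with_indication(payload, tf_energy):
    """Bundle the encoded payload with the chosen post-processing indication."""
    return {
        "payload": payload,
        "post_processing": choose_post_processing(tf_energy),
    }
```

A decoder receiving such a bundle would read the `post_processing` field rather than re-deriving the decision from the decoded signal.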
Abstract:
Various embodiments provide a multi-terminal cooperative working method, a terminal device, and a multi-terminal cooperative system. In those embodiments, a primary device, which has permission to manage the cooperative network, can send networking information to a plurality of secondary devices. The primary device can receive networking confirmation information sent by the plurality of secondary devices, and send configuration information to each of the plurality of secondary devices. The configuration information can be used to instruct the secondary devices to switch the permission to manage the cooperative network, based on a preset management permission switching mechanism, when the primary device becomes abnormal or exits from the cooperative network. The primary device can clear a secondary device from the cooperative network when that secondary device becomes abnormal. In this way, robustness of the cooperative network can be improved.
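The permission handling can be sketched in a few lines of Python. The abstract does not say what the preset switching mechanism is, so this sketch assumes the simplest one: the order of the secondary list doubles as the succession order. The class name, method names, and device names are all hypothetical.

```python
class CooperativeNetwork:
    """Toy model of a cooperative network with one managing (primary) device."""

    def __init__(self, primary, secondaries):
        self.primary = primary
        # Assumption: list order is the preset succession order.
        self.secondaries = list(secondaries)

    def report_abnormal(self, device):
        """Handle a device that became abnormal or left the network."""
        if device == self.primary:
            self._switch_permission()
        elif device in self.secondaries:
            # The primary clears an abnormal secondary from the network.
            self.secondaries.remove(device)

    def _switch_permission(self):
        """Hand the management permission to the next secondary, if any."""
        if self.secondaries:
            self.primary = self.secondaries.pop(0)
        else:
            self.primary = None
```

For example, with `CooperativeNetwork("tv", ["phone", "tablet"])`, reporting `"tablet"` as abnormal removes it, and then reporting `"tv"` promotes `"phone"` to primary.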
Abstract:
A voice quality evaluation method, apparatus, and system are disclosed. An obtained voice data packet is parsed, and a frame content characteristic of the data packet is determined according to the parse result; for example, the frame content characteristic indicates whether a frame is a silence frame or a voice frame. Then, a voice sequence is divided into statements according to the determined frame content characteristic, and the statements are divided into multiple frame loss events. After non-voice parameters are extracted according to the frame loss events, the voice quality of each statement is evaluated according to a preset voice quality evaluation model and the non-voice parameters. Finally, the voice quality of the entire voice sequence is evaluated according to the voice quality of each statement. By using this solution, prediction precision can be improved significantly, and accuracy of an evaluation result can be improved.
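The two segmentation steps can be sketched as follows in Python. The segmentation rules are assumptions, since the abstract does not state them: a statement is taken to end after `min_silence` consecutive silence frames, shorter pauses are dropped without splitting, and a frame loss event is a run of consecutive lost frames within a statement. The frame representation `(kind, lost)` is also hypothetical.

```python
def split_statements(frames, min_silence=2):
    """Group voice frames into statements separated by long silences.

    frames: list of (kind, lost) tuples, where kind is "voice" or "silence"
    and lost is True when the frame was lost in transit.
    """
    statements, current, silence_run = [], [], 0
    for frame in frames:
        kind, _lost = frame
        if kind == "silence":
            silence_run += 1
            if silence_run >= min_silence and current:
                statements.append(current)
                current = []
            continue
        silence_run = 0
        current.append(frame)
    if current:
        statements.append(current)
    return statements


def frame_loss_events(statement):
    """Return the length of each run of consecutive lost frames."""
    events, run = [], 0
    for _kind, lost in statement:
        if lost:
            run += 1
        elif run:
            events.append(run)
            run = 0
    if run:
        events.append(run)
    return events
```

The run lengths returned by `frame_loss_events` are one example of a non-voice parameter that a per-statement quality model could consume.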
Abstract:
An abnormal frame detection method and apparatus are disclosed. In an embodiment, the method includes obtaining a signal frame from a speech signal, and dividing the signal frame into at least two subframes; obtaining a local energy value of a subframe of the signal frame; obtaining, according to the local energy value of the subframe, a first characteristic value used to indicate a local energy trend of the signal frame; performing singularity analysis on the signal frame to obtain a second characteristic value; and determining the signal frame as an abnormal frame if the first characteristic value meets a first threshold and the second characteristic value meets a second threshold. In this way, whether distortion occurs in a speech signal can be detected.
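The two-threshold decision can be sketched in Python as below. The abstract does not define either characteristic value, so both are stand-ins: the local energy trend is taken as the largest energy ratio between adjacent subframes, and the singularity analysis is replaced by a much simpler proxy, the largest sample-to-sample jump. The threshold values are hypothetical.

```python
def subframe_energies(frame, n_sub):
    """Sum of squared samples for each of n_sub equal-length subframes."""
    size = len(frame) // n_sub
    return [
        sum(x * x for x in frame[i * size:(i + 1) * size])
        for i in range(n_sub)
    ]


def energy_trend(energies, eps=1e-12):
    """Largest energy rise between adjacent subframes, as a ratio (assumed)."""
    return max(b / (a + eps) for a, b in zip(energies, energies[1:]))


def singularity_score(frame):
    """Stand-in for singularity analysis: largest sample-to-sample jump."""
    return max(abs(b - a) for a, b in zip(frame, frame[1:]))


def is_abnormal(frame, n_sub=4, trend_threshold=10.0, jump_threshold=0.5):
    """Flag the frame only if both characteristic values meet their thresholds."""
    if n_sub < 2:
        raise ValueError("a frame must be divided into at least two subframes")
    energies = subframe_energies(frame, n_sub)
    return (
        energy_trend(energies) >= trend_threshold
        and singularity_score(frame) >= jump_threshold
    )
```

Requiring both conditions means a loud but smooth onset, or a sharp but low-energy blip, does not trigger detection on its own.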
Abstract:
A method for evaluating voice quality includes performing human auditory modeling processing on a voice signal to obtain a first signal; performing variable resolution time-frequency analysis on the first signal to obtain a second signal; and performing, based on the second signal, feature extraction and analysis to obtain a voice quality evaluation result of the voice signal. According to the foregoing technical solutions, the problem of low accuracy in voice quality evaluation can be addressed. A voice quality evaluation result with relatively high accuracy is obtained by performing human auditory modeling processing, converting the to-be-detected signal into a multi-resolution signal, analyzing the resulting variable resolution time-frequency signal, extracting a feature corresponding to the signal, and performing further analysis.
Abstract:
A method, device, and system for signal encoding and decoding are disclosed. The method includes: encoding a core layer signal to obtain a core layer signal code; selecting an enhancement sample point that requires enhancement layer signal encoding according to the core layer signal code and the number of bits that can be used by an enhancement layer; obtaining an enhancement layer signal code of the enhancement sample point; and outputting a bit stream, where the bit stream includes the core layer signal code and the enhancement layer signal code. The enhancement sample point that requires enhancement layer signal encoding is selected according to the number of bits that can be used by the enhancement layer, and the enhancement layer signal of the selected enhancement sample point is encoded and decoded. In this way, the enhancement quality of the core layer can be improved even when sufficient bits are not available for the enhancement layer.
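The selection of enhancement sample points under a bit budget can be sketched in Python. The abstract does not give the selection criterion, so this sketch assumes one: sample points whose core layer code has the largest magnitude are enhanced first, each at a fixed hypothetical cost of `bits_per_point`. Because the rule depends only on the core layer codes and the bit count, a decoder holding the same two inputs could reproduce the selection without extra side information.

```python
def select_enhancement_points(core_codes, available_bits, bits_per_point=1):
    """Return the indices of sample points that get enhancement layer bits.

    core_codes: per-sample core layer codes (integers in this sketch).
    available_bits: bits the enhancement layer may spend.
    The magnitude-first ranking is an assumed criterion; ties go to the
    lower index.
    """
    count = min(len(core_codes), available_bits // bits_per_point)
    ranked = sorted(
        range(len(core_codes)),
        key=lambda i: (-abs(core_codes[i]), i),
    )
    return sorted(ranked[:count])
```

With a budget of two bits and codes `[3, -7, 0, 5, -2]`, only the points at indices 1 and 3 are enhanced; with no bits available, the selection is empty and the bit stream carries the core layer code alone.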
Abstract:
A method for voice call fallback to a circuit switched (CS) domain disclosed in the present invention includes: receiving a Service Request message from a calling user equipment (UE), where the Service Request message includes called number information of a voice call in a CS domain; instructing an evolved NodeB (eNB) to initiate circuit switched fallback (CSFB) handover; receiving a Handover Request message from the eNB, where the Handover Request message includes information required for CS handover; and selecting a mobile switching center (MSC) and sending a packet switched (PS) to CS Handover Request message to the MSC, where the PS to CS Handover Request message carries the information required for the CS handover and a called number, so that the MSC calls a called UE. The corresponding apparatuses and systems are also disclosed. The technical solution of the present invention can reduce the connection delay.