Abstract:
The present disclosure provides techniques for adjusting a temporal gain parameter and for adjusting linear prediction coefficients. A value of the temporal gain parameter may be based on a comparison of a synthesized high-band portion of an audio signal to a high-band portion of the audio signal. If a signal characteristic of an upper frequency range of the high-band portion satisfies a first threshold, the temporal gain parameter may be adjusted. A linear prediction (LP) gain may be determined based on an LP gain operation that uses a first value for an LP order. The LP gain may be associated with an energy level of an LP synthesis filter. The LP order may be reduced if the LP gain satisfies a second threshold.
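The LP-order step can be illustrated with a minimal sketch. It assumes that the LP gain is measured as the energy of the impulse response of the synthesis filter 1/A(z), and that "satisfies a second threshold" means "exceeds"; the abstract fixes neither. The function names, the 64-sample window, and the threshold values are hypothetical.

```python
def lp_synthesis_energy(lpc, order, n_samples=64):
    """Energy of the truncated impulse response of 1/A(z).

    Assumes A(z) = 1 + sum_{k=1..order} lpc[k-1] * z^-k (illustrative convention).
    """
    a = lpc[:order]
    h = []
    for n in range(n_samples):
        x = 1.0 if n == 0 else 0.0  # unit impulse input
        # All-pole recursion: y[n] = x[n] - sum_k a[k] * y[n-1-k]
        y = x - sum(a[k] * h[n - 1 - k] for k in range(min(len(a), n)))
        h.append(y)
    return sum(v * v for v in h)


def choose_lp_order(lpc, first_order, reduced_order, gain_threshold):
    """Return the reduced order when the LP gain exceeds the threshold (assumed rule)."""
    gain = lp_synthesis_energy(lpc, first_order)
    return reduced_order if gain > gain_threshold else first_order
```

This sketch only selects the order. In a real codec the coefficients would probably be recomputed at the lower order rather than truncated.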
Abstract:
A device includes a de-jitter buffer configured to receive a packet, the packet including first data and second data. The first data includes a partial copy of first frame data corresponding to a first frame of a sequence of frames. The second data corresponds to a second frame of the sequence of frames. The device also includes an analyzer configured to, in response to receiving the packet, generate a first frame receive timestamp associated with the first data. The analyzer is also configured to, in response to receiving the packet, generate a second frame receive timestamp associated with the second data. The first frame receive timestamp indicates a first time that is earlier than a second time indicated by the second frame receive timestamp.
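One way to picture the two timestamps is the sketch below. The abstract states only that the first timestamp is earlier than the second. The rule used here, which backdates the partial copy by the frame distance times a 20 ms frame duration, is an assumption for illustration, as are the `BufferEntry` type and the function name.

```python
from dataclasses import dataclass


@dataclass
class BufferEntry:
    frame_index: int
    receive_timestamp_ms: int
    is_partial_copy: bool


def timestamps_for_packet(arrival_ms, primary_index, partial_index, frame_ms=20):
    """Create buffer entries for one packet carrying a partial copy and a primary frame.

    Assumed rule: the partial copy of an earlier frame is stamped as if it had
    arrived (primary_index - partial_index) frame durations before the packet.
    """
    offset = (primary_index - partial_index) * frame_ms
    return [
        BufferEntry(partial_index, arrival_ms - offset, True),   # first data
        BufferEntry(primary_index, arrival_ms, False),           # second data
    ]
```

For example, a packet arriving at 1000 ms with primary frame 5 and a partial copy of frame 3 yields timestamps of 960 ms and 1000 ms under this rule.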
Abstract:
A device includes a first classifier and a second classifier coupled to the first classifier. The first classifier is configured to output first decision data that indicates a classification of an audio frame as a speech frame or a non-speech frame, the first decision data determined based on first probability data associated with a first likelihood of the audio frame being the speech frame and based on second probability data associated with a second likelihood of the audio frame being the non-speech frame. The second classifier is configured to output second decision data based on the first probability data, the second probability data, and the first decision data. The second decision data includes an indication of a selection of a particular encoder of multiple encoders available to encode the audio frame.
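The two-stage arrangement might be sketched as follows. The decision rules are assumptions: the first stage picks the larger probability, and the second stage selects a non-speech encoder only when the first decision is non-speech and the probability gap exceeds a margin. The encoder labels and the `margin` parameter are hypothetical.

```python
def first_classifier(p_speech, p_non_speech):
    """First decision data: label the frame by the larger likelihood (assumed rule)."""
    return "speech" if p_speech >= p_non_speech else "non_speech"


def second_classifier(p_speech, p_non_speech, first_decision, margin=0.2):
    """Second decision data: choose an encoder from both probabilities and the first decision.

    Assumed rule: switch to the non-speech encoder only on a confident
    non-speech decision; otherwise stay with the speech encoder.
    """
    confident = (p_non_speech - p_speech) > margin
    if first_decision == "non_speech" and confident:
        return "non_speech_encoder"
    return "speech_encoder"
```

With these rules, a frame with probabilities (0.45, 0.55) is labeled non-speech by the first stage but is still routed to the speech encoder, because the gap is below the margin.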
Abstract:
A method for determining pitch pulse period signal boundaries by an electronic device is described. The method includes obtaining a signal. The method also includes determining a first averaged curve based on the signal. The method further includes determining at least one first averaged curve peak position based on the first averaged curve and a threshold. The method additionally includes determining pitch pulse period signal boundaries based on the at least one first averaged curve peak position. The method also includes synthesizing a speech signal.
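The boundary-finding steps can be sketched under some assumptions: the averaged curve is a moving average of the absolute signal, a peak is a local maximum above the threshold, and each boundary lies midway between consecutive peaks. None of these choices is specified in the abstract, and the synthesis step is omitted.

```python
def averaged_curve(signal, window=3):
    """Moving average of |signal| over a centered window (assumed smoothing)."""
    half = window // 2
    out = []
    for i in range(len(signal)):
        seg = signal[max(0, i - half):min(len(signal), i + half + 1)]
        out.append(sum(abs(s) for s in seg) / len(seg))
    return out


def peak_positions(curve, threshold):
    """Indices of local maxima of the curve that exceed the threshold."""
    return [
        i for i in range(1, len(curve) - 1)
        if curve[i] > threshold and curve[i] >= curve[i - 1] and curve[i] > curve[i + 1]
    ]


def pulse_boundaries(peaks):
    """Place one boundary midway between each pair of consecutive peaks (assumed rule)."""
    return [(a + b) // 2 for a, b in zip(peaks, peaks[1:])]
```

For a toy signal with pulses centered at samples 3 and 11, the peaks are found at 3 and 11 and a single boundary is placed at sample 7.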
Abstract:
In a particular aspect, an apparatus includes a first network interface. The first network interface is configured to receive a packet via a packet-switched network. The packet may include a primary coding of a first audio frame and a redundant coding of a second audio frame. The apparatus further includes a processor. The processor is configured to generate a modified packet that includes one or more bits that indicate signaling information or packet decoding information. The signaling information or packet decoding information may correspond to decoding of at least one of the primary coding or the redundant coding. The apparatus further includes a second network interface configured to transmit the modified packet via a circuit-switched network.
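A minimal sketch of the packet modification follows, assuming a made-up layout: one flag byte, one byte holding the primary coding length, then the two codings. The flag values, layout, and function name are hypothetical; the abstract does not say where the signaling bits are placed.

```python
DECODE_PRIMARY = 0x01    # hypothetical flag bit: decode the primary coding
DECODE_REDUNDANT = 0x02  # hypothetical flag bit: decode the redundant coding


def add_signaling(primary: bytes, redundant: bytes, use_redundant: bool) -> bytes:
    """Build a modified packet: [flags][len(primary)][primary][redundant]."""
    if len(primary) > 255:
        raise ValueError("primary coding too long for a one-byte length field")
    flags = DECODE_REDUNDANT if use_redundant else DECODE_PRIMARY
    return bytes([flags, len(primary)]) + primary + redundant
```

A gateway between the packet-switched and circuit-switched networks could call `add_signaling` before forwarding, so that the receiver need not infer which coding to use.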
Abstract:
An apparatus includes a network interface configured to receive, via a circuit-switched network, a packet. The packet includes a primary coding of a first audio frame, redundant coding of a second audio frame, and one or more bits that indicate signaling information. The signaling information corresponds to a decode operation of at least one of the primary coding or the redundant coding. The apparatus further includes a decoder configured to decode a portion of the packet based on the signaling information.
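On the receiving side, the decode step might look like the sketch below. It assumes a hypothetical packet layout of one flag byte, one byte holding the primary coding length, then the primary and redundant codings. The layout and flag value are illustrative and not taken from the abstract.

```python
DECODE_REDUNDANT = 0x02  # hypothetical flag bit: decode the redundant coding


def select_coding(packet: bytes) -> bytes:
    """Return the portion of the packet that the signaling bits say to decode.

    Assumed layout: [flags][len(primary)][primary][redundant].
    """
    flags, primary_len = packet[0], packet[1]
    primary = packet[2:2 + primary_len]
    redundant = packet[2 + primary_len:]
    return redundant if flags & DECODE_REDUNDANT else primary
```

The returned bytes would then be handed to the audio decoder; that stage is outside this sketch.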
Abstract:
A device includes a decoder configured to receive an encoded audio signal and to generate a synthesized signal based on the encoded audio signal. The device further includes a classifier configured to classify the synthesized signal based on at least one parameter determined from the encoded audio signal.
Abstract:
A method for mitigating potential frame instability by an electronic device is described. The method includes obtaining a frame subsequent in time to an erased frame. The method also includes determining whether the frame is potentially unstable. The method further includes applying a substitute weighting value to generate a stable frame parameter if the frame is potentially unstable.
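The mitigation can be sketched under stated assumptions: the frame parameter is a vector of line spectral frequencies obtained by weighting the previous and current frames, and "potentially unstable" means the vector is not strictly increasing. The abstract specifies neither; all names and the substitute weight of 0.5 are hypothetical.

```python
def interpolate(prev, curr, w):
    """Weighted combination of previous-frame and current-frame parameters."""
    return [w * c + (1 - w) * p for p, c in zip(prev, curr)]


def is_potentially_unstable(lsf, min_gap=0.0):
    """Assumed test: flag the vector if any adjacent pair is not increasing."""
    return any(b - a <= min_gap for a, b in zip(lsf, lsf[1:]))


def stable_frame_parameter(prev, curr, received_w, substitute_w=0.5):
    """Use the received weight unless the result looks unstable, then substitute."""
    candidate = interpolate(prev, curr, received_w)
    if is_potentially_unstable(candidate):
        return interpolate(prev, curr, substitute_w)
    return candidate
```

After an erased frame, the previous-frame values are themselves estimates, so a received weight that extrapolates strongly can produce a disordered vector. The substitute weight keeps the result between the two frames.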