Abstract:
A method for detecting music in a speech signal having a plurality of frames (120). The method comprises obtaining one or more first pitch correlation candidates from a first frame of the plurality of frames (771); obtaining one or more second pitch correlation candidates from a second from of the plurality of frames (771); selecting a pitch correlation (RP) from the one or more first pitch correlation candidates and one or more second pitch correlation candidates (773); and distinguishing music from background noise based on analyzing the pitch correlation (Rp) (775). The method may comprise filtering the speech signal using a one-order low-pass filter prior to the obtaining the one or more first pitch correlation candidates (920), and down sampling the speech signal by four prior to obtaining the one or more first pitch correlation candidates (940).
Abstract:
A method for detecting music in a speech signal having a plurality of frames. The method comprises defining a music threshold value for a first parameter extracted from a frame of the speech signal, defining a background noise threshold value for the first parameter, and defining an unsure threshold value for the first parameter. The unsure threshold value falls between the music threshold value and the background noise threshold value. If the first parameter falls between the music threshold value and the background noise threshold value, the speech signal is classified as music or background noise based on analyzing a plurality of first parameters extracted from the plurality of frames.
Abstract:
A method is provided for detecting music in a speech signal having a plurality of frames. The method comprises obtaining one or more first pitch correlation candidates from a first frame of the plurality of frames; obtaining one or more second pitch correlation candidates from a second frame of the plurality of frames; selecting a pitch correlation (Rp) from the one or more first pitch correlation candidates and the one or more second pitch correlation candidates; and distinguishing music from background noise based on analyzing the pitch correlation (Rp). The method may further comprise filtering the speech signal using a one-order low-pass filter prior to the obtaining the one or more first pitch correlation candidates, and down sampling the speech signal by four prior to the obtaining the one or more first pitch correlation candidates.
Abstract:
There is provided a system and method for providing an integrated environment for execution monitoring and profiling of applications running on multi- processor system-on-chips. There is provided a method comprising obtaining task execution data of an application, the task execution data including a plurality of task executions assigned to a plurality of hardware resources, showing a scheduler view of the plurality of task executions on a display, receiving a modification request for a selected task execution from the plurality of task executions, reassigning the plurality of task executions to the plurality of hardware resources based on implementing the modification request, and updating the scheduler view on the display. As a result, the high level results of specific low level optimizations may be tested and retried to discover which optimization routes provide the greatest benefits.
Abstract:
Various amplifier configurations having increased bandwidth, linearity, dynamic range, and less distortion are shown and disclosed. To increase bandwidth in a transimpedance amplifier, a replica circuit is created to replicate a degeneration resistance, or the resistance or value that relates to a feedback resistance. From the replica circuit, the replicated values are mirrored and processed to control a FET switch which modifies a degeneration resistance. The FET switch control signal is related to the feedback resistance and modifies the degeneration resistance to thereby maintain the product of the feedback resistance and the degeneration resistance as a constant. In another embodiment, a second switch controlled by an automatic gain control signal is established between a first stage amplifier and a second stage amplifier to improve dynamic range and bandwidth without degrading other amplifier specifications.
Abstract:
An optical signal generator is configured with an associated control system and driver configured to reduce speckle. Speckle reduction occurs by pulsing the drive signal between a first current level and a second current level. These pulses force the optical signal generator to introduce oscillations into the optical signal. The coherence of the emitted light is reduced during the period of oscillations in the optical signal, which reduces speckle. In one embodiment, the pulsing of the drive signal brings the drive signal down to a level near or below threshold, which in turn intermittently turns off the optical signal output. Returning the optical signal to a desired optical output intensity introduces the speckle reducing oscillation. The pulse frequency, and duty cycle is controlled by a duty cycle control signal to modulate overall optical power and adjust amount of despeckle.
Abstract:
There is provided a method of reducing effect of noise producing artifacts in a speech signal. The method comprises obtaining (310) a plurality of incoming samples of a speech subframe; summing (310) an energy level for each of the plurality of incoming samples to generate a total input level; comparing (320) the total input level with a predetermined threshold; setting (340) a gain value as a function of the total input level, where the gain value is between zero (0) and one (1), and where the function results in a lower gain value when the total input level is indicative of a silence area than when the total input level is indicative of a non-silence area; and multiplying (350) the plurality of incoming samples of the speech subframe by the gain value.
Abstract:
There is provided a method for use by a speech encoder to encode an input speech signal. The method comprises receiving the input speech signal; determining whether the input speech signal includes an active speech signal or an inactive speech signal; low-pass filtering the inactive speech signal to generate a narrowband inactive speech signal; high-pass filtering the inactive speech signal to generate a high-band inactive speech signal; encoding the narrowband inactive speech signal using a narrowband inactive speech encoder to generate an encoded narrowband inactive speech; generating a low-to-high auxiliary signal by the narrowband inactive speech encoder based on the narrowband inactive speech signal; encoding the high-band inactive speech signal using a wideband inactive speech encoder to generate an encoded wideband inactive speech based on the low-to-high auxiliary signal from the narrowband inactive speech encoder; and transmitting the encoded narrowband inactive speech and the encoded wideband inactive speech.
Abstract:
There is provided a method of adjusting power consumption (125) by a wireless device having a codec (120) and a battery (125), where the codec has a plurality of states of operation and the battery has a plurality of power levels. The method comprises operating the wireless device at a first power level of the plurality of power levels and a first state of the plurality of states of operation, while in communication with a remote device; receiving a power level indication indicative of a second power level of the plurality of power levels; selecting a second state of the plurality of states of operation for the codec corresponding to the second power level of the plurality of power levels; and modifying the codec to operate at the second state of the plurality of states of operation.
Abstract:
An approach for improving quality of speech synthesized using analysis-by-synthesis (ABS ) coders is presented. An unstable perceptual quality in analysis-by-synthesis type speech coding (e.g. CELP) may occur because the periodicity degree in a voiced speech signal may vary significantly for different segments of the voiced speech. Thus the present invention uses a voicing index, which may indicate the periodicity degree of the speech signal, to control and improve ABS type speech coding. The voicing index may be used to improve the quality stability by controlling encoder and/or decoder in: fixed-codebook (301) short-term enhancement including the spectrum tilt; perceptual weighting filter; sub-fixed codebook determination; LPC interpolation (304); fixed-codebook pitch enhancement; post-pitch enhancement; noise injection into the high-frequency band at decoder; LTP sync window; signal decomposition, etc.