摘要:
Embodiments of the present invention disclose a call method of a terminal and a terminal using the call method, relate to the field of communications technologies, and are invented for adjusting voice quality of a call in real time. The method includes: analyzing a spectral component of a voice signal during a call; and selecting a corresponding frequency response channel according to an analysis result of the spectral component of the voice signal. The present invention is mainly applied in the field of call services.
摘要:
Systems and methods are provided for managing and analyzing multi-party dialogs (e.g., call) between communication devices. A digital connection is established with each of a plurality of communication devices. The connection is switched between the communication devices from a POTS connection to digital connections, enabling the communication devices to communicate with each other via the computing device over the digital connections. Audio signals are part of a multi-party dialog between users of the plurality of communication devices. The received audio signals are split into corresponding first signals and second signals. The first signals are transmitted to the plurality of communication devices and are analyzed to produce measurements of features of the second signals. Feedback data is transmitted to at least one of the plurality of communication devices.
摘要:
A parameter determination device comprises: a spectral envelope estimating portion 42 performing estimation of a spectral envelope using a parameter · 0 specified in a predetermined method, regarding the · 0 -th power of absolute values of a frequency domain sample sequence corresponding to a time-series signal as a power spectrum on the assumption that the parameter · 0 and a parameter · are positive numbers; a whitened spectral sequence generating portion 43 obtaining a whitened spectral sequence which is a sequence obtained by dividing the frequency domain sample sequence by the spectral envelope; and a parameter acquiring portion 44 determining such a parameter · that generalized Gaussian distribution with the parameter · as a shape parameter approximates a histogram of the whitened spectral sequence.
摘要:
A device (200) and method for calculating scattering features for audio signal recognition. An interface (240) receives an audio signal that is processed (S610) by a processor (210) to obtain an audio frame. The processor (210) calculates (S620) a first order scattering features from at least one audio frame and then calculates (S630) for the first order scattering features an estimation of whether the first order scattering features comprises sufficient information for accurate audio signal recognition. The processor (240) calculates (S650) a second order scattering features from the first order scattering features only in case the first order scattering features does not comprise sufficient information for accurate audio signal recognition. As second order features are calculated only when it is deemed necessary, less processing power can be used by the device, which can lead to less power used by the device.
摘要:
Systems and methods for adjusting pitch of an audio signal include detecting input notes in the audio signal, mapping the input notes to corresponding output notes, each output note having an associated upper note boundary and lower note boundary, and modifying at least one of the upper note boundary and the lower note boundary of at least one output note in response to previously received input notes. Pitch of the input notes may be shifted to match an associated pitch of corresponding output notes. Delay of the pitch shifting process may be dynamically adjusted based on detected stability of the input notes.
摘要:
An audio encoding method and an apparatus are provided. The method includes: determining sparseness of distribution, on spectrums, of energy of N input audio frames (101), where the N audio frames include a current audio frame, and N is a positive integer; and determining, according to the sparseness of distribution, on the spectrums, of the energy of the N audio frames, whether to use a first encoding method or a second encoding method to encode the current audio frame (102), where the first encoding method is an encoding method that is based on time-frequency transform and transform coefficient quantization and that is not based on linear prediction, and the second encoding method is a linear-predication-based encoding method. According to the method, when an audio frame is encoded, sparseness of distribution, on a spectrum, of energy of the audio frame is considered, which can reduce encoding complexity and ensure that encoding is of relatively high accuracy.
摘要:
An audio perception system is described, comprising a capture module configured to capture acoustic speech signal information; a feature extraction module configured to extract features that identify a candidate unvoiced portion in an acoustic signal; a classification module configured to identify if the acoustic signal is or contains an unvoiced portion based on the extracted features; and a control module configured to generate a control signal to a sensory stimulation actuator for generating an aero-tactile stimulation to be delivered to a listener, the control signal based at least in part on a signal representing the identified unvoiced portion. Related methods are also described.
摘要:
Provided are methods of decoding speech from the brain of a subject. The methods include detecting speech production signals from electrodes operably coupled to the speech motor cortex of a subject while the subject produces or imagines producing a speech sound. The methods further include deriving a speech production signal pattern from the detected speech production signals, and correlating the speech production signal pattern with a reference speech production signal pattern to decode speech from the brain of the subject. Speech communication systems and devices for practicing the subject methods are also provided.
摘要:
According to a preferred aspect of the instant invention, there is provided a system and method that allows the user to attenuate ambient noise in speech recordings in the audio part of a video recording. The user does not need to define particular sections or samples or individual parameters. The system is automatically analyzing the input signal and in a plurality of individual steps detects the ambient noise, determines an adaptive filter, implements the filter and therewith attenuates the ambient noise accordingly.