Abstract:
Systems and methods are provided for performing soft alignment in Gaussian mixture model (GMM) based and other vector transformations. Soft alignment may assign alignment probabilities to source and target feature vector pairs. The vector pairs and associated probabilities may then be used calculate a conversion function, for example, by computing GMM training parameters from the joint vectors and alignment probabilities to create a voice conversion function for converting speech sounds from a source speaker to a target speaker.
Abstract:
An apparatus for providing data clustering and mode selection includes a training element and a transformation element. The training element is configured to receive a first training data set, a second training data set and auxiliary data extracted from the same material as the first training data set. The training element is also configured to train a classifier to group the first training data set into M clusters based on the auxiliary data and the first training data set and train M processing schemes corresponding to the M clusters for transforming the first training data set into the second training data set. The transformation element is in communication with the training element and is configured to cluster the second training data set into M clusters based on features associated with the second training data set.
Abstract:
A hybrid approach is described for combining frequency warping and Gaussian Mixture Modeling (GMM) to achieve better speaker identity and speech quality. To train the voice conversion GMM model, line spectral frequency and other features are extracted from a set of source sounds to generate a source feature vector and from a set of target sounds to generate a target feature vector. The GMM model is estimated based on the aligned source feature vector and the target feature vector. A mixture specific warping function is generated each set of mixture mean pairs of the GMM model, and a warping function is generated based on a weighting of each of the mixture specific warping functions. The warping function can be used to convert sounds received from a source speaker to approximate speech of a target speaker.
Abstract:
An improved system method for enabling and implementing codebook-based voice conversion that both significantly reduces the memory footprint and improves the continuity of the output. In various embodiments, the paired source-target codebook is implemented as a multi-stage vector quantizer. During the conversion, N best candidates in a tree search are taken as the output from the quantizer. The N candidates for each vector to be converted are used in a dynamic programming-based approach that finds a smooth but accurate output sequence.
Abstract:
An apparatus for providing efficient evaluation of feature transformation includes a training module and a transformation module. The training module is configured to train a Gaussian mixture model (GMM) using training source data and training target data. The transformation module is in communication with the training module. The transformation module is configured to produce a conversion function in response to the training of the GMM. The training module is further configured to determine a quality of the conversion function prior to use of the conversion function by calculating a trace measurement of the GMM.
Abstract:
During national crisis as virus pandemics, bio-terrorism, chemical pollution, a response is to lock down all the economy, with several % GDP loss, or to convince population to wear, reasonably, appropriate individual anti-virus and bacterial protection accessories, made of a full head and shoulders protection mask, equipped with air ventilation and processing unit, and gloves, and body protection foil, for single or multiple use, and get minimal economic losses. The air processing unit may be developed a modular structure, starting from a simple filtered fan, with air suction on top of the head, flowing air over the face, for low probability of contamination environments, upgraded to a more complex unit including heat pipe, catalytic organic matter reduction, oxygen generator and closed circuit breathing unit, that may be integrated into a full body personal protection equipment by extending the functions of temperature, humidity and pressure control to full body, for hazardous environments.
Abstract:
An apparatus for providing text independent voice conversion may include a first voice conversion model and a second voice conversion model. The first voice conversion model may be trained with respect to conversion of training source speech to synthetic speech corresponding to the training source speech. The second voice conversion model may be trained with respect to conversion to training target speech from synthetic speech corresponding to the training target speech. An output of the first voice conversion model may be communicated to the second voice conversion model to process source speech input into the first voice conversion model into target speech corresponding to the source speech as the output of the second voice conversion model.
Abstract:
An apparatus is provided that includes a converter for training a voice conversion model for converting source encoding parameters characterizing a source speech signal associated with a source voice into corresponding target encoding parameters characterizing a target speech signal associated with a target voice. To reduce the affect of noise on the voice conversion model, the converter may be configured for receiving sequences of source and target encoding parameters, and train the model without one or more frames of the source and target speech signals that have energies less than a threshold energy. After conversion of the respective parameters, then, the converter, a decoder or another component may be configured for reducing the energy of one or more frames of the target speech signal that have an energy less than the threshold energy, where the threshold value may be adaptable based upon models of speech frames and non-speech frames.
Abstract:
A method and device used to improve the operation of a hydrocephalus shunt system based on the use of alpha and beta radioactive isotopes implanted in the critical zones of the shunt in order to prevent the deposition of organic matter such as blood cells, tissue, or bacteria, thereby clogging the system and causing malfunction.
Abstract:
A method and system to increase operator awareness for process control application providing operators team real time information on controlled process and equipment immersing them into augmented and virtual reality, provided by a computing system which collects and integrates data from process and equipment, leaving equipment, controls, and procedure unmodified. The method is creating an augmented and virtual reality for the operating crew, giving them real time supplementary information based on two types of simulation, one using models and one using process data and learning procedures, together with rich equipment data, equipment locator, enhanced communication and enhanced data acquisition from supplementary sources as surface and airborne robotic and operator headset additional equipment meant to monitor operator bio-parameters and operator's surrounding environment in order to assure that control is sound and sober, and operators are safe and secure all time providing an optimum high efficiency operation of the controlled process.