摘要:
Provided are an apparatus and method for reporting speech recognition failures. The method includes detecting pure speech data from input speech data and outputting the detected pure speech data; checking at least one speech recognition failure for the pure speech data; and ascertaining speech recognition failure reasons from a check-result for the speech recognition failures and outputting the ascertained speech recognition failure reasons.
摘要:
A face recognition system based on adaptive learning includes a specific person detection and tracking unit for detecting and tracking a specific person from a moving image. A facial feature extraction unit extracts a plurality of facial feature vectors from the detected and tracked specific person. A face recognition unit searches for a given registration model by comparing the extracted facial feature vectors with facial feature vectors of the registration models previously stored in a user registration model database. A learning target selection unit selects a facial feature vector to be added to a record of the given registration model from among the extracted facial feature vectors. A registration model learning unit adds and updates the selected facial feature vector to the record of the given registration model.
摘要:
Provided is a system and method for controlling voice detection of a network terminal. The system includes the network terminal for, if detection of a voice signal is requested, detecting voice by receiving and setting a voice detection setting value corresponding to a predetermined service and generating a trigger signal for the voice detection according to the voice detection setting value corresponding to the service; and a server for determining the service of the network terminal and transmitting the voice detection setting value corresponding to the service to the network terminal. Accordingly, by controlling to commence voice detection according to a service, voice detection optimized to a relevant service can commence.
摘要:
A face recognition system based on adaptive learning includes a specific person detection and tracking unit for detecting and tracking a specific person from a moving image. A facial feature extraction unit extracts a plurality of facial feature vectors from the detected and tracked specific person. A face recognition unit searches for a given registration model by comparing the extracted facial feature vectors with facial feature vectors of the registration models previously stored in a user registration model database. A learning target selection unit selects a facial feature vector to be added to a record of the given registration model from among the extracted facial feature vectors. A registration model learning unit adds and updates the selected facial feature vector to the record of the given registration model.
摘要:
A auto-recording method is disclosed for auto-recording further to user request, via generating user image and voice data, extracting feature points from the image data according to pre-defined user recognition and following by considering the user as an object of following according to extracted feature points, determining whether the image and voice data satisfy a recording reference needed to perform recording. If determined that the image and voice data satisfy the recording reference, editing the image and voice data in a pre-set edit form and generating and storing at least one of recording image and recording voice data.
摘要:
Provided is a system and method for controlling voice detection of a network terminal. The system includes the network terminal for, if detection of a voice signal is requested, detecting voice by receiving and setting a voice detection setting value corresponding to a predetermined service and generating a trigger signal for the voice detection according to the voice detection setting value corresponding to the service; and a server for determining the service of the network terminal and transmitting the voice detection setting value corresponding to the service to the network terminal. Accordingly, by controlling to commence voice detection according to a service, voice detection optimized to a relevant service can commence.
摘要:
A method and a system for segmenting phonemes from voice signals. A method for accurately segmenting phonemes, in which a histogram showing a peak distribution corresponding to an order is formed by using a high order concept, and a boundary indicating a starting point and an ending point of each phoneme is determined by calculating a peak statistic based on the histogram. The phoneme segmentation method can remarkably reduce an amount of calculation, and has an advantage of being applied to sound signal systems which perform sound coding, sound recognition, sound synthesizing, sound reinforcement, etc.
摘要:
A system and method for sound source separation. The system and method use a beamforming technique. The sound source separation system includes a windowing processor; a DFT transformer; a transfer function estimator; and a noise estimator. The system also includes a voice signal extractor that cancels individual voice signals, except an individual voice signal that is desired to be extracted among individual voice signals, from the integrated voice signals. The system further includes a voice signal detector that cancels a noise part provided through the noise estimator from a transfer function of an individual voice signal which is desired to be detected and extracts a noise-canceled individual voice signal. Even when two or more sound sources are simultaneously input, the sound sources can be separated from each other and separately stored and managed, or an initial sound source can be stored and managed.
摘要:
A method is provided for creating a panorama. The method includes photographing a plurality of images having same backgrounds and different forms of a subject, determining a size and a position of a reference region for creating a panorama using the images, extracting a target region within the reference region from each of the images, detecting same portions in adjacent target regions, and creating a panorama by combining the adjacent target regions on the basis of the same portions.
摘要:
Disclosed is a method and an apparatus for estimating noise included in a sound signal during sound signal processing. The method includes estimating harmonics components in a frame of an input sound signal; using the estimated harmonics components, computing a Voice Presence Probability (VPP) on the frame of the input sound signal; determining a weight of an equation necessary to estimate a noise spectrum, depending on the computed VPP; and using the determined weight and the equation necessary to estimate a noise spectrum, estimating the noise spectrum, and updating the noise spectrum.