Abstract:
A method is provided for creating a panorama. The method includes photographing a plurality of images having same backgrounds and different forms of a subject, determining a size and a position of a reference region for creating a panorama using the images, extracting a target region within the reference region from each of the images, detecting same portions in adjacent target regions, and creating a panorama by combining the adjacent target regions on the basis of the same portions.
Abstract:
Disclosed is a method and an apparatus for estimating noise included in a sound signal during sound signal processing. The method includes estimating harmonics components in a frame of an input sound signal; using the estimated harmonics components, computing a Voice Presence Probability (VPP) on the frame of the input sound signal; determining a weight of an equation necessary to estimate a noise spectrum, depending on the computed VPP; and using the determined weight and the equation necessary to estimate a noise spectrum, estimating the noise spectrum, and updating the noise spectrum.
Abstract:
A face recognition system based on adaptive learning includes a specific person detection and tracking unit for detecting and tracking a specific person from a moving image. A facial feature extraction unit extracts a plurality of facial feature vectors from the detected and tracked specific person. A face recognition unit searches for a given registration model by comparing the extracted facial feature vectors with facial feature vectors of the registration models previously stored in a user registration model database. A learning target selection unit selects a facial feature vector to be added to a record of the given registration model from among the extracted facial feature vectors. A registration model learning unit adds and updates the selected facial feature vector to the record of the given registration model.
Abstract:
Disclosed is a method capable of adaptively aligning windows to extract features according to the types and characteristics of voice signals. To this end, window lengths based on the window update points in a corresponding order are determined by employing the concept of a higher order peak, and windows are aligned according to window lengths. When the windows are aligned according to such a manner, the start and end points of each window is known, so that it becomes possible to easily extract and analyze peak feature information.
Abstract:
Disclosed is an apparatus and method for processing signals such as sound signals. The sound processing apparatus includes a sound signal input unit for receiving sound signals, a harmonic noise separator for separating a harmonic region and a noise region from the received sound signals, a noise restraint index determination unit for determining an optimal noise restraint index k according to a system and circumstance, and a noise restrainer for restraining the separated noise region depending on the noise restraint index k so as to output noise attenuated signals.
Abstract:
Disclosed is a method for making an emergency call by using a mobile communication terminal. The method includes identifying a current position when a user makes an emergency call request; retrieving an emergency call number from emergency call numbers pre-stored for respective regions, the emergency call number being used in a region corresponding to the current position; and dialing the emergency call number.
Abstract:
A method for producing motion effects of a character capable of interacting with a background image in accordance with the characteristics of the background image is provided, including extracting the characteristics of the background image; determining a character to be provided with the motion effects in the background in accordance with the extracted characteristics of the background image; recognizing external signals including a user input; determining the motion of the character in accordance with the characteristics of the background image and the recognized external signals; and reproducing an animation for executing the motion of the character in the background image.
Abstract:
An apparatus and method for human activity and facial expression modeling and recognition are based on feature extraction techniques from time sequential images. The human activity modeling includes determining principal components of depth and/or binary shape images of human activities extracted from video clips. Independent Component Analysis (ICA) representations are determined based on the principal components. Features are determined through Linear Discriminant Analysis (LDA) based on the ICA representations. A codebook is determined using vector quantization. Observation symbol sequences in the video clips are determined. And human activities are learned using the Hidden Markov Model (HMM) based on status transition and an observation matrix.
Abstract:
Provided is a speech signal classification system and method. The speech signal classification system includes a primary recognition unit for determining using characteristics extracted from a speech frame whether the speech frame is a voice sound, a non-voice sound, or background noise and a secondary recognition unit for determining using at least one other speech frame whether a determination-reserved speech frame is an non-voice sound or background noise, if it is determined according to a primary recognition result that an input speech frame is not a voice sound. The system reserves a determination of the input speech frame, stores characteristics of at least one other speech frame to determine the determination-reserved speech frame, calculates secondary statistical values from characteristics of the determination-reserved speech frame and the stored characteristics of the other speech frames, and determines using the calculated secondary statistical values whether the determination-reserved speech frame is an non-voice sound or background noise. Accordingly, if an input speech frame is not a voice sound, the input speech frame can be more accurately classified and output as an non-voice sound or background noise, and thus errors, which may be generated in determination of a signal corresponding to an non-voice sound, can be reduced.