Abstract:
An apparatus and method for detecting the position of a human face in an input still image or video image are provided. The apparatus includes an eye position detecting means, a face position determining means, and an extraction position stabilizing means. The eye position detecting means detects pixels having a strong gray characteristic in an input red, green, and blue (RGB) image and, among the areas formed by the detected pixels, determines those having locality and texture characteristics to be eye candidate areas. The face position determining means creates search templates by matching a model template to two areas extracted from the eye candidate areas, and determines an optimum search template among the created search templates by using a value obtained by normalizing the sum of a probability distance for the chromaticity of pixels within the area of a search template and the horizontal edge sizes calculated at the positions of the left and right eyes, the mouth, and the nose estimated by the search template. The extraction position stabilizing means forms a minimum boundary rectangle from the optimum search template and, among the count values of individual pixels stored in a shape memory, increases the count values corresponding to the minimum boundary rectangle area and reduces the count values corresponding to the area outside it, and then outputs the area in which count values above a predetermined value are positioned as the eye and face areas. The apparatus is capable of accurately and quickly detecting a speaking person's eyes and face in an image, and is tolerant of image noise.
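The count-based stabilization described above can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the increment and decrement sizes, the saturation limit, and the threshold are all assumptions chosen for illustration, and the shape memory is modeled as a plain 2D list of per-pixel counts.

```python
def update_shape_memory(counts, mbr, inc=3, dec=1, max_count=255):
    """Raise counts inside the minimum boundary rectangle, lower them elsewhere.

    counts: 2D list of per-pixel count values (the "shape memory").
    mbr: (top, left, bottom, right), with bottom and right exclusive.
    inc, dec, max_count: assumed values, not taken from the abstract.
    """
    top, left, bottom, right = mbr
    for y, row in enumerate(counts):
        for x, value in enumerate(row):
            inside = top <= y < bottom and left <= x < right
            row[x] = min(value + inc, max_count) if inside else max(value - dec, 0)
    return counts


def stable_area(counts, threshold):
    """Mark the pixels whose count is above the predetermined threshold."""
    return [[value > threshold for value in row] for row in counts]


# Three consistent detections followed by one spurious detection elsewhere.
memory = [[0] * 4 for _ in range(4)]
for _ in range(3):
    update_shape_memory(memory, (1, 1, 3, 3))
update_shape_memory(memory, (0, 0, 1, 1))  # noisy frame
mask = stable_area(memory, threshold=6)
```

In this sketch the single noisy frame lowers the counts in the true face area only slightly and does not lift the spurious area above the threshold, which is one way to read the abstract's claim of tolerance to image noise.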
Abstract:
An image segmenting apparatus and method are provided. The image segmenting apparatus includes an initial image segmenting unit, a region structurizing unit, and a redundant region combiner. The initial image segmenting unit converts color signals of an input image into a color space which is based on predetermined signals, and segments the input image into a plurality of regions according to the positions of the color pixels of the input image in the color space. The region structurizing unit classifies the plurality of regions into layers according to the horizontal, adjacent relation and the hierarchical, inclusive relation between the regions, and groups adjacent regions into region groups in each layer, so as to derive a hierarchical, inclusive relation between the region groups. The redundant region combiner determines the order in which adjacent regions are combined according to the horizontal, adjacent relation between regions and the hierarchical, inclusive relation between region groups. The redundant region combiner also determines whether to combine adjacent regions according to the determined combination order, and combines adjacent regions if they are determined to be substantially the same. Even if regions appear to be adjacent to each other in a region adjacency graph (RAG), a structural inclusive relation between the regions can be derived by excluding the combination of the regions or rearranging their combination order according to a hierarchical structure. Subsequently, the mutual relation between two regions can be inferred from the inclusive relation even if the color signals of the two regions, for example, a region in a highlighted area and a region in its surrounding area, are not similar to each other.
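The idea of ordering and excluding combinations of adjacent regions can be sketched roughly as follows. This is a simplified illustration under strong assumptions, not the method of the abstract: each region is given a hypothetical mean color and a hypothetical region-group label, adjacent pairs in different groups are excluded from combination, the remaining pairs are visited in order of increasing color distance, and "substantially the same" is approximated by a Euclidean distance threshold. Mean colors are not updated after a merge.

```python
import math


def combine_regions(colors, groups, edges, threshold=30.0):
    """Combine adjacent regions, skipping pairs that the hierarchy separates.

    colors: region id -> mean color tuple (hypothetical representation).
    groups: region id -> region-group label from the structurizing step.
    edges: list of (region, region) pairs adjacent in the RAG.
    Returns a mapping from region id to the id of its combined region.
    """
    parent = {region: region for region in colors}

    def find(region):
        # Union-find lookup with path halving.
        while parent[region] != region:
            parent[region] = parent[parent[region]]
            region = parent[region]
        return region

    def distance(a, b):
        return math.dist(colors[a], colors[b])

    # Assumed ordering rule: most similar adjacent pairs are considered first.
    for a, b in sorted(edges, key=lambda edge: distance(*edge)):
        if groups[a] != groups[b]:
            continue  # excluded: adjacent in the RAG but in different groups
        if distance(a, b) <= threshold:
            parent[find(a)] = find(b)

    return {region: find(region) for region in colors}


labels = combine_regions(
    colors={1: (0, 0, 0), 2: (5, 0, 0), 3: (6, 0, 0)},
    groups={1: "A", 2: "A", 3: "B"},
    edges=[(1, 2), (2, 3)],
)
```

Here regions 2 and 3 are the closest in color, yet they stay separate because they belong to different groups, while regions 1 and 2 are combined.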
Abstract:
Provided are a method and apparatus for automatically completing a text input using speech recognition. The method includes: receiving a first part of a text from a user through a text input device; recognizing speech of the user that corresponds to the text; and completing the remaining part of the text based on the first part of the text and the recognized speech. The accuracy of text input and the convenience of speech recognition are thereby both ensured, and the part of the text not yet entered can be entered easily and quickly based on the entered part and the recognized speech.
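A minimal sketch of the completion step follows. It assumes, purely for illustration, that the speech recognizer returns a list of scored text hypotheses; the function name `complete_text` and the scores are hypothetical. The typed first part is used to filter the hypotheses, and the best-scoring survivor supplies the remaining part.

```python
def complete_text(typed_prefix, speech_hypotheses):
    """Complete typed text using recognized speech.

    typed_prefix: the first part of the text, entered on the input device.
    speech_hypotheses: list of (text, score) pairs, higher score is better
        (an assumed recognizer output format).
    Returns the completed text, or None if no hypothesis fits the prefix.
    """
    prefix = typed_prefix.lower()
    matches = [
        (text, score)
        for text, score in speech_hypotheses
        if text.lower().startswith(prefix)
    ]
    if not matches:
        return None
    best_text = max(matches, key=lambda match: match[1])[0]
    # Keep what the user typed and append only the remaining part.
    return typed_prefix + best_text[len(typed_prefix):]


hypotheses = [("wreck a nice beach", 0.6), ("recognize speech", 0.5)]
completed = complete_text("rec", hypotheses)
```

The typed prefix rules out the higher-scoring but wrong hypothesis, which shows how the two inputs can compensate for each other.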
Abstract:
Provided are a method and apparatus for providing a mobile voice web service in a mobile terminal. The method includes analyzing a web history of a user from the user's web search logs, generating a voice access list based on the analysis results, and performing voice recognition by dynamically generating a voice recognition syntax according to the generated voice access list. Because the syntax required for voice recognition is limited to one suited to the user's web context, efficient voice recognition can be performed on the terminal rather than on a server.
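The two steps above can be sketched as follows. The example assumes that the web search log is a list of query strings, that the voice access list is simply the most frequent queries, and that the recognition syntax is a toy one-rule grammar string. The function names and the grammar notation are hypothetical and do not correspond to any particular recognizer.

```python
from collections import Counter


def build_voice_access_list(search_log, top_n=3):
    """Pick the user's most frequent queries as the voice access list."""
    counts = Counter(
        query.strip().lower() for query in search_log if query.strip()
    )
    return [query for query, _ in counts.most_common(top_n)]


def generate_syntax(access_list):
    """Build a small recognition grammar limited to the access list.

    The output format is a made-up notation for illustration only.
    """
    return "<command> = ( " + " | ".join(access_list) + " )"


log = ["weather", "news", "Weather", "stock prices", "news", "weather"]
access_list = build_voice_access_list(log, top_n=2)
syntax = generate_syntax(access_list)
```

Restricting the grammar to a short, user-specific list keeps the search space small, which is the property the abstract relies on for recognition on the terminal.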