Apparatus and method for detecting speaking person's eyes and face
    1.
    发明授权
    Apparatus and method for detecting speaking person's eyes and face 有权
    用于检测说话人的眼睛和脸部的装置和方法

    公开(公告)号:US06611613B1

    公开(公告)日:2003-08-26

    申请号:US09607279

    申请日:2000-06-30

    IPC分类号: G06K900

    CPC分类号: G06K9/00228

    摘要: An apparatus for detecting the position of a human face in an input image or video image and a method thereof are provided. The apparatus includes an eye position detecting means for detecting pixels having a strong gray characteristic to determine areas having locality and texture characteristics as eye candidate areas among areas formed by the detected pixels, in an input red, blue, and green (RGB) image, a face position determining means for creating search templates by matching a model template to two areas extracted from the eye candidate areas, and determining an optimum search template among the created search templates by using the value normalizing the sum of a probability distance for the chromaticity of pixels within the area of a search template, and horizontal edge sizes calculated in the positions of the left and right eyes, a mouth and a nose estimated by the search template, and an extraction position stabilizing means for forming a minimum boundary rectangle by the optimum search template, and increasing count values corresponding to the minimum boundary rectangle area and reducing count values corresponding to an area other than the minimum boundary rectangle area, among count values of individual pixels, stored in a shape memory, to output the area in which count values above a predetermined value are positioned, as eye and face areas. The apparatus is capable of accurately and quickly detecting a speaking person's eyes and face in an image, and is tolerant of image noise.

    摘要翻译: 提供一种用于检测输入图像或视频图像中的人脸的位置的装置及其方法。 该装置包括:眼睛位置检测装置,用于在输入的红色,蓝色和绿色(RGB)图像中检测具有强灰色特性的像素,以确定具有由检测到的像素形成的区域之中的具有局部性和纹理特征的区域作为候选眼睛区域; 面部位置确定装置,用于通过将模型模板与从眼睛候选区域提取的两个区域相匹配来创建搜索模板,以及通过使用归一化用于色彩的色度的概率距离之和的值来确定所创建的搜索模板中的最佳搜索模板 搜索模板区域内的像素,以及在由搜索模板估计的左眼和右眼,口和鼻的位置中计算的水平边缘大小,以及提取位置稳定装置,用于将最小边界矩形形成最佳 搜索模板,以及增加对应于最小边界矩形区域的计数值,并减少计数值va 存储在形状存储器中的各个像素的计数值之中的与最小边界矩形区域以外的区域相对应的图案,以输出高于预定值的计数值的区域作为眼睛和脸部区域。 该装置能够准确,快速地检测说话人的眼睛和脸部的图像,并且容忍图像噪声。

    Image segmenting apparatus and method
    2.
    发明授权
    Image segmenting apparatus and method 失效
    图像分割装置及方法

    公开(公告)号:US06606408B1

    公开(公告)日:2003-08-12

    申请号:US09562029

    申请日:2000-05-01

    IPC分类号: G06K900

    摘要: An image segmenting apparatus and method is provided. The image segmenting apparatus includes an initial image segmenting unit, a region structurizing unit and a redundant region combiner. The initial image segmenting unit converts color signals of an input image into a color space which is based on predetermined signals, and segments the input image into a plurality of regions according to positions of color pixels of the input image in the color space. The region structurizing unit classifies the plurality of regions into layers according to horizontal, adjacent relation and hierarchical, inclusive relation between the regions, and groups adjacent regions into region groups in each layer, so as to derive a hierarchical, inclusive relation between the region groups. The redundant region combiner determines the order in which adjacent regions are combined according to the horizontal, adjacent relation between regions and the hierarchical, inclusive relation between region groups. The redundant region combiner also determines whether to combine adjacent regions according to the determined combination order, and combines adjacent regions if the adjacent regions are determined to be substantially the same. Even if regions appears to be adjacent each other in a region adjacent graph (RAG), a structural inclusive relation between regions can be derived by excluding the combination of the regions or rearranging their combination order according to a hierarchical structure. Subsequently, the mutual relation between two regions can be inferred from the inclusive relation even if the color signals of the two regions, for example, a region in a highlighted area and a region in its surrounding area, are not similar to each other.

    摘要翻译: 提供了一种图像分割装置和方法。 图像分割装置包括初始图像分割单元,区域结构化单元和冗余区域组合器。 初始图像分割单元将输入图像的颜色信号转换为基于预定信号的颜色空间,并根据输入图像在颜色空间中的颜色像素的位置将输入图像分割成多个区域。 区域结构化单元根据水平,相邻关系和区域之间的层次关系以及相邻区域之间的层次关系将多个区域分类为各层中的区域组,从而导出区域组之间的层次关系 。 冗余区域组合器根据区域之间的水平,相邻关系和区域组之间的层次关系确定相邻区域的组合顺序。 冗余区域组合器还根据确定的组合顺序确定是否组合相邻区域,并且如果相邻区域被确定为基本相同,则组合相邻区域。 即使在区域相邻图(RAG)中看起来彼此相邻,也可以通过排除区域的组合或根据层次结构重排其组合顺序来导出区域之间的结构性包含关系。 随后,即使两个区域的颜色信号,例如突出显示区域中的区域和其周围区域中的区域彼此不相似,也可以从包含关系推断两个区域之间的相互关系。

    Method, medium and apparatus for providing mobile voice web service
    6.
    发明申请
    Method, medium and apparatus for providing mobile voice web service 有权
    用于提供移动语音网络服务的方法,媒体和设备

    公开(公告)号:US20090055179A1

    公开(公告)日:2009-02-26

    申请号:US12007797

    申请日:2008-01-15

    IPC分类号: G10L15/00

    CPC分类号: G10L15/193

    摘要: Provided are a method and apparatus for providing a mobile voice web service in a mobile terminal. The method includes analyzing a web history of a user from web search logs of the user and generating a voice access list based on the analysis results, and performing voice recognition by dynamically generating a voice recognition syntax according to the generated voice access list. Accordingly, by limiting syntax required for voice recognition by generating a syntax suitable for a web context of the user, efficient voice recognition, which can be performed in a terminal not a server, can be implemented.

    摘要翻译: 提供了一种用于在移动终端中提供移动语音网络服务的方法和装置。 该方法包括:根据用户的网页搜索日志对用户的网络历史进行分析,并根据分析结果生成语音访问列表,并根据生成的语音访问列表动态生成语音识别语法,进行语音识别。 因此,通过生成适合于用户的web上下文的语法来限制语音识别所需的语法,可以实现可以在不是服务器的终端中执行的高效语音识别。