METHOD AND APPARATUS FOR AUTO-RECORDING IMAGE DATA
    1.
    发明申请
    METHOD AND APPARATUS FOR AUTO-RECORDING IMAGE DATA 审中-公开
    自动记录图像数据的方法和装置

    公开(公告)号:US20080140424A1

    公开(公告)日:2008-06-12

    申请号:US11954717

    申请日:2007-12-12

    IPC分类号: H04N7/173

    摘要: A auto-recording method is disclosed for auto-recording further to user request, via generating user image and voice data, extracting feature points from the image data according to pre-defined user recognition and following by considering the user as an object of following according to extracted feature points, determining whether the image and voice data satisfy a recording reference needed to perform recording. If determined that the image and voice data satisfy the recording reference, editing the image and voice data in a pre-set edit form and generating and storing at least one of recording image and recording voice data.

    摘要翻译: 公开了一种自动记录方法,用于通过生成用户图像和语音数据进一步自动记录用户请求,根据预定义的用户识别从图像数据中提取特征点,并且通过将用户作为跟随对象来考虑, 提取特征点,确定图像和语音数据是否满足执行记录所需的记录参考。 如果确定图像和语音数据满足记录参考,则以预设编辑形式编辑图像和语音数据,并且生成并存储记录图像和记录语音数据中的至少一个。

    Face recognition system and method based on adaptive learning
    2.
    发明授权
    Face recognition system and method based on adaptive learning 有权
    基于自适应学习的人脸识别系统和方法

    公开(公告)号:US08135220B2

    公开(公告)日:2012-03-13

    申请号:US12115250

    申请日:2008-05-05

    IPC分类号: G06K9/62 G06K9/00

    CPC分类号: G06K9/00295

    摘要: A face recognition system based on adaptive learning includes a specific person detection and tracking unit for detecting and tracking a specific person from a moving image. A facial feature extraction unit extracts a plurality of facial feature vectors from the detected and tracked specific person. A face recognition unit searches for a given registration model by comparing the extracted facial feature vectors with facial feature vectors of the registration models previously stored in a user registration model database. A learning target selection unit selects a facial feature vector to be added to a record of the given registration model from among the extracted facial feature vectors. A registration model learning unit adds and updates the selected facial feature vector to the record of the given registration model.

    摘要翻译: 基于自适应学习的面部识别系统包括用于从运动图像中检测和跟踪特定人物的特定人物检测和跟踪单元。 面部特征提取单元从检测到和跟踪的特定人物中提取多个面部特征向量。 面部识别单元通过将提取的面部特征向量与先前存储在用户登记模型数据库中的登记模型的面部特征向量进行比较来搜索给定的登记模型。 学习目标选择单元从所提取的面部特征向量中选择要添加到给定登记模型的记录的面部特征向量。 注册模型学习单元将所选择的面部特征向量添加并更新到给定注册模型的记录。

    FACE RECOGNITION SYSTEM AND METHOD BASED ON ADAPTIVE LEARNING
    3.
    发明申请
    FACE RECOGNITION SYSTEM AND METHOD BASED ON ADAPTIVE LEARNING 有权
    基于自适应学习的面部识别系统和方法

    公开(公告)号:US20080273766A1

    公开(公告)日:2008-11-06

    申请号:US12115250

    申请日:2008-05-05

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00295

    摘要: A face recognition system based on adaptive learning includes a specific person detection and tracking unit for detecting and tracking a specific person from a moving image. A facial feature extraction unit extracts a plurality of facial feature vectors from the detected and tracked specific person. A face recognition unit searches for a given registration model by comparing the extracted facial feature vectors with facial feature vectors of the registration models previously stored in a user registration model database. A learning target selection unit selects a facial feature vector to be added to a record of the given registration model from among the extracted facial feature vectors. A registration model learning unit adds and updates the selected facial feature vector to the record of the given registration model.

    摘要翻译: 基于自适应学习的面部识别系统包括用于从运动图像中检测和跟踪特定人物的特定人物检测和跟踪单元。 面部特征提取单元从检测到和跟踪的特定人物中提取多个面部特征向量。 面部识别单元通过将提取的面部特征向量与先前存储在用户登记模型数据库中的登记模型的面部特征向量进行比较来搜索给定的登记模型。 学习目标选择单元从所提取的面部特征向量中选择要添加到给定登记模型的记录的面部特征向量。 注册模型学习单元将所选择的面部特征向量添加并更新到给定注册模型的记录。

    Method and apparatus for controlling output level of voice signal during video telephony
    4.
    发明授权
    Method and apparatus for controlling output level of voice signal during video telephony 有权
    用于在视频电话期间控制语音信号的输出电平的方法和装置

    公开(公告)号:US08229089B2

    公开(公告)日:2012-07-24

    申请号:US12802427

    申请日:2010-06-07

    IPC分类号: H04M11/00

    摘要: A method and apparatus controls an output level of a voice signal for video telephony by considering the distance between a user and a terminal and surrounding noises. An input image signal and an input voice signal of the user to be used for the video telephony are received at the user's terminal. A received image signal and a received voice signal received from the other party's terminal to which the video telephony is connected, are output on the user's terminal. The user's face region included in the input image signal is extracted. A size information of the extracted face region is checked. A distance information about a distance from the user is checked using the size information. And an output level of the received voice signal is controlled based on the distance information.

    摘要翻译: 一种方法和装置通过考虑用户和终端之间的距离以及周围的噪声来控制用于视频电话的语音信号的输出电平。 在用户的终端处接收要用于视频电话的用户的输入图像信号和输入语音信号。 接收到的图像信号和从视频电话所连接的另一方终端接收到的接收到的语音信号被输出到用户终端。 提取包括在输入图像信号中的用户的脸部区域。 检查提取的面部区域的尺寸信息。 使用尺寸信息检查关于距离用户的距离的距离信息。 并且基于距离信息来控制接收到的语音信号的输出电平。

    Apparatus and method for detecting hands of subject in real time
    5.
    发明授权
    Apparatus and method for detecting hands of subject in real time 有权
    用于实时检测被摄体的手的装置和方法

    公开(公告)号:US08588467B2

    公开(公告)日:2013-11-19

    申请号:US12803369

    申请日:2010-06-25

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00382

    摘要: An apparatus and method can effectively detect both hands and hand shape of a user from images input through cameras. A skin image detecting skin regions from one of the input images and a stereoscopic distance image are used. For hand detection, background and noise are eliminated from a combined image of the skin image and the distance image and regions corresponding to actual both hands are detected from effective images having a high probability of hands. For hand shape detection, a non-skin region is eliminated from the skin image based on the stereoscopic distance information, hand shape candidate regions are detected from the remaining region after elimination, and finally a hand shape is determined.

    摘要翻译: 一种装置和方法可以通过相机输入的图像有效地检测用户的手和手的形状。 使用从输入图像之一和立体距离图像检测皮肤区域的皮肤图像。 对于手部检测,从皮肤图像的组合图像中消除背景和噪声,并且从具有高概率手的有效图像检测距离图像和对应于实际双手的区域。 对于手形检测,基于立体距离信息从皮肤图像中去除非皮肤区域,从消除后的剩余区域检测手形候补区域,最后确定手形。

    Apparatus and method for detecting hands of subject in real time
    6.
    发明申请
    Apparatus and method for detecting hands of subject in real time 有权
    用于实时检测被摄体的手的装置和方法

    公开(公告)号:US20100329511A1

    公开(公告)日:2010-12-30

    申请号:US12803369

    申请日:2010-06-25

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00382

    摘要: An apparatus and method can effectively detect both hands and hand shape of a user from images input through cameras. A skin image detecting skin regions from one of the input images and a stereoscopic distance image are used. For hand detection, background and noise are eliminated from a combined image of the skin image and the distance image and regions corresponding to actual both hands are detected from effective images having a high probability of hands. For hand shape detection, a non-skin region is eliminated from the skin image based on the stereoscopic distance information, hand shape candidate regions are detected from the remaining region after elimination, and finally a hand shape is determined.

    摘要翻译: 一种装置和方法可以通过相机输入的图像有效地检测用户的手和手的形状。 使用从输入图像之一和立体距离图像检测皮肤区域的皮肤图像。 对于手部检测,从皮肤图像的组合图像中消除背景和噪声,并且从具有高概率手的有效图像检测距离图像和对应于实际双手的区域。 对于手形检测,基于立体距离信息从皮肤图像中去除非皮肤区域,从消除后的剩余区域检测手形候补区域,最后确定手形。

    Method and apparatus for controlling output level of voice signal during video telephony
    7.
    发明申请
    Method and apparatus for controlling output level of voice signal during video telephony 有权
    用于在视频电话期间控制语音信号的输出电平的方法和装置

    公开(公告)号:US20100315485A1

    公开(公告)日:2010-12-16

    申请号:US12802427

    申请日:2010-06-07

    IPC分类号: H04N7/14

    摘要: A method and apparatus controls an output level of a voice signal for video telephony by considering the distance between a user and a terminal and surrounding noises. An input image signal and an input voice signal of the user to be used for the video telephony are received at the user's terminal. A received image signal and a received voice signal received from the other party's terminal to which the video telephony is connected, are output on the user's terminal. The user's face region included in the input image signal is extracted. A size information of the extracted face region is checked. A distance information about a distance from the user is checked using the size information. And an output level of the received voice signal is controlled based on the distance information.

    摘要翻译: 一种方法和装置通过考虑用户和终端之间的距离以及周围的噪声来控制用于视频电话的语音信号的输出电平。 在用户的终端处接收要用于视频电话的用户的输入图像信号和输入语音信号。 接收到的图像信号和从视频电话所连接的另一方终端接收到的接收到的语音信号被输出到用户终端。 提取包括在输入图像信号中的用户的脸部区域。 检查提取的面部区域的尺寸信息。 使用尺寸信息检查关于距离用户的距离的距离信息。 并且基于距离信息来控制接收到的语音信号的输出电平。

    System and method for controlling voice detection of network terminal
    8.
    发明授权
    System and method for controlling voice detection of network terminal 有权
    控制网络终端语音检测的系统及方法

    公开(公告)号:US07890334B2

    公开(公告)日:2011-02-15

    申请号:US11705802

    申请日:2007-02-13

    IPC分类号: G10L15/22

    CPC分类号: G10L15/30

    摘要: Provided is a system and method for controlling voice detection of a network terminal. The system includes the network terminal for, if detection of a voice signal is requested, detecting voice by receiving and setting a voice detection setting value corresponding to a predetermined service and generating a trigger signal for the voice detection according to the voice detection setting value corresponding to the service; and a server for determining the service of the network terminal and transmitting the voice detection setting value corresponding to the service to the network terminal. Accordingly, by controlling to commence voice detection according to a service, voice detection optimized to a relevant service can commence.

    摘要翻译: 提供一种用于控制网络终端的语音检测的系统和方法。 该系统包括网络终端,如果要求检测到语音信号,则通过接收并设置与预定服务相对应的语音检测设置值来检测语音,并根据对应的语音检测设置值产生用于语音检测的触发信号 服务; 以及服务器,用于确定网络终端的服务,并将对应于该服务的语音检测设置值发送到网络终端。 因此,通过控制根据业务开始语音检测,可以开始优化到相关服务的语音检测。

    System and method for controlling voice detection of network terminal
    9.
    发明申请
    System and method for controlling voice detection of network terminal 有权
    控制网络终端语音检测的系统及方法

    公开(公告)号:US20070201639A1

    公开(公告)日:2007-08-30

    申请号:US11705802

    申请日:2007-02-13

    IPC分类号: H04M11/00

    CPC分类号: G10L15/30

    摘要: Provided is a system and method for controlling voice detection of a network terminal. The system includes the network terminal for, if detection of a voice signal is requested, detecting voice by receiving and setting a voice detection setting value corresponding to a predetermined service and generating a trigger signal for the voice detection according to the voice detection setting value corresponding to the service; and a server for determining the service of the network terminal and transmitting the voice detection setting value corresponding to the service to the network terminal. Accordingly, by controlling to commence voice detection according to a service, voice detection optimized to a relevant service can commence.

    摘要翻译: 提供一种用于控制网络终端的语音检测的系统和方法。 该系统包括网络终端,如果要求检测到语音信号,则通过接收并设置与预定服务相对应的语音检测设置值来检测语音,并根据对应的语音检测设置值产生用于语音检测的触发信号 服务; 以及服务器,用于确定网络终端的服务,并将对应于该服务的语音检测设置值发送到网络终端。 因此,通过控制根据业务开始语音检测,可以开始优化到相关服务的语音检测。

    METHOD AND APPARATUS FOR SPEECH SPEAKER RECOGNITION
    10.
    发明申请
    METHOD AND APPARATUS FOR SPEECH SPEAKER RECOGNITION 审中-公开
    用于语音识别的方法和装置

    公开(公告)号:US20080249774A1

    公开(公告)日:2008-10-09

    申请号:US12061156

    申请日:2008-04-02

    IPC分类号: G10L17/00

    CPC分类号: G10L17/02

    摘要: Disclosed is a method for speech speaker recognition of a speech speaker recognition apparatus, the method including detecting effective speech data from input speech; extracting an acoustic feature from the speech data; generating an acoustic feature transformation matrix from the speech data according to each of Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA), mixing each of the acoustic feature transformation matrixes to construct a hybrid acoustic feature transformation matrix, and multiplying the matrix representing the acoustic feature with the hybrid acoustic feature transformation matrix to generate a final feature vector; and generating a speaker model from the final feature vector, comparing a pre-stored universal speaker model with the generated speaker model to identify the speaker, and verifying the identified speaker.

    摘要翻译: 公开了一种语音讲话者识别装置的语音说话人识别方法,该方法包括从输入语音中检测有效的语音数据; 从所述语音数据中提取声学特征; 根据主成分分析(PCA)和线性判别分析(LDA)中的每一个从语音数据生成声学特征变换矩阵,混合每个声学特征变换矩阵以构建混合声学特征变换矩阵,并将表示 声学特征与混合声学特征变换矩阵以产生最终特征向量; 以及从所述最终特征向量生成扬声器模型,将预先存储的通用扬声器模型与所产生的扬声器模型进行比较以识别所述扬声器,以及验证所识别的扬声器。