Information processing apparatus and method, program, and recording medium
    81.
    Granted patent
    Status: Lapsed

    Publication No.: US08166024B2

    Publication date: 2012-04-24

    Application No.: US12193969

    Filing date: 2008-08-19

    Applicant: Daisuke Negi

    Inventor: Daisuke Negi

    IPC classification: G06F17/30

    Abstract: An information processing apparatus includes: an extracting means for extracting a feature volume from a predetermined content; and a computing means for computing an evaluation axis that classifies a first content and a second content by using a first feature volume extracted from the first content by the extracting means or a second feature volume extracted from the second content by the extracting means.

    Electronic apparatus and image display control method of the electronic apparatus
    82.
    Granted patent
    Status: In force

    Publication No.: US08150168B2

    Publication date: 2012-04-03

    Application No.: US12203831

    Filing date: 2008-09-03

    Applicant: Takuya Koda

    Inventor: Takuya Koda

    IPC classification: G06K9/62 G06K9/60 G06K9/00

    Abstract: According to one embodiment, an electronic apparatus extracts face images of persons from video content data and outputs timestamp information indicating time points at which each extracted face image appears in the video content data, and displays face images in each column of a plurality of face image display areas arranged in a matrix based on the timestamp information. The apparatus detects presence or absence of a face area in each frame constituting the video content data and decides a cutout range of the detected face area. The apparatus also adjusts the cutout range when the decided cutout range protrudes outside the frame.
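
The abstract does not say how the protruding cutout range is adjusted. The following is a minimal sketch of one plausible adjustment, assuming the cutout is shifted back inside the frame while keeping its size, and shrunk only if it is larger than the frame. The names `Rect`, `cutout_range`, `fit_to_frame` and the `margin` parameter are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle in pixel coordinates (top-left origin)."""
    x: int
    y: int
    w: int
    h: int


def cutout_range(face: Rect, margin: float = 0.5) -> Rect:
    """Expand a detected face area by a relative margin on every side.

    The margin value is an illustrative assumption.
    """
    dx = int(face.w * margin)
    dy = int(face.h * margin)
    return Rect(face.x - dx, face.y - dy, face.w + 2 * dx, face.h + 2 * dy)


def fit_to_frame(cut: Rect, frame_w: int, frame_h: int) -> Rect:
    """Move a cutout that protrudes outside the frame back inside it.

    The size is preserved when possible and clipped to the frame
    dimensions only if the cutout is larger than the frame.
    """
    w = min(cut.w, frame_w)
    h = min(cut.h, frame_h)
    x = min(max(cut.x, 0), frame_w - w)
    y = min(max(cut.y, 0), frame_h - h)
    return Rect(x, y, w, h)


if __name__ == "__main__":
    face = Rect(10, 10, 40, 40)       # face near the top-left corner
    cut = cutout_range(face)          # extends past the top and left edges
    print(fit_to_frame(cut, 640, 480))
```

Shifting rather than cropping keeps every thumbnail the same size, which suits a matrix of face image display areas; cropping would be an equally valid reading of the abstract.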

    Systems And Methods for Manipulating Electronic Content Based On Speech Recognition
    83.
    Patent application
    Status: In force

    Publication No.: US20120010884A1

    Publication date: 2012-01-12

    Application No.: US13156780

    Filing date: 2011-06-09

    IPC classification: G10L17/00

    Abstract: Systems and methods are disclosed for displaying electronic multimedia content to a user. One computer-implemented method for manipulating electronic multimedia content includes generating, using a processor, a speech model and at least one speaker model of an individual speaker. The method further includes receiving electronic media content over a network; extracting an audio track from the electronic media content; and detecting speech segments within the electronic media content based on the speech model. The method further includes detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model.

    Object Detection Metadata
    84.
    Patent application
    Status: In force

    Publication No.: US20110305394A1

    Publication date: 2011-12-15

    Application No.: US12815959

    Filing date: 2010-06-15

    IPC classification: G06K9/46 G06F17/30

    Abstract: A perimeter around a detected object in a frame of image data can be generated in a first coordinate system. The perimeter can be converted from the first coordinate system into a second coordinate system having the same aspect ratio as the first coordinate system. A first metadata entry can include dimensions of image data in the second coordinate system. A second metadata entry can provide a location and dimensions of the converted perimeter in the second coordinate space. Additional metadata can indicate matching objects between frames, position of an object relative to other objects in a frame, a probability that an object is correctly detected, and a total number of objects detected across multiple frames of image data.
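
The coordinate conversion described in this abstract can be illustrated with a short sketch. It assumes the perimeter is an axis-aligned box given as `(x, y, width, height)` and that both coordinate systems are described by a width and a height; the function names and the dictionary layout are hypothetical, not the metadata format of the application.

```python
def convert_perimeter(box, src_size, dst_size):
    """Scale a box from one coordinate system into another.

    The two systems must share an aspect ratio, so a single uniform
    scale factor applies to both axes.
    """
    x, y, w, h = box
    src_w, src_h = src_size
    dst_w, dst_h = dst_size
    # Cross-multiplied comparison avoids floating-point error.
    if src_w * dst_h != src_h * dst_w:
        raise ValueError("coordinate systems differ in aspect ratio")
    return (
        x * dst_w / src_w,
        y * dst_h / src_h,
        w * dst_w / src_w,
        h * dst_h / src_h,
    )


def build_metadata(box, src_size, dst_size):
    """Return the two entries named in the abstract as plain dicts."""
    x, y, w, h = convert_perimeter(box, src_size, dst_size)
    return {
        # First entry: dimensions of the image data in the second system.
        "dimensions": {"width": dst_size[0], "height": dst_size[1]},
        # Second entry: location and dimensions of the converted perimeter.
        "perimeter": {"x": x, "y": y, "width": w, "height": h},
    }


if __name__ == "__main__":
    # A detection in a 1920x1080 frame, stored in a 640x360 space.
    print(build_metadata((960, 540, 192, 108), (1920, 1080), (640, 360)))
```

Storing the perimeter in a resolution-independent space lets the same metadata be applied to scaled copies of the frame, as long as the aspect ratio is preserved.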

    System and Method to Assign a Digital Image to a Face Cluster
    85.
    Patent application
    Status: In force

    Publication No.: US20110129126A1

    Publication date: 2011-06-02

    Application No.: US12629215

    Filing date: 2009-12-02

    Applicants: Lee Begeja, Zhu Liu

    Inventors: Lee Begeja, Zhu Liu

    IPC classification: G06K9/00

    CPC classification: G06K9/00295 G06F17/30793

    Abstract: A computer implemented method includes accessing a digital image including a plurality of faces including a first face and a second face. The computer implemented method includes identifying a plurality of identification regions of the digital image including a first identification region associated with the first face and a second identification region associated with the second face. The computer implemented method also includes assigning the digital image to a first face cluster of a plurality of face clusters when a difference between data descriptive of the first identification region and data descriptive of a face cluster identification region of the first face cluster satisfies a threshold. The computer implemented method further includes assigning the digital image to a second face cluster of the plurality of face clusters based at least partially on a probability of the second face and the first face appearing together in an image.
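
The two assignment steps in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the method of the application: face regions are represented as small feature vectors, the "difference" is Euclidean distance, and the second cluster is chosen by a made-up score that divides the co-occurrence probability by one plus the distance. All names and the `threshold` value are hypothetical.

```python
import math


def assign_image(first_face, second_face, clusters, cooccurrence, threshold=0.5):
    """Return (first_cluster, second_cluster) for an image with two faces.

    clusters:     {cluster name: reference feature vector}
    cooccurrence: {frozenset({a, b}): probability that a and b appear together}
    """
    # Step 1: the first face joins a cluster whose reference vector
    # differs from it by no more than the threshold.
    first = None
    for name, ref in clusters.items():
        if math.dist(first_face, ref) <= threshold:
            first = name
            break
    if first is None:
        return None, None

    # Step 2: the second face is scored against the remaining clusters,
    # weighting appearance similarity by the co-occurrence probability.
    def score(name):
        p = cooccurrence.get(frozenset((first, name)), 0.0)
        return p / (1.0 + math.dist(second_face, clusters[name]))

    candidates = [name for name in clusters if name != first]
    second = max(candidates, key=score) if candidates else None
    return first, second


if __name__ == "__main__":
    clusters = {"A": (0.0, 0.0), "B": (1.0, 0.0), "C": (1.1, 0.0)}
    cooccurrence = {
        frozenset(("A", "B")): 0.1,
        frozenset(("A", "C")): 0.8,
    }
    # The second face is closest to B, but C co-occurs with A far more often.
    print(assign_image((0.1, 0.0), (1.0, 0.0), clusters, cooccurrence))
```

In this example the co-occurrence prior outweighs the slightly smaller distance to cluster B, which is the kind of disambiguation the abstract's second step describes.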

    Summary of a video using faces
    87.
    Granted patent
    Status: In force

    Publication No.: US07916894B1

    Publication date: 2011-03-29

    Application No.: US11699618

    Filing date: 2007-01-29

    IPC classification: G06K9/00 G06K9/46

    CPC classification: G06F17/30793 G06F17/30843

    Abstract: A plurality of sets of face images associated with a video is obtained. Each set of face images corresponds to a particular person depicted in the video. Of the people associated with the plurality of sets of face images, one or more of those people are selected to be included in a facial summary by analyzing the plurality of sets of face images and/or the video. For each of the selected one or more people, a face image to use in the facial summary is selected. The facial summary is laid out using the selected face images.

    Video retrieval system for human face content
    88.
    Granted patent
    Status: In force

    Publication No.: US07881505B2

    Publication date: 2011-02-01

    Application No.: US11540619

    Filing date: 2006-09-29

    IPC classification: G06K9/00 H04N9/47

    Abstract: A method and apparatus for video retrieval and cueing that automatically detects human faces in the video and identifies face-specific video frames so as to allow retrieval and viewing of person-specific video segments. In one embodiment, the method locates human faces in the video, stores the time stamps associated with each face, displays a single image associated with each face, matches each face against a database, computes face locations with respect to a common 3D coordinate system, and provides a means of displaying: 1) information retrieved from the database associated with a selected person or people, 2) path of travel associated with a selected person or people, 3) interaction graph of people in video, 4) video segments associated with each person and/or face. The method may also provide the ability to input and store text annotations associated with each person, face, and video segment, and the ability to enroll and remove people from the database. The videos of non-human objects may be processed in a similar manner. Because of the rules governing abstracts, this abstract should not be used to construe the claims.

    INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM
    89.
    Patent application
    Status: In force

    Publication No.: US20100182501A1

    Publication date: 2010-07-22

    Application No.: US12688511

    Filing date: 2010-01-15

    IPC classification: H04N7/01 H04N9/74

    Abstract: The information processing apparatus according to the present invention is provided with a moving picture analysis unit for analyzing moving picture data including a plurality of images and audios associated with time information and for generating moving picture metadata relating to a plurality of feature quantities characterizing the moving picture, a comic display conversion unit for extracting a plurality of images from the moving picture data based on the moving picture metadata and for dividing a predetermined display region into frames and for converting an arrangement of the plurality of extracted images into a comic-like arrangement and for generating frame information including information about the images arranged in each of the frames, and a comic display data generation unit for generating comic display data including at least the frame information, data of the extracted images, and the audio data of the moving picture.

    DEVICE AND METHOD FOR AUTOMATIC PARTICIPANT IDENTIFICATION IN A RECORDED MULTIMEDIA STREAM
    90.
    Patent application
    Status: In force

    Publication No.: US20100149305A1

    Publication date: 2010-06-17

    Application No.: US12638635

    Filing date: 2009-12-15

    Abstract: The present disclosure discloses a method for identifying individuals in a multimedia stream originating from a video conferencing terminal or a Multipoint Control Unit, including executing a face detection process on the multimedia stream; defining subsets including facial images of one or more individuals, where the subsets are ranked according to a probability that their respective one or more individuals will appear in a video stream; comparing a detected face to the subsets in consecutive order starting with a most probable subset, until a match is found; and storing an identity of the detected face as searchable metadata in a content database in response to the detected face matching a facial image in one of the subsets.
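
The ranked search described in this abstract can be sketched as below. The sketch assumes faces are compared as feature vectors with a Euclidean distance threshold and that the content database is a plain dictionary; these choices, along with the names `identify`, `index_face` and `threshold`, are illustrative assumptions rather than details from the application.

```python
import math


def identify(face_vec, subsets, threshold=0.1):
    """Search subsets from most to least probable and stop at the first match.

    subsets: list of (probability, {identity: reference feature vector})
    Returns the matched identity, or None if no subset contains a match.
    """
    ranked = sorted(subsets, key=lambda s: s[0], reverse=True)
    for _probability, faces in ranked:
        for identity, ref in faces.items():
            if math.dist(face_vec, ref) <= threshold:
                return identity
    return None


def index_face(content_db, stream_id, face_vec, subsets):
    """Store a matched identity as searchable metadata for a stream."""
    identity = identify(face_vec, subsets)
    if identity is not None:
        content_db.setdefault(stream_id, set()).add(identity)
    return identity


if __name__ == "__main__":
    subsets = [
        (0.2, {"carol": (0.9, 0.9)}),                        # less likely group
        (0.7, {"alice": (0.1, 0.1), "bob": (0.5, 0.5)}),     # likely attendees
    ]
    db = {}
    index_face(db, "meeting-01", (0.12, 0.1), subsets)
    print(db)
```

Searching the most probable subset first means that, for typical recordings, most detected faces are resolved after comparison with only a small part of the full gallery.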
