Entity based temporal segmentation of video streams

    Publication No.: US09607224B2

    Publication date: 2017-03-28

    Application No.: US14712071

    Filing date: 2015-05-14

    Applicant: Google Inc.

    CPC classification number: G06K9/00765 G06K9/6269 G06K9/66 H04N5/91

    Abstract: A solution is provided for temporally segmenting a video based on analysis of entities identified in the video frames of the video. The video is decoded into multiple video frames, and a number of those frames are selected for annotation. The annotation process identifies entities present in a sample video frame, and each identified entity has a timestamp and a confidence score indicating the likelihood that the entity is accurately identified. For each identified entity, a time series comprising timestamps and corresponding confidence scores is generated and smoothed to reduce annotation noise. One or more segments containing an entity over the length of the video are obtained by detecting boundaries of the segments in the time series of the entity. From the individual temporal segmentation for each identified entity in the video, an overall temporal segmentation for the video is generated, where the overall temporal segmentation reflects the semantics of the video.
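
The pipeline the abstract describes (per-entity time series, smoothing, boundary detection, overall segmentation) can be illustrated with a minimal Python sketch. The abstract does not specify the smoothing filter, the boundary detector, or how per-entity segments are combined, so everything below is an assumption made for illustration: a centered moving average, a fixed confidence threshold, and a merge that cuts the timeline at every entity boundary. The names `smooth`, `detect_segments`, and `overall_segmentation` are hypothetical and do not come from the patent.

```python
from typing import Dict, List, Tuple

Sample = Tuple[float, float]   # (timestamp in seconds, confidence in [0, 1])
Segment = Tuple[float, float]  # (start timestamp, end timestamp)


def smooth(series: List[Sample], window: int = 3) -> List[Sample]:
    """Centered moving average over confidence scores (assumed smoother)."""
    half = window // 2
    out = []
    for i, (t, _) in enumerate(series):
        lo, hi = max(0, i - half), min(len(series), i + half + 1)
        scores = [c for _, c in series[lo:hi]]
        out.append((t, sum(scores) / len(scores)))
    return out


def detect_segments(series: List[Sample], threshold: float = 0.5) -> List[Segment]:
    """Return maximal runs of samples scoring at or above an assumed threshold."""
    segments: List[Segment] = []
    start = last = None
    for t, c in series:
        if c >= threshold:
            if start is None:
                start = t          # a segment boundary opens here
            last = t
        elif start is not None:
            segments.append((start, last))  # the boundary closes at the last hit
            start = None
    if start is not None:
        segments.append((start, last))
    return segments


def overall_segmentation(
    per_entity: Dict[str, List[Segment]],
) -> List[Tuple[float, float, List[str]]]:
    """Cut the timeline at every entity boundary and label each interval
    with the entities whose segments cover it (assumed merge rule)."""
    cuts = sorted({t for segs in per_entity.values() for seg in segs for t in seg})
    result = []
    for a, b in zip(cuts, cuts[1:]):
        present = sorted(
            name
            for name, segs in per_entity.items()
            if any(s <= a and b <= e for s, e in segs)
        )
        if present:
            result.append((a, b, present))
    return result
```

Under these assumptions, a brief dip in one entity's confidence is averaged away before thresholding, so the entity keeps a single segment instead of splitting in two. A real system would likely tune the window and threshold per entity or replace them with a learned boundary detector.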

    2. ENTITY BASED TEMPORAL SEGMENTATION OF VIDEO STREAMS
    Type: Invention application
    Legal status: In force

    Publication No.: US20160335499A1

    Publication date: 2016-11-17

    Application No.: US14712071

    Filing date: 2015-05-14

    Applicant: Google Inc.

    CPC classification number: G06K9/00765 G06K9/6269 G06K9/66 H04N5/91

    Abstract: A solution is provided for temporally segmenting a video based on analysis of entities identified in the video frames of the video. The video is decoded into multiple video frames, and a number of those frames are selected for annotation. The annotation process identifies entities present in a sample video frame, and each identified entity has a timestamp and a confidence score indicating the likelihood that the entity is accurately identified. For each identified entity, a time series comprising timestamps and corresponding confidence scores is generated and smoothed to reduce annotation noise. One or more segments containing an entity over the length of the video are obtained by detecting boundaries of the segments in the time series of the entity. From the individual temporal segmentation for each identified entity in the video, an overall temporal segmentation for the video is generated, where the overall temporal segmentation reflects the semantics of the video.

    3. Annotate Apps with Entities by Fusing Heterogeneous Signals
    Type: Invention application
    Legal status: Under examination (published)

    Publication No.: US20160125034A1

    Publication date: 2016-05-05

    Application No.: US14614688

    Filing date: 2015-02-05

    Applicant: Google Inc.

    CPC classification number: G06F16/24573 G06F16/951

    Abstract: A system and method of annotating an application, including obtaining input signals associated with a target application, wherein the input signals are obtained from a plurality of sources, obtaining first annotation data from the obtained input signals, generating second annotation data in a machine-understandable form based on the first annotation data, and associating the second annotation data with the target application.
