Hierarchical segmentation and quality measurement for video editing

    公开(公告)号:US10062412B2

    公开(公告)日:2018-08-28

    申请号:US15173465

    申请日:2016-06-03

    Applicant: Apple Inc.

    Abstract: Methods for organizing media data by automatically segmenting media data into hierarchical layers of scenes are described. The media data may include metadata and content having still image, video or audio data. The metadata may be content-based (e.g., differences between neighboring frames, exposure data, key frame identification data, motion data, or face detection data) or non-content-based (e.g., exposure, focus, location, time) and used to prioritize and/or classify portions of video. The metadata may be generated at the time of image capture or during post-processing. Prioritization information, such as a score for various portions of the image data may be based on the metadata and/or image data. Classification information such as the type or quality of a scene may be determined based on the metadata and/or image data. The classification and prioritization information may be metadata and may be used to organize the media data.

    VIDEO ANALYSIS TECHNIQUES FOR IMPROVED EDITING, NAVIGATION, AND SUMMARIZATION
    2.
    发明申请
    VIDEO ANALYSIS TECHNIQUES FOR IMPROVED EDITING, NAVIGATION, AND SUMMARIZATION 审中-公开
    改进编辑,导航和总结的视频分析技术

    公开(公告)号:US20160092561A1

    公开(公告)日:2016-03-31

    申请号:US14559705

    申请日:2014-12-03

    Applicant: Apple Inc.

    Abstract: Systems and processes for improved video editing, summarization and navigation based on generation and analysis of metadata are described. The metadata may be content-based (e.g., differences between neighboring frames, exposure data, key frame identification data, motion data, or face detection data) or non-content-based (e.g., exposure, focus, location, time) and used to prioritize and/or classify portions of video. The metadata may be generated at the time of image capture or during post-processing. Prioritization information, such as a score for various portions of the image data may be based on the metadata and/or image data. Classification information such as the type or quality of a scene may be determined based on the metadata and/or image data. The classification and prioritization information may be metadata and may be used to automatically remove undesirable portions of the video, generate suggestions during editing or automatically generate summary video.

    Abstract translation: 描述了基于元数据生成和分析改进视频编辑,汇总和导航的系统和过程。 元数据可以是基于内容的(例如,相邻帧之间的差异,曝光数据,关键帧识别数据,运动数据或面部检测数据)或非基于内容的(例如,曝光,焦点,位置,时间)并且被使用 对视频的部分进行优先排序和/或分类。 元数据可以在图像捕获时或在后处理期间生成。 诸如图像数据的各个部分的得分的优先级信息可以基于元数据和/或图像数据。 可以基于元数据和/或图像数据来确定诸如场景的类型或质量的分类信息。 分类和优先化信息可以是元数据,并且可以用于自动去除视频的不期望的部分,在编辑期间生成建议或自动生成汇总视频。

    IMAGE COMPRESSION BASED ON DEVICE ORIENTATION AND LOCATION INFORMATION
    3.
    发明申请
    IMAGE COMPRESSION BASED ON DEVICE ORIENTATION AND LOCATION INFORMATION 审中-公开
    基于设备定位和位置信息的图像压缩

    公开(公告)号:US20150350653A1

    公开(公告)日:2015-12-03

    申请号:US14288969

    申请日:2014-05-28

    Applicant: Apple Inc.

    CPC classification number: H04N19/137 H04N19/142 H04N19/51 H04N19/52

    Abstract: An encoding system may include a video source that provides video data to be coded, a video coder, a transmitter, and a controller to manage operation of the system. The controller may control the video coder to code and compress the image information from the video source into video data, based upon one or more motion prediction parameters. The transmitter may transmit the video data. A decoding system may decode the video data based upon the motion prediction parameters.

    Abstract translation: 编码系统可以包括提供要编码的视频数据的视频源,视频编码器,发射机和控制器,以管理系统的操作。 控制器可以基于一个或多个运动预测参数来控制视频编码器对来自视频源的图像信息进行编码和压缩成视频数据。 发射机可以发送视频数据。 解码系统可以基于运动预测参数解码视频数据。

    Hierarchical sharpness evaluation

    公开(公告)号:US10402677B2

    公开(公告)日:2019-09-03

    申请号:US15618909

    申请日:2017-06-09

    Applicant: Apple Inc.

    Abstract: Techniques are disclosed for estimating quality of images in an automated fashion. According to these techniques, a source image may be downsampled to generate at least two downsampled images at different levels of downsampling. Blurriness of the images may be estimated starting with a most-heavily downsampled image. Blocks of a given image may be evaluated for blurriness and, when a block of a given image is estimated to be blurry, the block of the image and co-located blocks of higher resolution image(s) may be designated as blurry. Thereafter, a blurriness score may be calculated for the source image from the number of blocks of the source image designated as blurry.

    HIERARCHICAL SEGMENTATION AND QUALITY MEASUREMENT FOR VIDEO EDITING
    8.
    发明申请
    HIERARCHICAL SEGMENTATION AND QUALITY MEASUREMENT FOR VIDEO EDITING 审中-公开
    视频编辑的分层分类和质量测量

    公开(公告)号:US20160358628A1

    公开(公告)日:2016-12-08

    申请号:US15173465

    申请日:2016-06-03

    Applicant: Apple Inc.

    CPC classification number: G11B27/031 G06K9/00718 G06K9/00751 G06K9/00765

    Abstract: Methods for organizing media data by automatically segmenting media data into hierarchical layers of scenes are described. The media data may include metadata and content having still image, video or audio data. The metadata may be content-based (e.g., differences between neighboring frames, exposure data, key frame identification data, motion data, or face detection data) or non-content-based (e.g., exposure, focus, location, time) and used to prioritize and/or classify portions of video. The metadata may be generated at the time of image capture or during post-processing. Prioritization information, such as a score for various portions of the image data may be based on the metadata and/or image data. Classification information such as the type or quality of a scene may be determined based on the metadata and/or image data. The classification and prioritization information may be metadata and may be used to organize the media data.

    Abstract translation: 描述了通过将媒体数据自动分割成层级的场景来组织媒体数据的方法。 媒体数据可以包括具有静止图像,视频或音频数据的元数据和内容。 元数据可以是基于内容的(例如,相邻帧之间的差异,曝光数据,关键帧识别数据,运动数据或面部检测数据)或非基于内容的(例如,曝光,焦点,位置,时间)并且被使用 对视频的部分进行优先排序和/或分类。 元数据可以在图像捕获时或在后处理期间生成。 诸如图像数据的各个部分的得分的优先级信息可以基于元数据和/或图像数据。 可以基于元数据和/或图像数据来确定诸如场景的类型或质量的分类信息。 分类和优先化信息可以是元数据,并且可以用于组织媒体数据。

    Media Analysis and Processing Framework on a Resource Restricted Device
    9.
    发明申请
    Media Analysis and Processing Framework on a Resource Restricted Device 审中-公开
    资源限制设备上的媒体分析和处理框架

    公开(公告)号:US20160357605A1

    公开(公告)日:2016-12-08

    申请号:US15173047

    申请日:2016-06-03

    Applicant: Apple Inc.

    Abstract: A system for processing media on a resource restricted device, the system including a memory to store data representing media assets and associated descriptors, and program instructions representing an application and a media processing system, and a processor to execute the program instructions, wherein the program instructions represent the media processing system, in response to a call from an application defining a plurality of services to be performed on an asset, determine a tiered schedule of processing operations to be performed upon the asset based on a processing budget associated therewith, and iteratively execute the processing operations on a tier-by-tier basis, unless interrupted.

    Abstract translation: 一种用于在资源受限设备上处理媒体的系统,所述系统包括用于存储表示媒体资产和相关联描述符的数据的存储器,以及表示应用和媒体处理系统的程序指令以及执行所述程序指令的处理器,其中所述程序 指令代表媒体处理系统,响应于来自定义要在资产上执行的多个服务的应用的呼叫,基于与其相关联的处理预算来确定要在资产上执行的处理操作的分层调度,并且迭代地 除非中断,否则逐级执行处理操作。

    Modular Machine Learning Architecture
    10.
    发明公开

    公开(公告)号:US20230147442A1

    公开(公告)日:2023-05-11

    申请号:US17831738

    申请日:2022-06-03

    Applicant: Apple Inc.

    CPC classification number: G06N3/045

    Abstract: In an example method, a system accesses first input data and a machine learning architecture. The machine learning architecture includes a first module having a first neural network, a second module having a second neural network, and a third module having a third neural network. The system generates a first feature set representing a first portion of the first input data using the first neural network, and a second feature set representing a second portion of the first input data using the second neural network. The system generates, using the third neural network, first output data based on the first feature set and the second feature set.

Patent Agency Ranking