-
公开(公告)号:US20160104054A1
公开(公告)日:2016-04-14
申请号:US14510000
申请日:2014-10-08
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Radomir Mech , Xiaohui Shen , Brian L. Price , Jianming Zhang
CPC classification number: G06K9/4671 , G06T11/60
Abstract: Saliency map computation is described. In one or more implementations, a base saliency map is generated for an image of a scene. The base saliency map may be generated from intermediate saliency maps computed for boundary regions of the image. Each of the intermediate saliency maps may represent visual saliency of portions of the scene that are captured in the corresponding boundary region. The boundary regions may include, for instance, a top boundary region, a bottom boundary region, a left boundary region, and a right boundary region. Further, the intermediate saliency maps may be combined in such a way that an effect of a foreground object on the saliency map is suppressed. The foreground objects for which the effect is suppressed are those that occupy a majority of one of the boundary regions.
Abstract translation: 描述了显着地图计算。 在一个或多个实现中,为场景的图像生成基本显着图。 可以从为图像的边界区域计算的中间显着图生成基本显着图。 每个中间显着图可以表示在相应边界区域中捕获的场景的部分的视觉显着性。 边界区域可以包括例如顶边界区域,底边界区域,左边界区域和右边界区域。 此外,中间显着图可以以这样的方式组合,即前景对象对显着图的影响被抑制。 效果被抑制的前景对象是占据边界区域中的大多数的那些。
-
公开(公告)号:US10257436B1
公开(公告)日:2019-04-09
申请号:US15730632
申请日:2017-10-11
Applicant: Adobe Systems Incorporated
Inventor: Jianming Zhang , Zijun Wei , Zhe Lin , Xiaohui Shen , Radomir Mech
Abstract: Various embodiments describe view switching of video on a computing device. In an example, a video processing application receives a stream of video data. The video processing application renders a major view on a display of the computing device. The major view presents a video from the stream of video data. The video processing application inputs the stream of video data to a deep learning system and receives back information that identifies a cropped video from the video based on a composition score of the cropped video, while the video is presented in the major view. The composition score is generated by the deep learning system. The video processing application renders a sub-view on a display of the device, the sub-view presenting the cropped video. The video processing application renders the cropped video in the major view based on a user interaction with the sub-view.
-
公开(公告)号:US09454712B2
公开(公告)日:2016-09-27
申请号:US14510000
申请日:2014-10-08
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Radomir Mech , Xiaohui Shen , Brian L. Price , Jianming Zhang
CPC classification number: G06K9/4671 , G06T11/60
Abstract: Saliency map computation is described. In one or more implementations, a base saliency map is generated for an image of a scene. The base saliency map may be generated from intermediate saliency maps computed for boundary regions of the image. Each of the intermediate saliency maps may represent visual saliency of portions of the scene that are captured in the corresponding boundary region. The boundary regions may include, for instance, a top boundary region, a bottom boundary region, a left boundary region, and a right boundary region. Further, the intermediate saliency maps may be combined in such a way that an effect of a foreground object on the saliency map is suppressed. The foreground objects for which the effect is suppressed are those that occupy a majority of one of the boundary regions.
Abstract translation: 描述了显着地图计算。 在一个或多个实现中,为场景的图像生成基本显着图。 可以从为图像的边界区域计算的中间显着图生成基本显着图。 每个中间显着图可以表示在相应边界区域中捕获的场景的部分的视觉显着性。 边界区域可以包括例如顶边界区域,底边界区域,左边界区域和右边界区域。 此外,中间显着图可以以这样的方式组合,即前景对象对显着图的影响被抑制。 效果被抑制的前景对象是占据边界区域中的大多数的那些。
-
公开(公告)号:US20190108640A1
公开(公告)日:2019-04-11
申请号:US15730564
申请日:2017-10-11
Applicant: Adobe Systems Incorporated
Inventor: Jianming Zhang , Zijun Wei , Zhe Lin , Xiaohui Shen , Radomir Mech
Abstract: Various embodiments describe using a neural network to evaluate image crops in substantially real-time. In an example, a computer system performs unsupervised training of a first neural network based on unannotated image crops, followed by a supervised training of the first neural network based on annotated image crops. Once this first neural network is trained, the computer system inputs image crops generated from images to this trained network and receives composition scores therefrom. The computer system performs supervised training of a second neural network based on the images and the composition scores.
-
公开(公告)号:US20180357803A1
公开(公告)日:2018-12-13
申请号:US15620636
申请日:2017-06-12
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: Jianming Zhang , Zhe Lin , Radomir Mech , Xiaohui Shen
CPC classification number: G06T11/60 , G06T3/40 , G06T7/11 , G06T2210/22
Abstract: Embodiments of the present invention are directed to facilitating region of interest preservation. In accordance with some embodiments of the present invention, a region of interest preservation score using adaptive margins is determined. The region of interest preservation score indicates an extent to which at least one region of interest is preserved in a candidate image crop associated with an image. A region of interest positioning score is determined that indicates an extent to which a position of the at least one region of interest is preserved in the candidate image crop associated with the image. The region of interest preservation score and/or the preserving score are used to select a set of one or more candidate image crops as image crop suggestions.
-
公开(公告)号:US20170344884A1
公开(公告)日:2017-11-30
申请号:US15164310
申请日:2016-05-25
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Xiaohui Shen , Jonathan W. Brandt , Jianming Zhang
CPC classification number: G06N3/084 , G06F17/30259
Abstract: Semantic class localization techniques and systems are described. In one or more implementation, a technique is employed to back communicate relevancies of aggregations back through layers of a neural network. Through use of these relevancies, activation relevancy maps are created that describe relevancy of portions of the image to the classification of the image as corresponding to a semantic class. In this way, the semantic class is localized to portions of the image. This may be performed through communication of positive and not negative relevancies, use of contrastive attention maps to different between semantic classes and even within a same semantic class through use of a self-contrastive technique.
-
公开(公告)号:US20170178291A1
公开(公告)日:2017-06-22
申请号:US15448138
申请日:2017-03-02
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Radomir Mech , Xiaohui Shen , Brian L. Price , Jianming Zhang , Anant Gilra , Jen-Chan Jeff Chien
CPC classification number: G06T3/40 , G06K9/4671 , G06T3/0012 , G06T11/60 , G06T2210/22
Abstract: Image cropping suggestion using multiple saliency maps is described. In one or more implementations, component scores, indicative of visual characteristics established for visually-pleasing croppings, are computed for candidate image croppings using multiple different saliency maps. The visual characteristics on which a candidate image cropping is scored may be indicative of its composition quality, an extent to which it preserves content appearing in the scene, and a simplicity of its boundary. Based on the component scores, the croppings may be ranked with regard to each of the visual characteristics. The rankings may be used to cluster the candidate croppings into groups of similar croppings, such that croppings in a group are different by less than a threshold amount and croppings in different groups are different by at least the threshold amount. Based on the clustering, croppings may then be chosen, e.g., to present them to a user for selection.
-
18.
公开(公告)号:US20190110002A1
公开(公告)日:2019-04-11
申请号:US15730632
申请日:2017-10-11
Applicant: Adobe Systems Incorporated
Inventor: Jianming Zhang , Zijun Wei , Zhe Lin , Xiaohui Shen , Radomir Mech
Abstract: Various embodiments describe view switching of video on a computing device. In an example, a video processing application executed on the computing device receives a stream of video data. The video processing application renders a major view on a display of the computing device. The major view presents a video from the stream of video data. The video processing application inputs the stream of video data to a deep learning system and receives back information that identifies a cropped video from the video based on a composition score of the cropped video, while the video is presented in the major view. The composition score is generated by the deep learning system. The video processing application renders a sub-view on a display of the device, the sub-view presenting the cropped video. The video processing application renders the cropped video in the major view based on a user interaction with the sub-view.
-
公开(公告)号:US10216766B2
公开(公告)日:2019-02-26
申请号:US15463769
申请日:2017-03-20
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: Zhe Lin , Xiaohui Shen , Jianming Zhang , Hailin Jin , Yingwei Li
Abstract: A framework is provided for associating images with topics utilizing embedding learning. The framework is trained utilizing images, each having multiple visual characteristics and multiple keyword tags associated therewith. Visual features are computed from the visual characteristics utilizing a convolutional neural network and an image feature vector is generated therefrom. The keyword tags are utilized to generate a weighted word vector (or “soft topic feature vector”) for each image by calculating a weighted average of word vector representations that represent the keyword tags associated with the image. The image feature vector and the soft topic feature vector are aligned in a common embedding space and a relevancy score is computed for each of the keyword tags. Once trained, the framework can automatically tag images and a text-based search engine can rank image relevance with respect to queried keywords based upon predicted relevancy scores.
-
公开(公告)号:US09626584B2
公开(公告)日:2017-04-18
申请号:US14511001
申请日:2014-10-09
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Radomir Mech , Xiaohui Shen , Brian L. Price , Jianming Zhang , Anant Gilra , Jen-Chan Jeff Chien
CPC classification number: G06T3/40 , G06K9/4671 , G06T3/0012 , G06T11/60 , G06T2210/22
Abstract: Image cropping suggestion using multiple saliency maps is described. In one or more implementations, component scores, indicative of visual characteristics established for visually-pleasing croppings, are computed for candidate image croppings using multiple different saliency maps. The visual characteristics on which a candidate image cropping is scored may be indicative of its composition quality, an extent to which it preserves content appearing in the scene, and a simplicity of its boundary. Based on the component scores, the croppings may be ranked with regard to each of the visual characteristics. The rankings may be used to cluster the candidate croppings into groups of similar croppings, such that croppings in a group are different by less than a threshold amount and croppings in different groups are different by at least the threshold amount. Based on the clustering, croppings may then be chosen, e.g., to present them to a user for selection.
-
-
-
-
-
-
-
-
-