-
Publication number: US20180239995A1
Publication date: 2018-08-23
Application number: US15962514
Filing date: 2018-04-25
Applicant: Adobe Systems Incorporated
Inventor: Zhaowen Wang , Luoqi Liu , Hailin Jin
CPC classification number: G06K9/6828 , G06K9/00442 , G06K9/46 , G06K9/4628 , G06K9/52 , G06K9/6272 , G06K9/66 , G06K2009/4666 , G06K2209/01 , G06N3/0454 , G06T3/40 , G06T7/60
Abstract: Font recognition and similarity determination techniques and systems are described. In a first example, localization techniques are described to train a model using machine learning (e.g., a convolutional neural network) using training images. The model is then used to localize text in a subsequently received image, and may do so automatically and without user intervention, e.g., without specifying any of the edges of a bounding box. In a second example, a deep neural network is directly learned as an embedding function of a model that is usable to determine font similarity. In a third example, techniques are described that leverage attributes described in metadata associated with fonts as part of font recognition and similarity determinations.
-
Publication number: US10026020B2
Publication date: 2018-07-17
Application number: US14997011
Filing date: 2016-01-15
Applicant: Adobe Systems Incorporated
Inventor: Hailin Jin , Zhou Ren , Zhe Lin , Chen Fang
CPC classification number: G06K9/628 , G06F16/5846 , G06F16/5866 , G06K9/00684 , G06K9/66 , G06K9/726
Abstract: Embedding space for images with multiple text labels is described. In the embedding space both text labels and image regions are embedded. The text labels embedded describe semantic concepts that can be exhibited in image content. The embedding space is trained to semantically relate the embedded text labels so that labels like “sun” and “sunset” are more closely related than “sun” and “bird”. Training the embedding space also includes mapping representative images, having image content which exemplifies the semantic concepts, to respective text labels. Unlike conventional techniques that embed an entire training image into the embedding space for each text label associated with the training image, the techniques described herein process a training image to generate regions that correspond to the multiple text labels. The regions of the training image are then embedded into the training space in a manner that maps the regions to the corresponding text labels.
-
Publication number: US09978129B2
Publication date: 2018-05-22
Application number: US15707418
Filing date: 2017-09-18
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Jianchao Yang , Hailin Jin , Xin Lu
CPC classification number: G06T5/005 , G06K9/00664 , G06K9/6218 , G06T5/002 , G06T2207/20081 , G06T2207/20084
Abstract: Patch partition and image processing techniques are described. In one or more implementations, a system includes one or more modules implemented at least partially in hardware. The one or more modules are configured to perform operations including grouping a plurality of patches taken from a plurality of training samples of images into respective ones of a plurality of partitions, calculating an image processing operator for each of the partitions, determining distances between the plurality of partitions that describe image similarity of patches of the plurality of partitions, one to another, and configuring a database to provide the determined distance and the image processing operator to process an image in response to identification of a respective partition that corresponds to a patch taken from the image.
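A minimal sketch of the lookup this abstract describes, under stated assumptions: each partition is represented by a centroid vector, each operator is a plain Python callable, and distance is Euclidean. None of these choices come from the patent; the names `build_database` and `process_patch` are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_database(centroids, operators):
    """Store per-partition operators plus pairwise partition distances.

    Assumption: a partition is summarized by one centroid vector.
    """
    distances = [[euclidean(a, b) for b in centroids] for a in centroids]
    return {"centroids": centroids, "operators": operators, "distances": distances}


def process_patch(db, patch):
    """Find the partition nearest to the patch and apply its operator."""
    idx = min(
        range(len(db["centroids"])),
        key=lambda i: euclidean(patch, db["centroids"][i]),
    )
    return idx, db["operators"][idx](patch)


# Two toy partitions with toy operators (illustrative only).
db = build_database(
    centroids=[[0.0, 0.0], [10.0, 10.0]],
    operators=[lambda p: [v + 1 for v in p], lambda p: [v * 2 for v in p]],
)
partition, result = process_patch(db, [9.0, 9.0])
```

The stored `distances` table is what would let a caller fall back to, or blend with, a neighboring partition's operator; that step is left out here.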
-
Publication number: US09811765B2
Publication date: 2017-11-07
Application number: US14995032
Filing date: 2016-01-13
Applicant: Adobe Systems Incorporated
Inventor: Zhaowen Wang , Quanzeng You , Hailin Jin , Chen Fang
CPC classification number: G06K9/6269 , G06F17/30247 , G06F17/3028 , G06F17/30675 , G06K9/00664 , G06K9/4604 , G06K9/4628 , G06K9/6202 , G06K9/6274 , G06N3/0445 , G06N3/08 , G06N7/005
Abstract: Techniques for image captioning with weak supervision are described herein. In implementations, weak supervision data regarding a target image is obtained and utilized to provide detail information that supplements global image concepts derived for image captioning. Weak supervision data refers to noisy data that is not closely curated and may include errors. Given a target image, weak supervision data for visually similar images may be collected from sources of weakly annotated images, such as online social networks. Generally, images posted online include “weak” annotations in the form of tags, titles, labels, and short descriptions added by users. Weak supervision data for the target image is generated by extracting keywords for visually similar images discovered in the different sources. The keywords included in the weak supervision data are then employed to modulate weights applied for probabilistic classifications during image captioning analysis.
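The final step of this abstract, keywords modulating the weights of probabilistic classifications, can be illustrated with a small sketch. It assumes the captioning model exposes a word-to-probability mapping and that modulation is a multiplicative boost followed by renormalization; the function name `modulate_weights` and the `boost` parameter are hypothetical and not taken from the patent.

```python
def modulate_weights(probabilities, keywords, boost=2.0):
    """Scale the probability of words found in the weak-supervision keywords.

    Assumption: a multiplicative boost followed by renormalization.
    """
    scaled = {
        word: p * (boost if word in keywords else 1.0)
        for word, p in probabilities.items()
    }
    total = sum(scaled.values())
    if total == 0:
        return dict(probabilities)  # nothing to renormalize
    return {word: s / total for word, s in scaled.items()}


# Keywords extracted from visually similar images (illustrative values).
adjusted = modulate_weights({"dog": 0.5, "cat": 0.5}, {"cat"}, boost=3.0)
```

Because the keywords come from noisy sources, a modest boost keeps an erroneous tag from overriding the model's own estimate.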
-
Publication number: US20170262414A1
Publication date: 2017-09-14
Application number: US15067108
Filing date: 2016-03-10
Applicant: Adobe Systems Incorporated
Inventor: I-Ming Pao , Zhaowen Wang , Hailin Jin , Alan Lee Erickson
CPC classification number: G06F17/214 , G06N3/0454
Abstract: Embodiments of the present invention are directed at providing a font similarity system. In one embodiment, a new font is detected on a computing device. In response to the detection of the new font, a pre-computed font list is checked to determine whether the new font is included therein. The pre-computed font list includes feature representations, generated independently of the computing device, for corresponding fonts. In response to a determination that the new font is absent from the pre-computed font list, a feature representation for the new font is generated. The generated feature representation is capable of being utilized for a similarity analysis of the new font. The feature representation is then stored in a supplemental font list to enable identification of one or more fonts installed on the computing device that are similar to the new font. Other embodiments may be described and/or claimed.
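The control flow in this abstract (check the pre-computed list, generate a feature representation only when the font is absent, store it in a supplemental list) might be sketched as follows. The dictionaries, the `extract_features` callback, and the use of cosine similarity are assumptions made for illustration; the abstract does not specify the feature format or the similarity measure.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def register_font(name, precomputed, supplemental, extract_features):
    """Return the font's features, computing them only if not already listed."""
    if name in precomputed:
        return precomputed[name]
    if name not in supplemental:
        # Hypothetical on-device feature extraction supplied by the caller.
        supplemental[name] = extract_features(name)
    return supplemental[name]


def similar_fonts(name, installed, precomputed, supplemental, top_k=3):
    """Rank installed fonts by similarity to `name` using both lists."""
    features = {**precomputed, **supplemental}
    query = features[name]
    scored = [
        (cosine_similarity(query, features[f]), f)
        for f in installed
        if f != name and f in features
    ]
    scored.sort(reverse=True)
    return [f for _, f in scored[:top_k]]


precomputed = {"FontA": [1.0, 0.0], "FontB": [0.0, 1.0]}
supplemental = {}
register_font("NewFont", precomputed, supplemental, lambda _name: [0.9, 0.1])
ranked = similar_fonts("NewFont", ["FontA", "FontB", "NewFont"], precomputed, supplemental)
```

Keeping the supplemental list separate means the pre-computed list can be replaced wholesale without discarding features generated on the device.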
-
Publication number: US20170228613A1
Publication date: 2017-08-10
Application number: US15494106
Filing date: 2017-04-21
Applicant: Adobe Systems Incorporated
Inventor: Hailin Jin , Kai Ni
CPC classification number: G06K9/6211 , G06K9/00208 , G06K9/4604 , G06K9/4652 , G06K9/627
Abstract: In one embodiment, a computer accessible storage medium stores a plurality of instructions which, when executed: group a set of reconstructed three dimensional (3D) points derived from image data into a plurality of groups based on one or more attributes of the 3D points; select one or more groups from the plurality of groups; and sample data from the selected groups, wherein the sampled data is input to a consensus estimator to generate a model that describes a 3D model of a scene captured by the image data. Other embodiments may bias sampling into a consensus estimator for any data set, based on relative quality of the data set.
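The group-then-sample step in this abstract can be sketched as weighted sampling without replacement. The sketch assumes each point inherits a weight from its group, that groups with zero weight are excluded, and that the consensus estimator (for example, a RANSAC loop) consumes the returned sample; the names `group_points` and `biased_sample` are hypothetical, and the estimator itself is not shown.

```python
import random
from collections import defaultdict


def group_points(points, key):
    """Group points by an attribute computed with `key`."""
    groups = defaultdict(list)
    for p in points:
        groups[key(p)].append(p)
    return dict(groups)


def biased_sample(groups, group_weight, n, rng):
    """Draw n distinct points; each point is weighted by its group's weight.

    Groups whose weight is zero are never selected.
    """
    pool = [
        (p, group_weight(g))
        for g, pts in groups.items()
        for p in pts
        if group_weight(g) > 0
    ]
    if len(pool) < n:
        raise ValueError("not enough points in the selected groups")
    sample = []
    for _ in range(n):
        weights = [w for _, w in pool]
        i = rng.choices(range(len(pool)), weights=weights, k=1)[0]
        sample.append(pool.pop(i)[0])
    return sample


# Points tagged with an illustrative quality attribute.
points = [(1, "good"), (2, "good"), (3, "good"), (4, "bad"), (5, "bad")]
groups = group_points(points, key=lambda p: p[1])
sample = biased_sample(
    groups, lambda g: 1.0 if g == "good" else 0.0, 3, random.Random(0)
)
```

Passing the random generator in explicitly keeps each hypothesis-generation round reproducible when debugging the estimator.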
-
Publication number: US20170132425A1
Publication date: 2017-05-11
Application number: US14938724
Filing date: 2015-11-11
Applicant: Adobe Systems Incorporated
Inventor: Zeke Koch , Gavin Stuart Peter Miller , Jonathan W. Brandt , Nathan A. Carr , Radomir Mech , Walter Wei-Tuh Chang , Scott D. Cohen , Hailin Jin
CPC classification number: G06F21/6218 , G06F3/04845 , G06F17/30268 , G06F21/10
Abstract: Content creation collection and navigation techniques and systems are described. In one example, a representative image is used by a content sharing service to interact with a collection of images provided as part of a search result. In another example, a user interface image navigation control is configured to support user navigation through images based on one or more metrics. In a further example, a user interface image navigation control is configured to support user navigation through images based on one or more metrics identified for an object selected from the image. In yet another example, collections of images are leveraged as part of content creation. In another example, data obtained from a content sharing service is leveraged to indicate suitability of images of a user for licensing as part of the service.
-
Publication number: US20170132252A1
Publication date: 2017-05-11
Application number: US14938690
Filing date: 2015-11-11
Applicant: Adobe Systems Incorporated
Inventor: Zeke Koch , Gavin Stuart Peter Miller , Jonathan W. Brandt , Nathan A. Carr , Radomir Mech , Walter Wei-Tuh Chang , Scott D. Cohen , Hailin Jin
IPC: G06F17/30 , G06F3/0482 , G06F3/0484
CPC classification number: G06F16/5866 , G06F3/0482 , G06F3/04847 , G06F16/248 , G06F16/287
Abstract: Content creation collection and navigation techniques and systems are described. In one example, a representative image is used by a content sharing service to interact with a collection of images provided as part of a search result. In another example, a user interface image navigation control is configured to support user navigation through images based on one or more metrics. In a further example, a user interface image navigation control is configured to support user navigation through images based on one or more metrics identified for an object selected from the image. In yet another example, collections of images are leveraged as part of content creation. In another example, data obtained from a content sharing service is leveraged to indicate suitability of images of a user for licensing as part of the service.
-
Publication number: US20170098140A1
Publication date: 2017-04-06
Application number: US14876609
Filing date: 2015-10-06
Applicant: Adobe Systems Incorporated
Inventor: Zhaowen Wang , Luoqi Liu , Hailin Jin
CPC classification number: G06K9/6828 , G06K9/00442 , G06K9/46 , G06K9/4628 , G06K9/52 , G06K9/6272 , G06K9/66 , G06K2009/4666 , G06K2209/01 , G06N3/0454 , G06T3/40 , G06T7/60
Abstract: Font recognition and similarity determination techniques and systems are described. In a first example, localization techniques are described to train a model using machine learning (e.g., a convolutional neural network) using training images. The model is then used to localize text in a subsequently received image, and may do so automatically and without user intervention, e.g., without specifying any of the edges of a bounding box. In a second example, a deep neural network is directly learned as an embedding function of a model that is usable to determine font similarity. In a third example, techniques are described that leverage attributes described in metadata associated with fonts as part of font recognition and similarity determinations.
-
Publication number: US09552639B2
Publication date: 2017-01-24
Application number: US14884338
Filing date: 2015-10-15
Applicant: Adobe Systems Incorporated
Inventor: Hailin Jin , Zihan Zhou
CPC classification number: G06T7/80 , G06T7/174 , G06T7/20 , G06T2207/10016 , G06T2207/30241 , H04N13/20
Abstract: Robust techniques for self-calibration of a moving camera observing a planar scene are described. Plane-based self-calibration techniques may take as input the homographies between images estimated from point correspondences and provide an estimate of the focal lengths of all the cameras. A plane-based self-calibration technique may be based on the enumeration of the inherently bounded space of the focal lengths. Each sample of the search space defines a plane in the 3D space and in turn produces a tentative Euclidean reconstruction of all the cameras that is then scored. The sample with the best score is chosen and the final focal lengths and camera motions are computed. Variations on this technique handle both constant focal length cases and varying focal length cases.