-
公开(公告)号:US20180336401A1
公开(公告)日:2018-11-22
申请号:US16049322
申请日:2018-07-30
Applicant: Adobe Systems Incorporated
Inventor: Jonathan Brandt , Zhe Lin , Xiaohui Shen , Haoxiang Li
CPC classification number: G06K9/00295 , G06K9/00369 , G06K9/00677 , G06K9/6218 , G06K9/66
Abstract: Methods and systems for recognizing people in images with increased accuracy are disclosed. In particular, the methods and systems divide images into a plurality of clusters based on common characteristics of the images. The methods and systems also determine an image cluster to which an image with an unknown person instance most corresponds. One or more embodiments determine a probability that the unknown person instance is each known person instance in the image cluster using a trained cluster classifier of the image cluster. Optionally, the methods and systems determine context weights for each combination of an unknown person instance and each known person instance using a conditional random field algorithm based on a plurality of context cues associated with the unknown person instance and the known person instances. The methods and systems calculate a contextual probability based on the cluster-based probabilities and context weights to identify the unknown person instance.
-
公开(公告)号:US10089742B1
公开(公告)日:2018-10-02
申请号:US15458887
申请日:2017-03-14
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: Zhe Lin , Xin Lu , Xiaohui Shen , Jimei Yang , Chenxi Liu
Abstract: The invention is directed towards segmenting images based on natural language phrases. An image and an n-gram, including a sequence of tokens, are received. An encoding of image features and a sequence of token vectors are generated. A fully convolutional neural network identifies and encodes the image features. A word embedding model generates the token vectors. A recurrent neural network (RNN) iteratively updates a segmentation map based on combinations of the image feature encoding and the token vectors. The segmentation map identifies which pixels are included in an image region referenced by the n-gram. A segmented image is generated based on the segmentation map. The RNN may be a convolutional multimodal RNN. A separate RNN, such as a long short-term memory network, may iteratively update an encoding of semantic features based on the order of tokens. The first RNN may update the segmentation map based on the semantic feature encoding.
-
公开(公告)号:US20180211135A1
公开(公告)日:2018-07-26
申请号:US15935816
申请日:2018-03-26
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Yufei Wang , Radomir Mech , Xiaohui Shen , Gavin Stuart Peter Miller
CPC classification number: G06K9/6218 , G06F16/51 , G06F16/58 , G06F16/583 , G06K9/00228 , G06K9/00677 , G06K9/00718 , G06K9/00751 , G06K9/4628 , G06K9/6215 , G06K9/6254 , G06K9/6255 , G06K9/628 , G06N3/0454 , G06N3/084
Abstract: In embodiments of event image curation, a computing device includes memory that stores a collection of digital images associated with a type of event, such as a digital photo album of digital photos associated with the event, or a video of image frames and the video is associated with the event. A curation application implements a convolutional neural network, which receives the digital images and a designation of the type of event. The convolutional neural network can then determine an importance rating of each digital image within the collection of the digital images based on the type of the event. The importance rating of a digital image is representative of an importance of the digital image to a person in context of the type of the event. The convolutional neural network generates an output of representative digital images from the collection based on the importance rating of each digital image.
-
公开(公告)号:US10019823B2
公开(公告)日:2018-07-10
申请号:US14062751
申请日:2013-10-24
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Radomir Mech , Peng Wang
CPC classification number: G06T11/60 , G06N7/005 , G06N20/10 , G06T3/0012 , G06T2207/20132
Abstract: In techniques of combined composition and change-based models for image cropping, a composition application is implemented to apply one or more image composition modules of a learned composition model to evaluate multiple composition regions of an image. The learned composition model can determine one or more cropped images from the image based on the applied image composition modules, and evaluate a composition of the cropped images and a validity of change from the image to the cropped images. The image composition modules of the learned composition model include a salient regions module that iteratively determines salient image regions of the image, and include a foreground detection module that determines foreground regions of the image. The image composition modules also include one or more imaging models that reduce a number of the composition regions of the image to facilitate determining the cropped images from the image.
-
公开(公告)号:US20180137892A1
公开(公告)日:2018-05-17
申请号:US15353186
申请日:2016-11-16
Applicant: Adobe Systems Incorporated
Inventor: Zhihong Ding , Zhe Lin , Xiaohui Shen , Michael Kaplan , Jonathan Brandt
CPC classification number: G11B27/11 , G06K9/00744 , G06K9/3241 , G06T7/97 , G06T11/60 , G06T2207/10016 , G11B27/031
Abstract: The present disclosure is directed toward systems and methods for tracking objects in videos. For example, one or more embodiments described herein utilize various tracking methods in combination with an image search index made up of still video frames indexed from a video. One or more embodiments described herein utilize a backward and forward tracking method that is anchored by one or more key frames in order to accurately track an object through the frames of a video, even when the video is long and may include challenging conditions.
-
公开(公告)号:US09911201B2
公开(公告)日:2018-03-06
申请号:US15191141
申请日:2016-06-23
Applicant: Adobe Systems Incorporated
CPC classification number: G06T7/90 , G06K9/2054 , G06K9/4652 , G06T5/005 , G06T2207/10024 , H04N1/622
Abstract: Imaging process initialization techniques are described. In an implementation, a color estimate is generated for a plurality of pixels within a region of an image. A plurality of pixels outside of the regions are first identified for each pixel of the plurality of pixels within the region. This may include identification of pixels disposed at opposing directions from the pixel being estimated. A color estimate is determined for each of the plurality of pixels based on the identified pixels. As part of this, a weighting may be employed, such as based on a respective distance of each of the pixels outside of the region to the pixel within the region, a distance along the opposing direction for corresponding pixels outside of the region (e.g., at horizontal or vertical directions), and so forth. The color estimate is then used to initialize an imaging process technique.
-
公开(公告)号:US20170357892A1
公开(公告)日:2017-12-14
申请号:US15177121
申请日:2016-06-08
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Yufei Wang , Radomir Mech , Xiaohui Shen , Gavin Stuart Peter Miller
Abstract: In embodiments of convolutional neural network joint training, a computing system memory maintains different data batches of multiple digital image items, where the digital image items of the different data batches have some common features. A convolutional neural network (CNN) receives input of the digital image items of the different data batches, and classifier layers of the CNN are trained to recognize the common features in the digital image items of the different data batches. The recognized common features are input to fully-connected layers of the CNN that distinguish between the recognized common features of the digital image items of the different data batches. A scoring difference is determined between item pairs of the digital image items in a particular one of the different data batches. A piecewise ranking loss algorithm maintains the scoring difference between the item pairs, and the scoring difference is used to train CNN regression functions.
-
公开(公告)号:US20170294010A1
公开(公告)日:2017-10-12
申请号:US15097113
申请日:2016-04-12
Applicant: Adobe Systems Incorporated
Inventor: Xiaohui Shen , Zhe Lin , Shu Kong , Radomir Mech
CPC classification number: G06T7/0002 , G06K9/623 , G06K9/6256 , G06T2207/20081 , G06T2207/30168
Abstract: Systems and methods are disclosed for estimating aesthetic quality of digital images using deep learning. In particular, the disclosed systems and methods describe training a neural network to generate an aesthetic quality score digital images. In particular, the neural network includes a training structure that compares relative rankings of pairs of training images to accurately predict a relative ranking of a digital image. Additionally, in training the neural network, an image rating system can utilize content-aware and user-aware sampling techniques to identify pairs of training images that have similar content and/or that have been rated by the same or different users. Using content-aware and user-aware sampling techniques, the neural network can be trained to accurately predict aesthetic quality ratings that reflect subjective opinions of most users as well as provide aesthetic scores for digital images that represent the wide spectrum of aesthetic preferences of various users.
-
公开(公告)号:US09734434B2
公开(公告)日:2017-08-15
申请号:US15183629
申请日:2016-06-15
Applicant: Adobe Systems Incorporated
Inventor: Xiaohui Shen , Zhe Lin , Jonathan W. Brandt
CPC classification number: G06K9/6256 , G06F17/30247 , G06F17/30253 , G06K9/46 , G06K9/4676 , G06K9/469 , G06N5/04 , G06N99/005
Abstract: Feature interpolation techniques are described. In a training stage, features are extracted from a collection of training images and quantized into visual words. Spatial configurations of the visual words in the training images are determined and stored in a spatial configuration database. In an object detection stage, a portion of features of an image are extracted from the image and quantized into visual words. Then, a remaining portion of the features of the image are interpolated using the visual words and the spatial configurations of visual words stored in the spatial configuration database.
-
公开(公告)号:US20170228659A1
公开(公告)日:2017-08-10
申请号:US15082877
申请日:2016-03-28
Applicant: Adobe Systems Incorporated
Inventor: Zhe Lin , Jianchao Yang , Hailin Jin , Chen Fang
CPC classification number: G06N20/00 , G06F16/532 , G06F16/58 , G06N3/0454 , G06N3/08 , G06N5/04
Abstract: Certain embodiments involve learning features of content items (e.g., images) based on web data and user behavior data. For example, a system determines latent factors from the content items based on data including a user's text query or keyword query for a content item and the user's interaction with the content items based on the query (e.g., a user's click on a content item resulting from a search using the text query). The system uses the latent factors to learn features of the content items. The system uses a previously learned feature of the content items for iterating the process of learning features of the content items to learn additional features of the content items, which improves the accuracy with which the system is used to learn other features of the content items.
-
-
-
-
-
-
-
-
-