-
1.
公开(公告)号:US20190251441A1
公开(公告)日:2019-08-15
申请号:US15895795
申请日:2018-02-13
Applicant: ADOBE SYSTEMS INCORPORATED
CPC classification number: G06N3/082 , G06N3/0481
Abstract: The architectural complexity of a neural network is reduced by selectively pruning channels. A cost metric for a convolution layer is determined. The cost metric indicates a resource cost per channel for the channels of the layer. Training the neural network includes, for channels of the layer, updating a channel-scaling coefficient based on the cost metric. The channel-scaling coefficient linearly scales the output of the channel. A constant channel is identified based on the channel-scaling coefficients. The neural network is updated by pruning the constant channel. Model weights are updated via a stochastic gradient descent of a training loss function evaluated on training data. The channel-scaling coefficients are updated via an iterative-thresholding algorithm that penalizes a batch normalization loss function based on the cost metric for the layer and a norm of the channel-scaling coefficients. When the layer is batch normalized, the channel-scaling coefficients are batch normalization scaling coefficients.
-
公开(公告)号:US20190147224A1
公开(公告)日:2019-05-16
申请号:US15815635
申请日:2017-11-16
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: HAOXIANG LI , ZHE LIN , JONATHAN BRANDT , XIAOHUI SHEN
Abstract: Approaches are described for determining facial landmarks in images. An input image is provided to at least one trained neural network that determines a face region (e.g., bounding box of a face) of the input image and initial facial landmark locations corresponding to the face region. The initial facial landmark locations are provided to a 3D face mapper that maps the initial facial landmark locations to a 3D face model. A set of facial landmark locations are determined from the 3D face model. The set of facial landmark locations are provided to a landmark location adjuster that adjusts positions of the set of facial landmark locations based on the input image. The input image is presented on a user device using the adjusted set of facial landmark locations.
-
公开(公告)号:US20170097948A1
公开(公告)日:2017-04-06
申请号:US15002179
申请日:2016-01-20
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: BERNARD JAMES KERR , ZHE LIN , PATRICK REYNOLDS , BALDO FAIETA
IPC: G06F17/30 , G06F3/0482 , G06F3/0484
CPC classification number: G06F16/532 , G06F3/0482 , G06F3/04842 , G06F16/56 , G06F16/583 , G06F16/5838 , G06N3/08 , G06N5/022
Abstract: In various implementations, specific attributes found in images can be used in a visual-based search. Utilizing machine learning, deep neural networks, and other computer vision techniques, attributes of images, such as color, composition, font, style, and texture can be extracted from a given image. A user can then select a specific attribute from a sample image the user is searching for and the search can be refined to focus on that specific attribute from the sample image. In some embodiments, the search includes specific attributes from more than one image.
-
公开(公告)号:US20180267996A1
公开(公告)日:2018-09-20
申请号:US15463757
申请日:2017-03-20
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: ZHE LIN , XIAOHUI SHEN , JIANMING ZHANG , HAILIN JIN , YINGWEI LI
CPC classification number: G06F16/5866 , G06F16/532 , G06F16/951 , G06K9/00684 , G06K9/4628 , G06K9/4676 , G06K9/6248 , G06K9/6273 , G06K9/66 , G06N3/04 , G06N3/0454 , G06N3/08 , G06T7/33 , G06T11/60 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084
Abstract: A framework is provided for associating dense images with topics. The framework is trained utilizing images, each having multiple regions, multiple visual characteristics and multiple keyword tags associated therewith. For each region of each image, visual features are computed from the visual characteristics utilizing a convolutional neural network, and an image feature vector is generated from the visual features. The keyword tags are utilized to generate a weighted word vector for each image by calculating a weighted average of word vector representations representing keyword tags associated with the image. The image feature vector and the weighted word vector are aligned in a common embedding space and a heat map is computed for the image. Once trained, the framework can be utilized to automatically tag images and rank the relevance of images with respect to queried keywords based upon associated heat maps.
-
公开(公告)号:US20170236055A1
公开(公告)日:2017-08-17
申请号:US15094633
申请日:2016-04-08
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: ZHE LIN , XIAOHUI SHEN , JONATHAN BRANDT , JIANMING ZHANG , CHEN FANG
CPC classification number: G06N3/08 , G06F17/30247 , G06N3/0454 , G06N3/0472 , G06N99/005
Abstract: Embodiments of the present invention provide an automated image tagging system that can predict a set of tags, along with relevance scores, that can be used for keyword-based image retrieval, image tag proposal, and image tag auto-completion based on user input. Initially, during training, a clustering technique is utilized to reduce cluster imbalance in the data that is input into a convolutional neural network (CNN) for training feature data. In embodiments, the clustering technique can also be utilized to compute data point similarity that can be utilized for tag propagation (to tag untagged images). During testing, a diversity based voting framework is utilized to overcome user tagging biases. In some embodiments, bigram re-weighting can down-weight a keyword that is likely to be part of a bigram based on a predicted tag set.
-
6.
公开(公告)号:US20170004383A1
公开(公告)日:2017-01-05
申请号:US14788113
申请日:2015-06-30
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: ZHE LIN , JONATHAN BRANDT , XIAOHUI SHEN , JAE-PIL HEO , JIANCHAO YANG
CPC classification number: G06F17/30268 , G06F17/30277 , G06K9/6215 , G06T2207/20084
Abstract: In various implementations, a personal asset management application is configured to perform operations that facilitate the ability to search multiple images, irrespective of the images having characterizing tags associated therewith or without, based on a simple text-based query. A first search is conducted by processing a text-based query to produce a first set of result images used to further generate a visually-based query based on the first set of result images. A second search is conducted employing the visually-based query that was based on the first set of result images received in accordance with the first search conducted and based on the text-based query. The second search can generate a second set of result images, each having visual similarity to at least one of the images generated for the first set of result images.
Abstract translation: 在各种实现中,个人资产管理应用被配置为执行操作,其便于搜索多个图像的能力,而不管基于简单的基于文本的查询,具有与其相关联的或不具有特征标签的图像。 通过处理基于文本的查询以产生用于基于第一组结果图像进一步生成基于视觉的查询的第一组结果图像来进行第一搜索。 使用基于基于根据所进行的第一次搜索接收的第一组结果图像并基于基于文本的查询的基于视觉的查询进行第二搜索。 第二搜索可以产生第二组结果图像,每个结果图像与对于第一组结果图像生成的图像中的至少一个图像具有视觉相似性。
-
公开(公告)号:US20180267997A1
公开(公告)日:2018-09-20
申请号:US15463769
申请日:2017-03-20
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: ZHE LIN , XIAOHUI SHEN , JIANMING ZHANG , HAILIN JIN , YINGWEI LI
CPC classification number: G06F17/30268 , G06F17/30277 , G06F17/30864 , G06K9/00684 , G06K9/66 , G06N3/04 , G06N3/08 , G06T7/33 , G06T11/60 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084
Abstract: A framework is provided for associating images with topics utilizing embedding learning. The framework is trained utilizing images, each having multiple visual characteristics and multiple keyword tags associated therewith. Visual features are computed from the visual characteristics utilizing a convolutional neural network and an image feature vector is generated therefrom. The keyword tags are utilized to generate a weighted word vector (or “soft topic feature vector”) for each image by calculating a weighted average of word vector representations that represent the keyword tags associated with the image. The image feature vector and the soft topic feature vector are aligned in a common embedding space and a relevancy score is computed for each of the keyword tags. Once trained, the framework can automatically tag images and a text-based search engine can rank image relevance with respect to queried keywords based upon predicted relevancy scores.
-
公开(公告)号:US20170236032A1
公开(公告)日:2017-08-17
申请号:US15043174
申请日:2016-02-12
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: ZHE LIN , XIAOHUI SHEN , JONATHAN BRANDT , JIANMING ZHANG , CHEN FANG
CPC classification number: G06K9/623 , G06F16/24578 , G06F16/285 , G06F16/51 , G06F16/583 , G06K9/4628 , G06K9/6223 , G06K9/6262 , G06K9/6276 , G06N3/0454 , G06N3/08 , G06N20/10
Abstract: Embodiments of the present invention provide an automated image tagging system that can predict a set of tags, along with relevance scores, that can be used for keyword-based image retrieval, image tag proposal, and image tag auto-completion based on user input. Initially, during training, a clustering technique is utilized to reduce cluster imbalance in the data that is input into a convolutional neural network (CNN) for training feature data. In embodiments, the clustering technique can also be utilized to compute data point similarity that can be utilized for tag propagation (to tag untagged images). During testing, a diversity based voting framework is utilized to overcome user tagging biases. In some embodiments, bigram re-weighting can down-weight a keyword that is likely to be part of a bigram based on a predicted tag set.
-
公开(公告)号:US20170109873A1
公开(公告)日:2017-04-20
申请号:US15392162
申请日:2016-12-28
Applicant: Adobe Systems Incorporated
Inventor: Jianchao Yang , ZHE LIN
CPC classification number: G06T5/50 , G06K9/46 , G06K9/4609 , G06K9/4647 , G06K9/6259 , G06T3/4053 , G06T3/4076 , G06T5/002 , G06T5/003 , G06T2207/10024 , G06T2207/20192 , G06T2207/20216
Abstract: Systems and methods are provided for image enhancement using self-examples in combination with external examples. In one embodiment, an image manipulation application receives an input image patch of an input image. The image manipulation application determines a first weight for an enhancement operation using self-examples and a second weight for an enhancement operation using external examples. The image manipulation application generates a first interim output image patch by applying the enhancement operation using self-examples to the input image patch and a second interim output image patch by applying the enhancement operation using external examples to the input image patch. The image manipulation application generates an output image patch by combining the first and second interim output image patches as modified using the first and second weights.
-
公开(公告)号:US20160034788A1
公开(公告)日:2016-02-04
申请号:US14447296
申请日:2014-07-30
Applicant: ADOBE SYSTEMS INCORPORATED
Inventor: ZHE LIN , HAILIN JIN , JIANCHAO YANG
CPC classification number: G06T7/33 , G06K9/627 , G06N3/0454
Abstract: A first set of attributes (e.g., style) is generated through pre-trained single column neural networks and leveraged to regularize the training process of a regularized double-column convolutional neural network (RDCNN). Parameters of the first column (e.g., style) of the RDCNN are fixed during RDCNN training Parameters of the second column (e.g., aesthetics) are fine-tuned while training the RDCNN and the learning process is supervised by the label identified by the second column (e.g., aesthetics). Thus, features of the images may be leveraged to boost classification accuracy of other features by learning a RDCNN.
Abstract translation: 通过预训练的单列神经网络产生第一组属性(例如,样式),并且利用正则化的双列卷积神经网络(RDCNN)的训练过程。 在RDCNN训练期间RDCNN的第一列(例如,样式)的参数是固定的在第二列的参数(例如,美学)中进行微调,同时训练RDCNN,学习过程由第二列标识的标签 (如美学)。 因此,可以利用图像的特征来通过学习RDCNN来提高其他特征的分类精度。
-
-
-
-
-
-
-
-
-