-
公开(公告)号:US20190236394A1
公开(公告)日:2019-08-01
申请号:US16376704
申请日:2019-04-05
申请人: Adobe Inc.
发明人: Brian Price , Scott Cohen , Mai Long , Jun Hao Liew
CPC分类号: G06K9/3241 , G06K9/4628 , G06K2009/366
摘要: Systems and methods are disclosed for selecting target objects within digital images utilizing a multi-modal object selection neural network trained to accommodate multiple input modalities. In particular, in one or more embodiments, the disclosed systems and methods generate a trained neural network based on training digital images and training indicators corresponding to various input modalities. Moreover, one or more embodiments of the disclosed systems and methods utilize a trained neural network and iterative user inputs corresponding to different input modalities to select target objects in digital images. Specifically, the disclosed systems and methods can transform user inputs into distance maps that can be utilized in conjunction with color channels and a trained neural network to identify pixels that reflect the target object.
-
公开(公告)号:US20190213751A1
公开(公告)日:2019-07-11
申请号:US16351912
申请日:2019-03-13
申请人: MAGIC LEAP, INC.
发明人: Gary R. Bradski
CPC分类号: G06T7/62 , G06K9/00597 , G06K9/00617 , G06K9/4628 , G06K2009/00939
摘要: A user identification system includes an image recognition network to analyze image data and generate shape data based on the image data. The system also includes a generalist network to analyze the shape data and generate general category data based on the shape data. The system further includes a specialist network to compare the general category data with a characteristic to generate narrow category data. Moreover, the system includes a classifier layer including a plurality of nodes to represent a classification decision based on the narrow category data.
-
3.
公开(公告)号:US20190206546A1
公开(公告)日:2019-07-04
申请号:US16296072
申请日:2019-03-07
申请人: Dirk Schneemann, LLC
发明人: Dirk Schneemann
IPC分类号: G16H30/40 , G06K9/00 , A61B5/00 , G06K9/32 , G06K9/46 , G06K9/62 , A61B5/11 , A61B5/16 , A61B5/055 , A61B5/107 , G06K9/66 , G06T7/11 , G06T7/00 , G16H50/20 , G06N3/08 , G06N3/04
CPC分类号: G16H30/40 , A61B5/0022 , A61B5/0075 , A61B5/0077 , A61B5/055 , A61B5/1077 , A61B5/1079 , A61B5/11 , A61B5/1103 , A61B5/163 , A61B5/167 , A61B5/7267 , A61B5/7275 , A61B5/7282 , A61B5/7485 , A61B6/03 , A61B6/032 , A61B6/5217 , A61B6/5247 , A61B8/08 , A61B8/5223 , A61B2576/02 , G06K9/00214 , G06K9/00228 , G06K9/00275 , G06K9/00281 , G06K9/00302 , G06K9/3233 , G06K9/4628 , G06K9/6254 , G06K9/627 , G06K9/66 , G06N3/04 , G06N3/0454 , G06N3/084 , G06N7/005 , G06T7/0016 , G06T7/11 , G16H20/70 , G16H50/20 , G16H50/70
摘要: A computer-implemented method for identifying character traits associated with a target subject includes acquiring image data of a target subject from an image data source, rendering a 3D image data set, comparing each of a plurality of regions of interest within the 3D image set to a historical image data set to identify active regions of interest, grouping subsets of the regions of interest into one or more convolutional feature layers, wherein each convolutional feature layer probabilistically maps to a pre-identified character trait, and applying a convolutional neural network model to the convolutional feature layers to identify a pattern of active regions of interest within each convolutional feature layer to predict whether a target subject possesses the pre-identified character trait.
-
公开(公告)号:US20190205738A1
公开(公告)日:2019-07-04
申请号:US15862369
申请日:2018-01-04
申请人: Tesla, Inc.
CPC分类号: G06N3/063 , G06K9/342 , G06K9/4628 , G06K9/66 , G06N20/00
摘要: Described herein are systems and methods that utilize a novel hardware-based pooling architecture to process the output of a convolution engine representing an output channel of a convolution layer in a convolutional neural network (CNN). The pooling system converts the output into a set of arrays and aligns them according to a pooling operation to generate a pooling result. In certain embodiments, this is accomplished by using an aligner that aligns, e.g., over a number of arithmetic cycles, an array of data in the output into rows and shifts the rows relative to each other. A pooler applies a pooling operation to a combination of a subset of data from each row to generate the pooling result.
-
5.
公开(公告)号:US20190197300A1
公开(公告)日:2019-06-27
申请号:US16290868
申请日:2019-03-02
发明人: Lin Yang , Patrick Z. Dong , Baohua Sun
CPC分类号: G06K9/00375 , G06K9/4628 , G06K9/66 , G06K9/72 , G06N3/08 , G06T1/005 , G06T7/246 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084 , G06T2207/30196 , G06T2207/30232
摘要: Methods of recognizing motions of an object in a video clip or an image sequence are disclosed. A plurality of frames are selected out of a video clip or an image sequence of interest. A text category is associated with each frame by applying an image classification technique with a trained deep-learning model for a set of categories containing various poses of an object within each frame. A “super-character” is formed by embedding respective text categories of the frames as corresponding ideograms in a 2-D symbol having multiple ideograms contained therein. Particular motion of the object is recognized by obtaining the meaning of the “super-character” with image classification of the 2-D symbol via a trained convolutional neural networks model for various motions of the object derived from specific sequential combinations of text categories. Ideograms may contain imagery data instead of text categories, e.g., detailed images or reduced-size images.
-
公开(公告)号:US20190171876A1
公开(公告)日:2019-06-06
申请号:US16258707
申请日:2019-01-28
申请人: PHOTOMYNE LTD.
发明人: Yair SEGALOVITZ , Omer SHOOR , Yaron LIPMAN , Nir TZEMAH , Natalie VERTER
IPC分类号: G06K9/00 , G06K9/22 , G06K9/32 , G06T5/00 , G06T3/40 , G06N3/08 , G06K9/46 , G06K9/66 , G06N3/04
CPC分类号: G06K9/00463 , G06K9/00442 , G06K9/00456 , G06K9/228 , G06K9/3233 , G06K9/4628 , G06K9/4676 , G06K9/66 , G06K2009/363 , G06N3/04 , G06N3/0454 , G06N3/08 , G06T3/4007 , G06T5/009 , G06T2207/10004 , G06T2207/10024 , G06T2207/20132 , G06T2207/20164 , G06T2207/30176
摘要: A method for cropping photos images captured by a user from an image of a page of a photo album is described. Corners in the page image are detected using corner detection algorithm or by detecting intersections of line-segments (and their extensions) in the image using edge, corner, or line detection techniques. Pairs of the detected corners are used to define all potential quads, which are then are qualified according to various criteria. A correlation matrix is generated for each potential pair of the qualified quads, and candidate quads are selected based on the Eigenvector of the correlation matrix. The content of the selected quads is checked using a salience map that may be based on a trained neuron network, and the resulting photos images are extracted as individual files for further handling or manipulation by the user.
-
公开(公告)号:US20190164020A1
公开(公告)日:2019-05-30
申请号:US15824389
申请日:2017-11-28
发明人: Shervin Sabripour
CPC分类号: G06K9/66 , G06K9/00228 , G06K9/00369 , G06K9/00664 , G06K9/4628 , G06K9/6267 , G06N20/00 , H04N5/2258 , H04N5/232
摘要: A portable electronic device and method. The portable electronic device includes a first camera, a second camera, an electronic processor, and one or more sensors. The electronic processor is configured to detect, based on information obtained from the one or more sensors, an incident and select a camera responsive to the incident. The electronic processor is further configured to capture an image using the selected camera and determine, within the image, a subject of interest, wherein the subject of interest is at least one selected from the group consisting of a person, an object, and an entity. The electronic processor is also configured to initiate an edge learning process on the subject of interest to create a classifier for use in identifying the subject of interest and transmit the classifier to a second portable electronic device within a predetermined distance from the portable electronic device.
-
公开(公告)号:US20190138820A1
公开(公告)日:2019-05-09
申请号:US16168217
申请日:2018-10-23
申请人: Robert Bosch GmbH
发明人: Michael Herman , Joerg Wagner , Volker Fischer
CPC分类号: G06K9/00791 , G06K9/00805 , G06K9/4628 , G06K9/6262 , G06K9/6271
摘要: A method for detecting an object with the aid of a system, the system having a plurality of modules connected in series one after the other, at least one connection between a predefinable module and its immediately following module being implemented in each case with the aid of a filter and storage module connected in series, the first module of the modules connected in series being connected to an acquisition unit, and the last module of the modules connected in series being connected to a detection module. The method includes temporal filtering.
-
公开(公告)号:US20190066331A1
公开(公告)日:2019-02-28
申请号:US15689514
申请日:2017-08-29
申请人: Trimble Inc.
发明人: Kent Kahle , Changlin Xiao
CPC分类号: G06T7/74 , B25F5/00 , G06K9/00201 , G06K9/4628 , G06K9/6271 , G06T7/11 , G06T7/248 , G06T7/292 , G06T2200/04 , G06T2207/10028 , G06T2207/20021 , G06T2207/20081 , G06T2207/30196 , G06T2207/30232
摘要: A tool in a scene is detected using one or more cameras by dividing an image into patches. Multiple different patch sizes are used in conjunction with deep learning to identify the tool in the image. After the tool is identified, a position of the tool in three dimensions is calculated using images from two or more cameras.
-
公开(公告)号:US20190065899A1
公开(公告)日:2019-02-28
申请号:US15710377
申请日:2017-09-20
申请人: Google Inc.
CPC分类号: G06K9/6215 , G06K9/4628 , G06K9/6232 , G06K9/6255 , G06K9/6256 , G06K9/6262 , G06K9/66 , G06N20/00
摘要: The present disclosure provides systems and methods that enable distance metric learning using proxies. A machine-learned distance model can be trained in a proxy space in which a loss function compares an embedding provided for an anchor data point of a training dataset to a positive proxy and one or more negative proxies, where each of the positive proxy and the one or more negative proxies serve as a proxy for two or more data points included in the training dataset. Thus, each proxy can approximate a number of data points, enabling faster convergence. According to another aspect, the proxies of the proxy space can themselves be learned parameters, such that the proxies and the model are trained jointly. Thus, the present disclosure enables faster convergence (e.g., reduced training time). The present disclosure provides example experiments which demonstrate a new state of the art on several popular training datasets.
-
-
-
-
-
-
-
-
-