-
公开(公告)号:US20240257316A1
公开(公告)日:2024-08-01
申请号:US18615050
申请日:2024-03-25
Applicant: Intel Corporation
Inventor: Anbang Yao , Ming Lu , Yikai Wang , Shandong Wang , Yurong Chen , Sungye Kim , Attila Tamas Afra
CPC classification number: G06T5/50 , G06N3/02 , G06T7/13 , G06V40/161 , G06V40/171 , G06T2207/20084 , G06T2207/30201
Abstract: The present disclosure provides an apparatus and method of guided neural network model for image processing. An apparatus may comprise a guidance map generator, a synthesis network and an accelerator. The guidance map generator may receive a first image as a content image and a second image as a style image, and generate a first plurality of guidance maps and a second plurality of guidance maps, respectively from the first image and the second image. The synthesis network may synthesize the first plurality of guidance maps and the second plurality of guidance maps to determine guidance information. The accelerator may generate an output image by applying the style of the second image to the first image based on the guidance information.
-
公开(公告)号:US11972545B2
公开(公告)日:2024-04-30
申请号:US17482998
申请日:2021-09-23
Applicant: Intel Corporation
Inventor: Anbang Yao , Ming Lu , Yikai Wang , Shandong Wang , Yurong Chen , Sungye Kim , Attila Tamas Afra
CPC classification number: G06T5/50 , G06N3/02 , G06T7/13 , G06V40/161 , G06V40/171 , G06T2207/20084 , G06T2207/30201
Abstract: The present disclosure provides an apparatus and method of guided neural network model for image processing. An apparatus may comprise a guidance map generator, a synthesis network and an accelerator. The guidance map generator may receive a first image as a content image and a second image as a style image, and generate a first plurality of guidance maps and a second plurality of guidance maps, respectively from the first image and the second image. The synthesis network may synthesize the first plurality of guidance maps and the second plurality of guidance maps to determine guidance information. The accelerator may generate an output image by applying the style of the second image to the first image based on the guidance information.
-
公开(公告)号:US20230298204A1
公开(公告)日:2023-09-21
申请号:US18000389
申请日:2020-06-26
Applicant: Intel Corporation
Inventor: Shandong Wang , Yangyuxuan Kang , Anbang Yao , Ming Lu , Yurong Chen
CPC classification number: G06T7/74 , G06T17/00 , G06T2207/20084 , G06T2207/10016 , G06T2207/20081 , G06T2207/30244 , G06T2207/30196
Abstract: Apparatus and methods for three-dimensional pose estimation are disclosed herein. An example apparatus includes an image synchronizer to synchronize a first image generated by a first image capture device and a second image generated by a second image capture device, the first image and the second image including a subject; a two-dimensional pose detector to predict first positions of keypoints of the subject based on the first image and by executing a first neural network model to generate first two-dimensional data and predict second positions of the keypoints based on the second image and by executing the first neural network model to generate second two-dimensional data; and a three-dimensional pose calculator to generate a three-dimensional graphical model representing a pose of the subject in the first image and the second image based on the first two-dimensional data, the second two-dimensional data, and by executing a second neural network model.
-
公开(公告)号:US11537851B2
公开(公告)日:2022-12-27
申请号:US16475075
申请日:2017-04-07
Applicant: INTEL CORPORATION
Inventor: Yiwen Guo , Anbang Yao , Dongqi Cai , Libin Wang , Lin Xu , Ping Hu , Shandong Wang , Wenhua Cheng , Yurong Chen
Abstract: Methods and systems are disclosed using improved training and learning for deep neural networks. In one example, a deep neural network includes a plurality of layers, and each layer has a plurality of nodes. The nodes of each L layer in the plurality of layers are randomly connected to nodes of an L+1 layer. The nodes of each L+1 layer are connected to nodes in a subsequent L layer in a one-to-one manner. Parameters related to the nodes of each L layer are fixed. Parameters related to the nodes of each L+1 layers are updated. In another example, inputs for the input layer and labels for the output layer of a deep neural network are determined related to a first sample. A similarity between different pairs of inputs and labels is estimated using a Gaussian regression process.
-
公开(公告)号:US20220222492A1
公开(公告)日:2022-07-14
申请号:US17584216
申请日:2022-01-25
Applicant: Intel Corporation
Inventor: Yiwen GUO , Yuqing Hou , Anbang Yao , Dongqi Cai , Lin Xu , Ping Hu , Shandong Wang , Wenhua Cheng , Yurong Chen , Libin Wang
Abstract: Methods and systems for budgeted and simplified training of deep neural networks (DNNs) are disclosed. In one example, a trainer is to train a DNN using a plurality of training sub-images derived from a down-sampled training image. A tester is to test the trained DNN using a plurality of testing sub-images derived from a down-sampled testing image. In another example, in a recurrent deep Q-network (RDQN) having a local attention mechanism located between a convolutional neural network (CNN) and a long-short time memory (LSTM), a plurality of feature maps are generated by the CNN from an input image. Hard-attention is applied by the local attention mechanism to the generated plurality of feature maps by selecting a subset of the generated feature maps. Soft attention is applied by the local attention mechanism to the selected subset of generated feature maps by providing weights to the selected subset of generated feature maps in obtaining weighted feature maps. The weighted feature maps are stored in the LSTM. A Q value is calculated for different actions based on the weighted feature maps stored in the LSTM.
-
公开(公告)号:US11263490B2
公开(公告)日:2022-03-01
申请号:US16475078
申请日:2017-04-07
Applicant: INTEL CORPORATION
Inventor: Yiwen Guo , Yuqing Hou , Anbang Yao , Dongqi Cai , Lin Xu , Ping Hu , Shandong Wang , Wenhua Cheng , Yurong Chen , Libin Wang
Abstract: Methods and systems for budgeted and simplified training of deep neural networks (DNNs) are disclosed. In one example, a trainer is to train a DNN using a plurality of training sub-images derived from a down-sampled training image. A tester is to test the trained DNN using a plurality of testing sub-images derived from a down-sampled testing image. In another example, in a recurrent deep Q-network (RDQN) having a local attention mechanism located between a convolutional neural network (CNN) and a long-short time memory (LSTM), a plurality of feature maps are generated by the CNN from an input image. Hard-attention is applied by the local attention mechanism to the generated plurality of feature maps by selecting a subset of the generated feature maps. Soft attention is applied by the local attention mechanism to the selected subset of generated feature maps by providing weights to the selected subset of generated feature maps in obtaining weighted feature maps. The weighted feature maps are stored in the LSTM. A Q value is calculated for different actions based on the weighted feature maps stored in the LSTM.
-
公开(公告)号:US11176632B2
公开(公告)日:2021-11-16
申请号:US16474540
申请日:2017-04-07
Applicant: INTEL CORPORATION
Inventor: Anbang Yao , Dongqi Cai , Libin Wang , Lin Xu , Ping Hu , Shandong Wang , Wenhua Cheng , Yiwen Guo , Liu Yang , Yuqing Hou , Zhou Su
Abstract: Described herein are advanced artificial intelligence agents for modeling physical interactions. An apparatus to provide an active artificial intelligence (AI) agent includes at least one database to store physical interaction data and compute cluster coupled to the at least one database. The compute cluster automatically obtains physical interaction data from a data collection module without manual interaction, stores the physical interaction data in the at least one database, and automatically trains diverse sets of machine learning program units to simulate physical interactions with each individual program unit having a different model based on the applied physical interaction data.
-
公开(公告)号:US11151361B2
公开(公告)日:2021-10-19
申请号:US16471106
申请日:2017-01-20
Applicant: INTEL CORPORATION
Inventor: Anbang Yao , Dongqi Cai , Ping Hu , Shandong Wang , Yurong Chen
Abstract: An apparatus for dynamic emotion recognition in unconstrained scenarios is described herein. The apparatus comprises a controller to pre-process image data and a phase-convolution mechanism to build lower levels of a CNN such that the filters form pairs in phase. The apparatus also comprises a phase-residual mechanism configured to build middle layers of the CNN via plurality of residual functions and an inception-residual mechanism to build top layers of the CNN by introducing multi-scale feature extraction. Further, the apparatus comprises a fully connected mechanism to classify extracted features.
-
公开(公告)号:US20210201078A1
公开(公告)日:2021-07-01
申请号:US16475079
申请日:2017-04-07
Applicant: INTEL CORPORATION
Inventor: Anbang Yao , Shandong Wang , Wenhua Cheng , Dongqi Cai , Libin Wang , Lin Xu , Ping Hu , Yiwen Guo , Liu Yang , Yuging Hou , Zhou Su , Yurong Chen
Abstract: Methods and systems for advanced and augmented training of deep neural networks (DNNs) using synthetic data and innovative generative networks. A method includes training a DNN using synthetic data, training a plurality of DNNs using context data, associating features of the DNNs trained using context data with features of the DNN trained with synthetic data, and generating an augmented DNN using the associated features.
-
公开(公告)号:US10818064B2
公开(公告)日:2020-10-27
申请号:US16327779
申请日:2016-09-21
Applicant: Intel Corporation
Inventor: Shandong Wang , Ming Lu , Anbang Yao , Yurong Chen
Abstract: Techniques related to estimating accurate face shape and texture from an image having a representation of a human face are discussed. Such techniques may include determining shape parameters that optimize a linear spatial cost model based on 2D landmarks, 3D landmarks, and camera and pose parameters, determining texture parameters that optimize a linear texture estimation cost model, and refining the shape parameters by optimizing a nonlinear pixel intensity cost function.
-
-
-
-
-
-
-
-
-