-
公开(公告)号:US20240127408A1
公开(公告)日:2024-04-18
申请号:US18514252
申请日:2023-11-20
Applicant: Intel Corporation
Inventor: Anbang Yao , Ming Lu , Yikai Wang , Xiaoming Chen , Junjie Huang , Tao Lv , Yuanke Luo , Yi Yang , Feng Chen , Zhiming Wang , Zhiqiao Zheng , Shandong Wang
CPC classification number: G06T5/002 , G06N3/04 , G06T2207/20081 , G06T2207/20084
Abstract: Embodiments are generally directed to an adaptive deformable kernel prediction network for image de-noising. An embodiment of a method for de-noising an image by a convolutional neural network implemented on a compute engine, the image including a plurality of pixels, the method comprising: for each of the plurality of pixels of the image, generating a convolutional kernel having a plurality of kernel values for the pixel; generating a plurality of offsets for the pixel respectively corresponding to the plurality of kernel values, each of the plurality of offsets to indicate a deviation from a pixel position of the pixel; determining a plurality of deviated pixel positions based on the pixel position of the pixel and the plurality of offsets; and filtering the pixel with the convolutional kernel and pixel values of the plurality of deviated pixel positions to obtain a de-noised pixel.
-
公开(公告)号:US20210142448A1
公开(公告)日:2021-05-13
申请号:US17090170
申请日:2020-11-05
Applicant: Intel Corporation
Inventor: Anbang Yao , Ming Lu , Yikai Wang , Xiaoming Chen , Junjie Huang , Tao Lv , Yuanke Luo , Yi Yang , Feng Chen , Zhiming Wang , Zhiqiao Zheng , Shandong Wang
Abstract: Embodiments are generally directed to an adaptive deformable kernel prediction network for image de-noising. An embodiment of a method for de-noising an image by a convolutional neural network implemented on a compute engine, the image including a plurality of pixels, the method comprising: for each of the plurality of pixels of the image, generating a convolutional kernel having a plurality of kernel values for the pixel; generating a plurality of offsets for the pixel respectively corresponding to the plurality of kernel values, each of the plurality of offsets to indicate a deviation from a pixel position of the pixel; determining a plurality of deviated pixel positions based on the pixel position of the pixel and the plurality of offsets; and filtering the pixel with the convolutional kernel and pixel values of the plurality of deviated pixel positions to obtain a de-noised pixel.
-
公开(公告)号:US20190228556A1
公开(公告)日:2019-07-25
申请号:US16327779
申请日:2016-09-21
Applicant: Intel Corporation
Inventor: Shandong Wang , Ming Lu , Anbang Yao , Yurong Chen
Abstract: Techniques related to estimating accurate face shape and texture from an image having a representation of a human face are discussed. Such techniques may include determining shape parameters that optimize a linear spatial cost model based on 2D landmarks, 3D landmarks, and camera and pose parameters, determining texture parameters that optimize a linear texture estimation cost model, and refining the shape parameters by optimizing a nonlinear pixel intensity cost function.
-
公开(公告)号:US12079914B2
公开(公告)日:2024-09-03
申请号:US17914314
申请日:2020-04-23
Applicant: Intel Corporation
Inventor: Shandong Wang , Yangyuxuan Kang , Anbang Yao , Ming Lu , Yurong Chen
CPC classification number: G06T13/40 , G06T7/70 , G06T7/251 , G06T17/00 , G06T2207/10024 , G06T2207/20081 , G06T2207/20084 , G06T2207/30196
Abstract: Techniques are disclosed for providing improved pose tracking of a subject using a 2D camera and generating a 3D image that recreates the pose of the subject. A 3D skeleton map is estimated from a 2D skeleton map of the subject using, for example, a neural network. A template 3D skeleton map is accessed or generated having bone segments that have lengths set using, for instance, anthropometry statistics based on a given height of the template 3D skeleton map. An improved 3D skeleton map is then produced by at least retargeting one or more of the plurality of bone segments of the estimated 3D skeleton map to more closely match the corresponding template bone segments of the template 3D skeleton map. The improved 3D skeleton map can then be animated in various ways (e.g., using various skins or graphics) to track corresponding movements of the subject.
-
公开(公告)号:US11790223B2
公开(公告)日:2023-10-17
申请号:US16475076
申请日:2017-04-07
Applicant: INTEL CORPORATION
Inventor: Libin Wang , Yiwen Guo , Anbang Yao , Dongqi Cai , Lin Xu , Ping Hu , Shandong Wang , Wenhua Cheng , Yurong Chen
CPC classification number: G06N3/08 , G06F18/217 , G06F18/2148 , G06N3/045 , G06N3/063 , G06T1/20
Abstract: Methods and systems are disclosed for boosting deep neural networks for deep learning. In one example, in a deep neural network including a first shallow network and a second shallow network, a first training sample is processed by the first shallow network using equal weights. A loss for the first shallow network is determined based on the processed training sample using equal weights. Weights for the second shallow network are adjusted based on the determined loss for the first shallow network. A second training sample is processed by the second shallow network using the adjusted weights. In another example, in a deep neural network including a first weak network and a second weak network, a first subset of training samples is processed by the first weak network using initialized weights. A classification error for the first weak network on the first subset of training samples is determined. The second weak network is boosted using the determined classification error of the first weak network with adjusted weights. A second subset of training samples is processed by the second weak network using the adjusted weights.
-
公开(公告)号:US11551335B2
公开(公告)日:2023-01-10
申请号:US16474848
申请日:2017-04-07
Applicant: Intel Corporation
Inventor: Lin Xu , Liu Yang , Anbang Yao , Dongqi Cai , Libin Wang , Ping Hu , Shandong Wang , Wenhua Cheng , Yiwen Guo , Yurong Chen
Abstract: Methods and systems are disclosed using camera devices for deep channel and Convolutional Neural Network (CNN) images and formats. In one example, image values are captured by a color sensor array in an image capturing device or camera. The image values provide color channel data. The captured image values by the color sensor array are input to a CNN having at least one CNN layer. The CNN provides CNN channel data for each layer. The color channel data and CNN channel data is to form a deep channel image that stored in a memory. In another example, image values are captured by sensor array. The captured image values by the sensor array are input a CNN having a first CNN layer. An output is generated at the first CNN layer using the captured image values by the color sensor array. The output of the first CNN layer is stored as a feature map of the captured image.
-
公开(公告)号:US11341368B2
公开(公告)日:2022-05-24
申请号:US16475079
申请日:2017-04-07
Applicant: INTEL CORPORATION
Inventor: Anbang Yao , Shandong Wang , Wenhua Cheng , Dongqi Cai , Libin Wang , Lin Xu , Ping Hu , Yiwen Guo , Liu Yang , Yuqing Hou , Zhou Su , Yurong Chen
Abstract: Methods and systems for advanced and augmented training of deep neural networks (DNNs) using synthetic data and innovative generative networks. A method includes training a DNN using synthetic data, training a plurality of DNNs using context data, associating features of the DNNs trained using context data with features of the DNN trained with synthetic data, and generating an augmented DNN using the associated features.
-
公开(公告)号:US11107189B2
公开(公告)日:2021-08-31
申请号:US16474927
申请日:2017-04-07
Applicant: INTEL CORPORATION
Inventor: Shandong Wang , Yiwen Guo , Anbang Yao , Dongqi Cai , Libin Wang , Lin Xu , Ping Hu , Wenhua Cheng , Yurong Chen
IPC: G06K9/00 , G06T3/40 , G06N20/20 , G06N20/10 , G06K9/62 , G06N3/04 , G06N3/08 , G06N5/04 , G06T1/20
Abstract: Methods and systems are disclosed using improved Convolutional Neural Networks (CNN) for image processing. In one example, an input image is down-sampled into smaller images with a smaller resolution than the input image. The down-sampled smaller images are processed by a CNN having a last layer with a reduced number of nodes than a last layer of a full CNN used to process the input image at a full resolution. A result is outputted based on the processed down-sampled smaller images by the CNN having a last layer with a reduced number of nodes. In another example, shallow CNN networks are built randomly. The randomly built shallow CNN networks are combined to imitate a trained deep neural network (DNN).
-
公开(公告)号:US20200279156A1
公开(公告)日:2020-09-03
申请号:US16645425
申请日:2017-10-09
Applicant: INTEL CORPORATION
Inventor: Dongqi Cai , Anbang Yao , Ping Hu , Shandong Wang , Yurong Chen
Abstract: A system to perform multi-modal analysis has at least three distinct characteristics: an early abstraction layer for each data modality integrating homogeneous feature cues coming from different deep learning architectures for that data modality, a late abstraction layer for further integrating heterogeneous features extracted from different models or data modalities and output from the early abstraction layer, and a propagation-down strategy for joint network training in an end-to-end manner. The system is thus able to consider correlations among homogeneous features and correlations among heterogenous features at different levels of abstraction. The system further extracts and fuses discriminative information contained in these models and modalities for high performance emotion recognition.
-
公开(公告)号:US20230154092A1
公开(公告)日:2023-05-18
申请号:US17914314
申请日:2020-04-23
Applicant: Intel Corporation
Inventor: Shandong Wang , Yangyuxuan Kang , Anbang Yao , Ming Lu , Yurong Chen
CPC classification number: G06T13/40 , G06T7/70 , G06T2207/30196 , G06T2207/10024 , G06T2207/20084
Abstract: Techniques are disclosed for providing improved pose tracking of a subject using a 2D camera and generating a 3D image that recreates the pose of the subject. A 3D skeleton map is estimated from a 2D skeleton map of the subject using, for example, a neural network. A template 3D skeleton map is accessed or generated having bone segments that have lengths set using, for instance, anthropometry statistics based on a given height of the template 3D skeleton map. An improved 3D skeleton map is then produced by at least retargeting one or more of the plurality of bone segments of the estimated 3D skeleton map to more closely match the corresponding template bone segments of the template 3D skeleton map. The improved 3D skeleton map can then be animated in various ways (e.g., using various skins or graphics) to track corresponding movements of the subject.
-
-
-
-
-
-
-
-
-