-
公开(公告)号:US11429842B2
公开(公告)日:2022-08-30
申请号:US16396513
申请日:2019-04-26
Applicant: Microsoft Technology Licensing, LLC
Inventor: Lijuan Wang , Kevin Lin , Zicheng Liu , Kun Luo
Abstract: A computing system is provided. The computing system includes a processor configured to execute a convolutional neural network that has been trained, the convolutional neural network including a backbone network that is a concatenated pyramid network, a plurality of first head neural networks, and a plurality of second head neural networks. At the backbone network, the processor is configured to receive an input image as input and output feature maps extracted from the input image. The processor is configured to: process the feature maps using each of the first head neural networks to output corresponding keypoint heatmaps; process the feature maps using each of the second head neural networks to output corresponding part affinity field heatmaps; link the keypoints into one or more instances of virtual skeletons using the part affinity fields; and output the instances of the virtual skeletons.
-
公开(公告)号:US10957074B2
公开(公告)日:2021-03-23
申请号:US16261297
申请日:2019-01-29
Applicant: Microsoft Technology Licensing, LLC
Inventor: Hongli Deng , Zicheng Liu
Abstract: Examples are disclosed herein that relate to automatically calibrating cameras based on human detection. One example provides a computing system comprising instructions executable to receive image data comprising depth image data and two-dimensional image data of a space from a camera, detect a person in the space via the image data, determine a skeletal representation for the person via the image data, determine over a period of time a plurality of locations at which a reference point of the skeletal representation is on a ground area in the image data, determine a ground plane of the three-dimensional representation based upon the plurality of locations at which the reference point of the skeletal representation is on the ground area in the image data, and track a location of an object within the space relative to the ground plane.
-
公开(公告)号:US20200272812A1
公开(公告)日:2020-08-27
申请号:US16281876
申请日:2019-02-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Lijuan Wang , Zicheng Liu , Kevin Lin , Kun Luo
Abstract: A machine accesses a training data set comprising multiple real images and multiple synthetic images. The machine trains a joint prediction module to predict joint locations in visual data using the multiple real images. The machine trains a part affinity field prediction module to identify adjacent joints in visual data using the multiple real images. The machine trains the joint prediction module to predict joint locations in visual data using the multiple synthetic images. The machine trains the part affinity field prediction module to identify adjacent joints in visual data using the multiple synthetic images. The machine trains a body part prediction module to identify body parts in visual data using the multiple synthetic images. The machine provides a trained human body part segmentation module comprising the trained joint prediction module, the trained part affinity field prediction module, and the trained body part prediction module.
-
公开(公告)号:US10425627B2
公开(公告)日:2019-09-24
申请号:US15590501
申请日:2017-05-09
Applicant: Microsoft Technology Licensing, LLC
Inventor: Zhengyou Zhang , Zicheng Liu , Dieni Afonso Ferreira Florencio , Sasa Junuzovic
IPC: H04N9/31 , H04N17/04 , H04N17/00 , G09G3/00 , H04N21/4223 , H04N21/442 , G09G3/20 , H04N7/15 , H04N1/60 , H04N21/414
Abstract: An apparatus that automatically monitors a display device includes a photo sensor configured to receive light from a display screen of the display device. The photo sensor provides signals representing detected light levels to a processor. The processor is coupled to the display device and is configured to cause the display device to present a test sequence including a plurality of images on the display screen. The processor is configured to capture data from the photo sensor during the presentation of the test sequence and to compare the captured data to an expected sequence corresponding to the test sequence displayed by a well-functioning display. The processor is further configured to report any mismatch between the captured data and the expected sequence as a possible malfunction of the display device.
-
公开(公告)号:US20180239985A1
公开(公告)日:2018-08-23
申请号:US15958820
申请日:2018-04-20
Applicant: Microsoft Technology Licensing, LLC
Inventor: Chun-Te Chu , Jaeyeon Jung , Zicheng Liu , Ratul Mahajan
CPC classification number: G06K9/6201 , G06F17/3028 , G06K9/00711 , G06K9/52 , G06T7/292 , G06T2207/10016 , G06T2207/10024 , G06T2207/20036 , G06T2207/20081 , G06T2207/20224 , G06T2207/30232 , H04N7/181
Abstract: A technique for multi-camera object tracking is disclosed that preserves privacy of imagery from each camera or group of cameras. This technique uses secure multi-party computation to compute a distance metric across data from multiple cameras without revealing any information to operators of the cameras except whether or not an object was observed by both cameras. This is achieved by a distance metric learning technique that reduces the computing complexity of secure computation while maintaining object identification accuracy.
-
公开(公告)号:US09977991B2
公开(公告)日:2018-05-22
申请号:US14693791
申请日:2015-04-22
Applicant: Microsoft Technology Licensing, LLC
Inventor: Chun-Te Chu , Jaeyeon Jung , Zicheng Liu , Ratul Mahajan
CPC classification number: G06K9/6201 , G06F17/3028 , G06K9/00711 , G06K9/52 , G06T7/292 , G06T2207/10016 , G06T2207/10024 , G06T2207/20036 , G06T2207/20081 , G06T2207/20224 , G06T2207/30232 , H04N7/181
Abstract: A technique for multi-camera object tracking is disclosed that preserves privacy of imagery from each camera or group of cameras. This technique uses secure multi-party computation to compute a distance metric across data from multiple cameras without revealing any information to operators of the cameras except whether or not an object was observed by both cameras. This is achieved by a distance metric learning technique that reduces the computing complexity of secure computation while maintaining object identification accuracy.
-
公开(公告)号:US12106531B2
公开(公告)日:2024-10-01
申请号:US17383362
申请日:2021-07-22
Applicant: Microsoft Technology Licensing, LLC
Inventor: Lijuan Wang , Zicheng Liu , Ying Jin , Hongli Deng , Kun Luo , Pei Yu , Yinpeng Chen
CPC classification number: G06V10/22 , G06T7/70 , G06V40/10 , G06T2207/30196
Abstract: To improve the accuracy and efficiency of object detection through computer digital image analysis, the detection of some objects can inform the sub-portion of the digital image to which subsequent computer digital image analysis is directed to detect other objects. In such a manner object detection can be made more efficient by limiting the image area of a digital image that is analyzed. Such efficiencies can represent both computational efficiencies and communicational efficiencies arising due to the smaller quantity of digital image data that is analyzed. Additionally, the detection of some objects can render the detection of other objects more accurate by adjusting confidence thresholds based on the detection of those related objects. Relationships between objects can be utilized to inform both the image area on which subsequent object detection is performed and the confidence level of such subsequent object detection.
-
公开(公告)号:US11488325B2
公开(公告)日:2022-11-01
申请号:US16904498
申请日:2020-06-17
Applicant: Microsoft Technology Licensing, LLC
Inventor: Hongli Deng , Ryan Savio Menezes , Gabriel Blanco Saldana , Zicheng Liu
Abstract: Techniques for improved camera calibration are disclosed. An image is analyzed to identify a first set of key points for an object. A virtual object is generated. The virtual object has a second set of key points. A reprojected version of the second set is fitted to the first set in 2D space until a fitting threshold is satisfied. To do so, a 3D alignment of the second set is generated in an attempt to fit (e.g., in 2D space) the second set to the first set. Another operation includes reprojecting the second set into 2D space. In response to comparing the reprojected second set to the first set, another operation includes determining whether a fitting error between those sets satisfies the fitting threshold. A specific 3D alignment of the second set is selected. The camera is calibrated based on resulting reprojection parameters.
-
公开(公告)号:US11238300B2
公开(公告)日:2022-02-01
申请号:US16688956
申请日:2019-11-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Nikolaos Karianakis , Zicheng Liu , Yinpeng Chen
Abstract: An object re-identifier. For each of a plurality of frames of a video, a quality of the frame is assessed and a confidence that a previously-recognized object is present in the frame is determined. The determined confidence for the frame is weighted based on the assessed quality of the frame such that frames with higher relative quality are weighted more heavily than frames with lower relative quality. An overall confidence that the previously-recognized object is present in the video is assessed based on the weighted determined confidences.
-
公开(公告)号:US09864430B2
公开(公告)日:2018-01-09
申请号:US14593955
申请日:2015-01-09
Applicant: Microsoft Technology Licensing, LLC
Inventor: Dijia Wu , Michael J. Conrad , Tim Burrell , Xu Miao , Zicheng Liu , Qin Cai , Zhengyou Zhang
CPC classification number: G06F3/013 , G06F3/0304 , G06F3/038 , G06K9/0061
Abstract: Examples are disclosed herein that are related to gaze tracking via image data. One example provides, on a gaze tracking system comprising an image sensor, a method of determining a gaze direction, the method comprising acquiring image data via the image sensor, detecting in the image data facial features of a human subject, determining an eye rotation center based upon the facial features using a calibrated face model, determining an estimated position of a center of a lens of an eye from the image data, determining an optical axis based upon the eye rotation center and the estimated position of the center of the lens, determining a visual axis by applying an adjustment to the optical axis, determining the gaze direction based upon the visual axis, and providing an output based upon the gaze direction.
-
-
-
-
-
-
-
-
-