-
公开(公告)号:US10909349B1
公开(公告)日:2021-02-02
申请号:US16450499
申请日:2019-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Shashank Tripathi , Visesh Chari , Ambrish Tyagi , Amit Kumar Agrawal , James Rehg , Siddhartha Chandra
Abstract: Techniques are generally described for object detection in image data. First image data comprising a three-dimensional model representing an object may be received. First background image data comprising a first plurality of pixel values may be received. A first feature vector representing the three-dimensional model may be generated. A second feature vector representing the first plurality of pixel values of the first background image data may be generated. A first machine learning model may generate a transformed representation of the three-dimensional model using the first feature vector. First foreground image data comprising a two-dimensional representation of the transformed representation of the three-dimensional model may be generated. A frame of composite image data may be generated by combining the first foreground image data with the first background image data.
-
公开(公告)号:US11631260B1
公开(公告)日:2023-04-18
申请号:US17132738
申请日:2020-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Shashank Tripathi , Visesh Chari , Ambrish Tyagi , Amit Kumar Agrawal , James Rehg , Siddhartha Chandra
Abstract: Techniques are generally described for object detection in image data. First image data comprising a three-dimensional model representing an object may be received. First background image data comprising a first plurality of pixel values may be received. A first feature vector representing the three-dimensional model may be generated. A second feature vector representing the first plurality of pixel values of the first background image data may be generated. A first machine learning model may generate a transformed representation of the three-dimensional model using the first feature vector. First foreground image data comprising a two-dimensional representation of the transformed representation of the three-dimensional model may be generated. A frame of composite image data may be generated by combining the first foreground image data with the first background image data.
-
公开(公告)号:US11526697B1
公开(公告)日:2022-12-13
申请号:US16814526
申请日:2020-03-10
Applicant: Amazon Technologies, Inc.
Inventor: Shashank Tripathi , Ambrish Tyagi , Amit Kumar Agrawal , Siddhant Ranade
Abstract: Devices and techniques are generally described for estimating three-dimensional pose data. In some examples, a first machine learning network may generate first three-dimensional (3D) data representing input 2D data. In various examples, a first 2D projection of the first 3D data may be generated. A determination may be made that the first 2D projection conforms to a distribution of natural 2D data. A second machine learning network may generate parameters of a 3D model based at least in part on the input 2D data and based at least in part on the first 3D data. In some examples, second 3D data may be generated using the parameters of the 3D model.
-
公开(公告)号:US10860836B1
公开(公告)日:2020-12-08
申请号:US16192433
申请日:2018-11-15
Applicant: Amazon Technologies, Inc.
Inventor: Ambrish Tyagi , Amit Kumar Agrawal , Siddhartha Chandra , Visesh Uday Kumar Chari , Shashank Tripathi , James Rehg
Abstract: Techniques are generally described for object detection in image data. First image data comprising a first plurality of pixel values representing an object and a second plurality of pixel values representing a background may be received. First foreground image data and first background image data may be generated from the first image data. A first feature vector representing the first plurality of pixel values may be generated. A second feature vector representing a first plurality of pixel values of second background image data may be generated. A first machine learning model may determine a first operation to perform on the first foreground image data. A transformed representation of the first foreground image data may be generated by performing the first operation on the first foreground image data. Composite image data may be generated by compositing the transformed representation of the first foreground image data with the second background image data.
-
-
-