-
公开(公告)号:US11158344B1
公开(公告)日:2021-10-26
申请号:US14870227
申请日:2015-09-30
Applicant: Amazon Technologies, Inc.
Inventor: Matthew Alan Townsend , Rohith Mysore Vijaya Kumar , Yadunandana Nagaraja Rao , Ambrish Tyagi , Eduard Oks , Apoorv Chaudhri
IPC: G11B27/031 , H04N9/79
Abstract: Devices, systems and methods are disclosed for improving story assembly and video summarization. For example, video clips may be received and a theme may be determined from the received video clips based on annotation data or other characteristics of the received video data. Individual moments may be extracted from the video clips, based on the selected theme and the annotation data. The moments may be ranked based on a priority metric corresponding to content determined to be desirable for purposes of video summarization. Select moments may be chosen based on the priority metric and a structure may be determined based on the selected theme. Finally, a video summarization may be generated using the selected theme and the structure, the video summarization including the select moments.
-
公开(公告)号:US09992412B1
公开(公告)日:2018-06-05
申请号:US14687451
申请日:2015-04-15
Applicant: Amazon Technologies, Inc.
Inventor: Ambrish Tyagi
CPC classification number: H04N5/23238 , G03B17/561 , H04N5/2252 , H04N5/2258 , H04N5/23296 , H04N5/247
Abstract: A camera device having verged cameras is disclosed. A camera device may include a housing and four cameras disposed in the housing. The housing may define a horizontal plane passing through the center of the housing. Each of the four cameras may be verged at an angle defined by a longitudinal center axis of the camera and the horizontal plane. Each camera may include a vertical field of view verged at the same angle. The camera device may produce a panoramic image (e.g., a panoramic still image or panoramic video) using two or more of the cameras. Systems and processes including the camera device are also disclosed.
-
公开(公告)号:US09842402B1
公开(公告)日:2017-12-12
申请号:US14976460
申请日:2015-12-21
Applicant: Amazon Technologies, Inc.
Inventor: Rohith Mysore Vijaya Kumar , Abhishek Singh , Ambrish Tyagi
CPC classification number: G06T7/0081 , G06T7/0097 , G06T7/2053 , G06T7/206 , G06T7/215 , G06T2207/20021 , G06T2207/20056 , G06T2207/20144 , G06T2207/20224 , G06T2207/30241 , H04N1/00 , H04N5/23206 , H04N5/23238 , H04N5/23254 , H04N5/3415
Abstract: Various examples are directed to systems and methods for detecting regions in video frames. For example, a computing device may receive a video comprising a plurality of frames and a video frame sequence of the plurality of frames. The computing device may select a plurality of scene point location from a first frame. The computing device may determine a plurality of columns in the first frame and fit a first sinusoidal function to a distribution of average column Y-axis displacements for the plurality of columns by column position. The computing device may determine a first difference based at least in part on the first scene point Y-axis displacement and an output of the first sinusoidal function at the X-axis position of the first scene point and determine that the first difference is greater than a threshold distance.
-
公开(公告)号:US09818451B1
公开(公告)日:2017-11-14
申请号:US14976844
申请日:2015-12-21
Applicant: Amazon Technologies, Inc.
Inventor: Ambrish Tyagi , Suresh Bholabhai Lakhani , Rohith Mysore Vijaya Kumar , Yadunandana Nagaraja Rao , Amit Kumar Agrawal
CPC classification number: G11B27/34 , G06K9/00744 , G06K9/00751 , G06K9/00758 , G06K9/00765 , G11B27/3081
Abstract: A system and method for selecting portions of video data from preview video data is provided. The system may extract image features from the preview video data and discard video frames associated with poor image quality based on the image features. The system may determine similarity scores between individual video frames and corresponding transition costs and may identify transition points in the preview video data based on the similarity scores and/or transition costs. The system may select portions of the video data for further processing based on the transition points and the image features. By selecting portions of the video data, the system may reduce a bandwidth consumption, processing burden and/or latency associated with uploading the video data or performing further processing.
-
公开(公告)号:US09462230B1
公开(公告)日:2016-10-04
申请号:US14230047
申请日:2014-03-31
Applicant: Amazon Technologies, Inc.
Inventor: Amit Kumar Agrawal , Timothy Thomas Gray , Ambrish Tyagi
Abstract: A system determines if someone watching a live video feed looks or moves away from a display screen, and when their attention is back on the display, provides an accelerated recap of the content that they missed. The video component of the feed may be shown as a series of selected still images or clips from the original feed, while audio and/or text captioning is output at an accelerated rate. The rate may be adaptively adjusted to maintain a consistent speed, and superfluous content may be omitted. When the recap catches up to the live feed, output returns to regular speed.
Abstract translation: 系统确定观看实时视频馈送的人是否看起来或远离显示屏幕,并且当他们的注意力回到显示器上时,提供他们错过的内容的加速回顾。 馈送的视频分量可以被显示为来自原始馈送的一系列所选择的静止图像或剪辑,同时以加速的速率输出音频和/或文本字幕。 可以自适应地调整速率以保持一致的速度,并且可以省略多余的内容。 当回顾达到直播饲料时,输出返回正常速度。
-
公开(公告)号:US09384384B1
公开(公告)日:2016-07-05
申请号:US14034379
申请日:2013-09-23
Applicant: Amazon Technologies, Inc.
Inventor: Ambrish Tyagi
CPC classification number: G06T11/60 , G06K9/00228
Abstract: A computing device can acquire a set of images, each image including at least a portion of a user's face. The images can be acquired using one or more cameras and/or from an image library/database associated with the user. Based on the images including the user's face (or portions thereof), a virtual representation for the user's face can be generated. The device can subsequently receive or identify an image including a facial representation (e.g., face or portion thereof) to be adjusted. The device can analyze the image including the facial representation and determine that the facial representation sufficiently matches the virtual representation. Using the virtual representation, (at least a portion of) the face can be adjusted. For example, one or more variations or details associated with the user's face, which are provided via the virtual representation, can be used to replace, improve, or otherwise modify the face in the image.
Abstract translation: 计算设备可以获取一组图像,每个图像包括用户脸部的至少一部分。 可以使用一个或多个照相机和/或与用户相关联的图像库/数据库来获取图像。 基于包括用户脸部(或其部分)的图像,可以生成用户脸部的虚拟表示。 该装置随后可以接收或识别包括要调整的面部表情(例如,面部或部分)的图像。 设备可以分析包括面部表情的图像,并确定面部表情与虚拟表示充分匹配。 使用虚拟表示,(面部的至少一部分)可以被调整。 例如,可以使用通过虚拟表示提供的与用户面部相关联的一个或多个变体或细节来替代,改进或以其他方式修改图像中的面部。
-
公开(公告)号:US11810597B2
公开(公告)日:2023-11-07
申请号:US17492781
申请日:2021-10-04
Applicant: Amazon Technologies, Inc.
Inventor: Matthew Alan Townsend , Rohith Mysore Vijaya Kumar , Yadunandana Nagaraja Rao , Ambrish Tyagi , Eduard Oks , Apoorv Chaudhri
IPC: G11B27/031 , H04N9/79
CPC classification number: G11B27/031 , H04N9/79
Abstract: Devices, systems and methods are disclosed for improving story assembly and video summarization. For example, video clips may be received and a theme may be determined from the received video clips based on annotation data or other characteristics of the received video data. Individual moments may be extracted from the video clips, based on the selected theme and the annotation data. The moments may be ranked based on a priority metric corresponding to content determined to be desirable for purposes of video summarization. Select moments may be chosen based on the priority metric and a structure may be determined based on the selected theme. Finally, a video summarization may be generated using the selected theme and the structure, the video summarization including the select moments.
-
公开(公告)号:US11631260B1
公开(公告)日:2023-04-18
申请号:US17132738
申请日:2020-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Shashank Tripathi , Visesh Chari , Ambrish Tyagi , Amit Kumar Agrawal , James Rehg , Siddhartha Chandra
Abstract: Techniques are generally described for object detection in image data. First image data comprising a three-dimensional model representing an object may be received. First background image data comprising a first plurality of pixel values may be received. A first feature vector representing the three-dimensional model may be generated. A second feature vector representing the first plurality of pixel values of the first background image data may be generated. A first machine learning model may generate a transformed representation of the three-dimensional model using the first feature vector. First foreground image data comprising a two-dimensional representation of the transformed representation of the three-dimensional model may be generated. A frame of composite image data may be generated by combining the first foreground image data with the first background image data.
-
公开(公告)号:US11450008B1
公开(公告)日:2022-09-20
申请号:US16803363
申请日:2020-02-27
Applicant: Amazon Technologies, Inc.
Inventor: Ambrish Tyagi , Siddhartha Chandra , Amit Kumar Agrawal , Viveka Kulharia
Abstract: Devices and techniques are generally described for weakly-supervised object segmentation in image data. In various examples, a first frame of image data may be received. The first frame may include a first bounding box surrounding a first set of pixels, wherein first subset of pixels of the first set of pixels represent a first object of a first class and wherein second subset of pixels of the first set of pixels represent background image data. Cross-entropy loss may be determined for the first set of pixels. In some examples, a spatial attention map may be determined for the first set of pixels. In further examples, parameters of a convolutional neural network may be determined by modulating the cross-entropy loss for the first set of pixels using the spatial attention map. The convolutional neural network may be used to generate a segmentation map.
-
公开(公告)号:US10582149B1
公开(公告)日:2020-03-03
申请号:US15435896
申请日:2017-02-17
Applicant: Amazon Technologies, Inc.
Inventor: Rohith Mysore Vijaya Kumar , Ambrish Tyagi , Yadunandana Nagaraja Rao , Suresh Bholabhai Lakhani , Amit Kumar Agrawal
Abstract: A system and method for generating preview data from video data and using the preview data to select portions of the video data or determine an order with which to upload the video data. The system may sample video data to generate sampled video data and may identify portions of the sampled video data having complexity metrics exceeding a threshold. The system may upload a first portion of the video data corresponding to the identified portions while omitting a second portion of the video data. The system may determine an order with which to upload portions of the video data based on a complexity of the video data. Therefore, portions of the video data that may require additional processing after being uploaded may be prioritized and uploaded first. As a result, a latency between the video data being uploaded and a video summarization being received is reduced.
-
-
-
-
-
-
-
-
-