-
公开(公告)号:US11342003B1
公开(公告)日:2022-05-24
申请号:US16711797
申请日:2019-12-12
Applicant: Amazon Technologies, Inc.
Inventor: Christian Garcia Siagian , Christian Ciabattoni , David Niu , Lawrence Kyuil Chang , Gordon Zheng , Ritesh Pase , Shiva Krishnamurthy , Ramakanth Mudumba
Abstract: Disclosed are various embodiments for segmenting and classifying video content using sounds. In one embodiment, a plurality of segments of a video content item are generated by analyzing audio accompanying the video content item. A subset of the plurality of segments that correspond to music segments is selected based at least in part on an audio characteristic of the subset of the plurality of segments. Individual segments of the subset of the plurality of segments are processed to determine whether a classification applies to the individual segments. A list of segments of the video content item to which the classification applies is generated.
-
公开(公告)号:US11120839B1
公开(公告)日:2021-09-14
申请号:US16711841
申请日:2019-12-12
Applicant: Amazon Technologies, Inc.
Inventor: Christian Garcia Siagian , Christian Ciabattoni , David Niu , Lawrence Kyuil Chang , Gordon Zheng , Ritesh Pase , Shiva Krishnamurthy , Ramakanth Mudumba
Abstract: Disclosed are various embodiments for segmenting and classifying video content using conversation. In one embodiment, a plurality of segments of a video content item are generated by analyzing audio accompanying the video content item. A subset of the plurality of segments that correspond to conversation segments are selected. Individual segments of the subset of the plurality of segments are processed to determine whether a classification applies to the individual segments. A list of segments of the video content item to which the classification applies is generated.
-
公开(公告)号:US11776273B1
公开(公告)日:2023-10-03
申请号:US17107514
申请日:2020-11-30
Applicant: Amazon Technologies, Inc.
Inventor: Shixing Chen , Muhammad Raffay Hamid , Vimal Bhat , Shiva Krishnamurthy
IPC: G06V20/40 , G06N5/04 , G06N20/20 , G10L25/78 , G06F18/213
CPC classification number: G06V20/49 , G06F18/213 , G06N5/04 , G06N20/20 , G10L25/78
Abstract: Techniques for automatic scene change detection are described. As one example, a computer-implemented method includes receiving a request to train an ensemble of machine learning models on a training dataset of videos having labels that indicate scene changes to detect a scene change in a video, partitioning each video file of the training dataset of videos into a plurality of shots, training the ensemble of machine learning models into a trained ensemble of machine learning models based at least in part on the plurality of shots of the training dataset of videos and the labels that indicate scene changes, receiving an inference request for an input video, partitioning the input video into a plurality of shots, generating, by the trained ensemble of machine learning models, an inference of one or more scene changes in the input video based at least in part on the plurality of shots of the input video, and transmitting the inference to a client application or to a storage location.
-
公开(公告)号:US11582522B1
公开(公告)日:2023-02-14
申请号:US17332498
申请日:2021-05-27
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar , Shiva Krishnamurthy , Steven David Prinz , Craig Critchley , Arjun Cholkar , Andrew James McVeigh
IPC: H04N21/4722 , H04N21/45 , G06F16/78 , H04N21/431 , H04N21/478 , H04N21/4402
Abstract: A system can be configured to receive entertainment content requested by a user and identify content segments and content features from the entertainment content. The content segments can be utilized to identify portions of the entertainment content for enrichment and/or enhancement by the system. The content features can be utilized to associate the entertainment content and the content segments with supplemental content that includes or is associated with the content features. The content features can indicate genres, scene classifications, significant figures credited with creating the entertainment content, and other points of interests for users interested in the entertainment content. The associations between the entertainment content and the supplemental content can enable the system to engage the users by presenting the supplemental content determined to match interests of the users.
-
-
-