-
1.
公开(公告)号:US11829413B1
公开(公告)日:2023-11-28
申请号:US17030103
申请日:2020-09-23
Applicant: Amazon Technologies, Inc.
Inventor: Xiang Hao , Jingxiang Chen , Vernon Germano , Muhammad Raffay Hamid , Lakshay Sharma
IPC: G06F16/783 , G06N20/00 , G06F16/75
CPC classification number: G06F16/7847 , G06F16/75 , G06N20/00
Abstract: Techniques for temporal localization of mature content in long-form videos using only video-level labels are described. According to some embodiments, computer-implemented method includes receiving a request to train a machine learning model on a training video file comprising at least one mature content label, training the machine learning model to generate a feature vector for each of a plurality of video frames of the training video file, generate a plurality of frame-level mature content classification scores of the training video file from the feature vectors of the training video file, and generate a video-level mature content classification score of the training video file from the plurality of frame-level mature content classification scores for the training video file based at least in part on the at least one mature content label of the training video file, receiving a request for an input video file, generating, by the machine learning model in response to the request, a feature vector for each of a plurality of video frames of the input video file, a plurality of frame-level mature content classification scores of the input video file from the feature vectors of the input video file, and a video-level mature content classification score of the input video file from the plurality of frame-level mature content classification scores for the input video file, and transmitting the plurality of frame-level mature content classification scores of the input video file or the video-level mature content classification score of the input video file to a client application or to a storage location.
-
公开(公告)号:US12067779B1
公开(公告)日:2024-08-20
申请号:US17668014
申请日:2022-02-09
Applicant: Amazon Technologies, Inc.
Inventor: Shixing Chen , Xiang Hao , Xiaohan Nie , Muhammad Raffay Hamid
IPC: G06V20/40 , G06V10/774
CPC classification number: G06V20/48 , G06V10/774 , G06V20/46
Abstract: A plurality of similar video pairs may be determined based on one or more similarity information types. Each video pair of the plurality of similar video pairs may include a first respective video and a second respective video. For each video pair, one or more similar scene pairs may be determined. Each of the one or more similar scene pairs may include a respective first scene from the first respective video and a second respective scene from the second respective video. An encoder may be trained using a contrastive learning model that contrasts a plurality of similar scene pairs with a plurality of random scenes. The plurality of similar scene pairs may include the one or more scene pairs for each video pair. One or more scene features of one or more other scenes of one or more other videos may be determined using the encoder.
-
公开(公告)号:US10467507B1
公开(公告)日:2019-11-05
申请号:US15491804
申请日:2017-04-19
Applicant: Amazon Technologies, Inc.
Abstract: An image quality assessment solution analyzes an image quality and a correlation of an image to an item description associated with the item. The content quality assessment may assign a quality score to the image based on a composition of the image and/or correlation with the image description. The score may be based on a model that is trained to analyze images using a learning model. Based on the image score, a correlation score, or other scores, the user may be given feedback on how to improve an image. A service provider providing this service may use the score to influence recommendation results that use the images.
-
公开(公告)号:US10897649B1
公开(公告)日:2021-01-19
申请号:US16142584
申请日:2018-09-26
Applicant: Amazon Technologies, Inc.
Inventor: Vernon Germano , Kripa Kanchana Sivakumar , Xiang Hao , Emily Evon McAninly
IPC: H04N21/25 , H04N21/466 , H04N21/475 , G06F16/71
Abstract: Methods and apparatus are described that relate to the use of machine learning techniques to accurately predict ratings for various types of content across multiple mature themes.
-
公开(公告)号:US12047645B1
公开(公告)日:2024-07-23
申请号:US17698798
申请日:2022-03-18
Applicant: Amazon Technologies, Inc.
Inventor: Xiang Hao , Ahmed Aly Saad Ahmed , Diana Nassar , Mohamed Kamal Omar , Steven James Cox , Saida Lehiany
IPC: H04N21/466 , G06V20/40 , H04N21/258
CPC classification number: H04N21/4665 , G06V20/47 , H04N21/25883 , G06V2201/10
Abstract: A system can be utilized to retrieve media content and rating schemas, to determine maturity ratings for media content. The media content can be utilized to determine segments of data as building blocks associated with mature content. The building blocks can be mapped to content descriptors and rating levels associated with the rating schemas. The building blocks can be compared the media content to identify portions of the media content that have characteristics represented by the building blocks. The building blocks representing the characteristics in the portions of the media content can be utilized to select content descriptors and rating levels associated with the media content. The selected content descriptor and selected rating levels can be utilized to control how, and/or whether, the media content is made available for output to the consumers.
-
公开(公告)号:US11829717B1
公开(公告)日:2023-11-28
申请号:US16998794
申请日:2020-08-20
Applicant: Amazon Technologies, Inc.
Inventor: Jingxiang Chen , Vernon Germano , Xiang Hao
IPC: G06F40/253 , G06N20/00 , G06V20/40
CPC classification number: G06F40/253 , G06N20/00 , G06V20/41
Abstract: Devices, systems, and methods are provided for context-based abusive language detection and responses. A method may include identifying text associated with first video content, and determining that a first word in the text matches a first keyword indicative of abusive language. The method may include determining a first label associated with the first word, the first label indicating that the first word is ambiguous. The method may include identifying a first sentence of the text, the first sentence including the first word. The method may include determining first and second context of the first word and the first sentence. The method may include determining, based on the first and second context, using a machine learning model, a second label associated with the first sentence, the second label indicating a probability that the first sentence includes abusive language. The method may include generating second video content for presentation.
-
公开(公告)号:US11617008B1
公开(公告)日:2023-03-28
申请号:US17218009
申请日:2021-03-30
Applicant: Amazon Technologies, Inc.
Inventor: Tarun Gupta , Mayank Sharma , Xiang Hao , Muhammad Raffay Hamid , Zhitao Qiu
IPC: H04N21/439 , G06N20/00 , H04N21/466 , H04N21/475
Abstract: Methods, systems, and computer-readable media for media classification using local and global audio features are disclosed. A media classification system determines local features of an audio input using an audio event detector model that is trained to detect a plurality of audio event classes descriptive of objectionable content. The local features are extracted using maximum values of audio event scores for individual audio event classes. The media classification system determines one or more global features of the audio input using the audio event detector model. The global feature(s) are extracted using averaging of clip-level descriptors of a plurality of clips of the audio input. The media classification system determines a content-based rating for media comprising the audio input based (at least in part) on the local features of the audio input and based (at least in part) on the global feature(s) of the audio input.
-
公开(公告)号:US11153655B1
公开(公告)日:2021-10-19
申请号:US16142487
申请日:2018-09-26
Applicant: Amazon Technologies, Inc.
Inventor: Vernon Germano , Kripa Kanchana Sivakumar , Xiang Hao
IPC: H04N21/25 , H04N21/258 , H04N21/45 , H04N21/466 , H04N21/475 , H04N21/482 , G06N5/04 , H04N21/488 , G06N20/00
Abstract: Methods and apparatus are relating to the use of machine learning techniques to identify unrated content that will be appealing to particular demographics. Using explicit feedback (e.g., star ratings) and implicit feedback (e.g., viewing behavior) for a given demographic, feature sets are extracted from the rated titles and then used to recognize similar titles in unrated content.
-
-
-
-
-
-
-