-
公开(公告)号:US20230421855A1
公开(公告)日:2023-12-28
申请号:US18244625
申请日:2023-09-11
申请人: Google LLC
发明人: Chenjie Gu , Wei-Hong Chuang , Min-Hsuan Tsai , Jianfeng Yang , Ji Zhang , Honglu Zhou , Hassan Akbari
IPC分类号: H04N21/472 , G11B27/34
CPC分类号: H04N21/47217 , G11B27/34
摘要: Methods and systems for time marking of media items at a platform using machine-learning are provided herein. An indication of a identified media item is provided as input to a machine-learning model and one or more outputs of the machine-learning model is obtained. The one or more obtained outputs comprise time marks identifying each of the plurality of content segments of the media item. Each of the plurality of content segments is associated with a segment start indicator for a timeline of the media item. A resulting duration is determined of a combination of the plurality of content segments for which the time marks were obtained from the one or more of outputs of the machine-learning model. Responsive to determining that the resulting duration is less than the duration of the media item, one or more further inputs is provided to the machine learning model.
-
公开(公告)号:US20210166035A1
公开(公告)日:2021-06-03
申请号:US17120525
申请日:2020-12-14
申请人: Google LLC
发明人: Sanketh Shetty , Tomas Izo , Min-Hsuan Tsai , Sudheendra Vijayanarasimhan , Apostol Natsev , Sami Abu-El-Haija , George Dan Toderici , Susana Ricco , Balakrishnan Varadarajan , Nicola Muscettola , WeiHsin Gu , Weilong Yang , Nitin Khandelwal , Phuong Le
IPC分类号: G06K9/00 , G06F16/783
摘要: A computer-implemented method for selecting representative frames for videos is provided. The method includes receiving a video and identifying a set of features for each of the frames of the video. The features including frame-based features and semantic features. The semantic features identifying likelihoods of semantic concepts being present as content in the frames of the video. A set of video segments for the video is subsequently generated. Each video segment includes a chronological subset of frames from the video and each frame is associated with at least one of the semantic features. The method generates a score for each frame of the subset of frames for each video segment based at least on the semantic features, and selecting a representative frame for each video segment based on the scores of the frames in the video segment. The representative frame represents and summarizes the video segment.
-
公开(公告)号:US11758233B1
公开(公告)日:2023-09-12
申请号:US17835547
申请日:2022-06-08
申请人: Google LLC
发明人: Chenjie Gu , Wei-Hong Chuang , Min-Hsuan Tsai , Jianfeng Yang , Ji Zhang , Honglu Zhou , Hassan Akbari
IPC分类号: H04N21/472 , G11B27/34
CPC分类号: H04N21/47217 , G11B27/34
摘要: Methods and systems for time marking of media items at a platform using machine-learning are provided herein. A media item is provided to users of a platform. An indication of the identified media item is provided as input to a machine-learning model that is trained using different feature types of historical media items to predict a plurality of content segments of a given media item each depicting, to the one or more users, a distinct section of the media item. One or more outputs of the machine-learning model are obtained comprising time marks identifying each of the plurality of content segments of the media item. Each of the plurality of content segments are associated with a segment start indicator for a timeline of the media item. The media item and an indication of each segment start indicator is provided for presentation to at least one user.
-
公开(公告)号:US12014542B2
公开(公告)日:2024-06-18
申请号:US17120525
申请日:2020-12-14
申请人: Google LLC
发明人: Sanketh Shetty , Tomas Izo , Min-Hsuan Tsai , Sudheendra Vijayanarasimhan , Apostol Natsev , Sami Abu-El-Haija , George Dan Toderici , Susana Ricco , Balakrishnan Varadarajan , Nicola Muscettola , WeiHsin Gu , Weilong Yang , Nitin Khandelwal , Phuong Le
IPC分类号: G06K9/00 , G06F16/783 , G06V20/40
CPC分类号: G06V20/41 , G06F16/7834 , G06V20/46 , G06V20/47 , G06V20/49 , G06V2201/10
摘要: A computer-implemented method for selecting representative frames for videos is provided. The method includes receiving a video and identifying a set of features for each of the frames of the video. The features including frame-based features and semantic features. The semantic features identifying likelihoods of semantic concepts being present as content in the frames of the video. A set of video segments for the video is subsequently generated. Each video segment includes a chronological subset of frames from the video and each frame is associated with at least one of the semantic features. The method generates a score for each frame of the subset of frames for each video segment based at least on the semantic features, and selecting a representative frame for each video segment based on the scores of the frames in the video segment. The representative frame represents and summarizes the video segment.
-
公开(公告)号:US20230402065A1
公开(公告)日:2023-12-14
申请号:US17835550
申请日:2022-06-08
申请人: Google LLC
发明人: Chenjie Gu , Wei-Hong Chuang , Min-Hsuan Tsai , Jianfeng Yang , Keren Gu-Lemberg , Flora Xue , Shubham Agrawal , Yuzhu Dong , Ji Zhang , Mahdis Mahdieh , Gagan Bansal , Kai Chen
IPC分类号: G11B27/10 , G11B27/34 , G06F3/0481 , G06F3/04842
CPC分类号: G11B27/102 , G11B27/34 , G06F3/0481 , G06F3/04842
摘要: Methods and systems for predicting titles for contents segments of media items at a platform using machine-learning are provided herein. A media item is provided to users of a platform, the media item having a plurality of content segments comprising a first content segment and a second content segment preceding the first content segment in the media item. The first content segment and a title of the second content segment are provided as input to a machine-learning model trained to predict a title for the first content segment that is consistent with the title of the second content segment. One or more outputs of the machine-learning model are obtained which indicate the title for the first content segment. An indication of each content segment and a respective title of each content segment are provided for presentation to at least one user of the one or more users.
-
公开(公告)号:US20180239964A1
公开(公告)日:2018-08-23
申请号:US15959858
申请日:2018-04-23
申请人: Google LLC
发明人: Sanketh Shetty , Tomas Izo , Min-Hsuan Tsai , Sudheendra Vijayanarasimhan , Apostol Natsev , Sami Abu-El-Haija , George Dan Toderici , Susanna Ricco , Balakrishnan Varadarajan , Nicola Muscettola , WeiHsin Gu , Weilong Yang , Nitin Khandelwal , Phuong Le
CPC分类号: G06K9/00718 , G06F16/7834 , G06K9/00744 , G06K9/00751 , G06K9/00765 , G06K2209/27
摘要: A computer-implemented method for selecting representative frames for videos is provided. The method includes receiving a video and identifying a set of features for each of the frames of the video. The features including frame-based features and semantic features. The semantic features identifying likelihoods of semantic concepts being present as content in the frames of the video. A set of video segments for the video is subsequently generated. Each video segment includes a chronological subset of frames from the video and each frame is associated with at least one of the semantic features. The method generates a score for each frame of the subset of frames for each video segment based at least on the semantic features, and selecting a representative frame for each video segment based on the scores of the frames in the video segment. The representative frame represents and summarizes the video segment.
-
公开(公告)号:US09953222B2
公开(公告)日:2018-04-24
申请号:US14848216
申请日:2015-09-08
申请人: Google LLC
发明人: Sanketh Shetty , Tomas Izo , Min-Hsuan Tsai , Sudheendra Vijayanarasimhan , Apostol Natsev , Sami Abu-El-Haija , George Dan Toderici , Susanna Ricco , Balakrishnan Varadarajan , Nicola Muscettola , WeiHsin Gu , Weilong Yang , Nitin Khandelwal , Phuong Le
CPC分类号: G06K9/00718 , G06F17/30787 , G06K9/00744 , G06K9/00751 , G06K9/00765 , G06K2209/27
摘要: A computer-implemented method for selecting representative frames for videos is provided. The method includes receiving a video and identifying a set of features for each of the frames of the video. The features including frame-based features and semantic features. The semantic features identifying likelihoods of semantic concepts being present as content in the frames of the video. A set of video segments for the video is subsequently generated. Each video segment includes a chronological subset of frames from the video and each frame is associated with at least one of the semantic features. The method generates a score for each frame of the subset of frames for each video segment based at least on the semantic features, and selecting a representative frame for each video segment based on the scores of the frames in the video segment. The representative frame represents and summarizes the video segment.
-
-
-
-
-
-