-
公开(公告)号:US10528821B2
公开(公告)日:2020-01-07
申请号:US15689193
申请日:2017-08-29
Applicant: Amazon Technologies, Inc.
Inventor: Adam Carlson , Douglas Ryan Gray , Ashutosh Vishwas Kulkarni , Colin Jon Taylor
Abstract: A video segmentation system can be utilized to automate segmentation of digital video content. Features corresponding to visual, audio, and/or textual content of the video can be extracted from frames of the video. The extracted features of adjacent frames are compared according to a similarity measure to determine boundaries of a first set of shots or video segments distinguished by abrupt transitions. The first set of shots is analyzed according to certain heuristics to recognize a second set of shots distinguished by gradual transitions. Key frames can be extracted from the first and second set of shots, and the key frames can be used by the video segmentation system to group the first and second set of shots by scene. Additional processing can be performed to associate metadata, such as names of actors or titles of songs, with the detected scenes.
-
公开(公告)号:US09626084B2
公开(公告)日:2017-04-18
申请号:US14283554
申请日:2014-05-21
Applicant: Amazon Technologies, Inc.
Inventor: Charles Benjamin Franklin Waggoner , Colin Jon Taylor , Jeffrey P. Bezos , Douglas Ryan Gray
IPC: G06F3/0484 , G06F3/0488
CPC classification number: G06F3/04842 , G06F3/013 , G06F3/017 , G06F3/04845 , G06F3/0488 , G06F3/04883 , G06F3/167 , G06F2203/0381 , G06F2203/04806 , H04N5/2628 , H04N21/440263 , H04N21/47205 , H04N21/4728
Abstract: A user can select an object represented in video content in order to set a magnification level with respect to that object. A portion of the video frames containing a representation of the object is selected to maintain a presentation size of the representation corresponding to the magnification level. The selection provides for a “smart zoom” feature enabling an object of interest, such as a face of an actor, to be used in selecting an appropriate portion of each frame to magnify, such that the magnification results in a portion of the frame being selected that includes the one or more objects of interest to the user. Pre-generated tracking data can be provided for some objects, which can enable a user to select an object and then have predetermined portion selections and magnifications applied that can provide for a smoother user experience than for dynamically-determined data.
-
公开(公告)号:US12210516B1
公开(公告)日:2025-01-28
申请号:US17854791
申请日:2022-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Xinliang Zhu , Arnab Dhua , Son D. Tran , Douglas Ryan Gray
IPC: G06F16/242 , G06F16/2457
Abstract: Combined feature vectors may be generated to map features of two or more search queries to a common embedding space. A user may provide an initial input query and then provide a refinement query. Independent feature vectors may be generated for each of the initial input query and the refinement query, may be weighted, and then may be combined to form a combined feature vector. The combined feature vector aligns different search modalities within the common embedding space that may be executed against an index.
-
公开(公告)号:US20170083770A1
公开(公告)日:2017-03-23
申请号:US15255978
申请日:2016-09-02
Applicant: Amazon Technologies, Inc.
Inventor: Adam Carlson , Douglas Ryan Gray , Ashutosh Vishwas Kulkarni , Colin Jon Taylor
CPC classification number: G06K9/00765 , G06K9/00147 , G06K9/00744 , G06K9/46 , G06K9/469 , G06K9/6212 , G06K9/6224 , G06T2207/10016
Abstract: A video segmentation system can be utilized to automate segmentation of digital video content. Features corresponding to visual, audio, and/or textual content of the video can be extracted from frames of the video. The extracted features of adjacent frames are compared according to a similarity measure to determine boundaries of a first set of shots or video segments distinguished by abrupt transitions. The first set of shots is analyzed according to certain heuristics to recognize a second set of shots distinguished by gradual transitions. Key frames can be extracted from the first and second set of shots, and the key frames can be used by the video segmentation system to group the first and second set of shots by scene. Additional processing can be performed to associate metadata, such as names of actors or titles of songs, with the detected scenes.
-
公开(公告)号:US20180082127A1
公开(公告)日:2018-03-22
申请号:US15689193
申请日:2017-08-29
Applicant: Amazon Technologies, Inc.
Inventor: Adam Carlson , Douglas Ryan Gray , Ashutosh Vishwas Kulkarni , Colin Jon Taylor
CPC classification number: G06K9/00765 , G06K9/00147 , G06K9/00744 , G06K9/46 , G06K9/469 , G06K9/6212 , G06K9/6224 , G06T2207/10016
Abstract: A video segmentation system can be utilized to automate segmentation of digital video content. Features corresponding to visual, audio, and/or textual content of the video can be extracted from frames of the video. The extracted features of adjacent frames are compared according to a similarity measure to determine boundaries of a first set of shots or video segments distinguished by abrupt transitions. The first set of shots is analyzed according to certain heuristics to recognize a second set of shots distinguished by gradual transitions. Key frames can be extracted from the first and second set of shots, and the key frames can be used by the video segmentation system to group the first and second set of shots by scene. Additional processing can be performed to associate metadata, such as names of actors or titles of songs, with the detected scenes.
-
公开(公告)号:US09564177B1
公开(公告)日:2017-02-07
申请号:US14667645
申请日:2015-03-24
Applicant: Amazon Technologies, Inc.
Inventor: Douglas Ryan Gray , Adam Carlson , Ashutosh Vishwas Kulkarni , Anna Makris , Colin Jon Taylor
CPC classification number: G11B27/3081 , G11B27/007 , G11B27/105 , H04N5/783 , H04N5/85 , H04N9/8205
Abstract: Automatic replay or skip ahead functionality can be configured to intelligently navigate to a portion of a video a user desires to view. The context at which a user selects intelligent navigation can be analyzed to determine where to initiate automatic replay or skip ahead. The context for intelligent navigation can be based on scene or shot segmentation data, closed captioning, aggregate video navigation data from a community of users of shared demographic traits and/or interest, and/or other metadata. In the case of automatic replay, playback of a portion of a video can include enhancements for that portion, such as providing closed captioning, display at a decreased frame rate (“slow motion”), zooming in/out on a portion of the frames of a video segment, among other enhancements.
Abstract translation: 自动重播或跳过功能可以配置为智能地导航到用户希望查看的视频的一部分。 可以分析用户选择智能导航的上下文以确定在何处启动自动重放或跳过。 智能导航的背景可以基于场景或拍摄分割数据,隐藏字幕,来自共享人口特征和/或兴趣的用户社区的聚合视频导航数据和/或其他元数据。 在自动重放的情况下,视频的一部分的回放可以包括对该部分的增强,例如提供隐藏字幕,以降低的帧速率(“慢动作”)显示,在帧的一部分上放大/缩小 的视频片段,以及其他增强功能。
-
公开(公告)号:US09558784B1
公开(公告)日:2017-01-31
申请号:US14667652
申请日:2015-03-24
Applicant: Amazon Technologies, Inc.
Inventor: Douglas Ryan Gray , Adam Carlson , Ashutosh Vishwas Kulkarni , Anna Makris , Colin Jon Taylor
CPC classification number: G11B27/005 , H04N5/783 , H04N9/8205
Abstract: Automatic replay or skip ahead functionality can be configured to intelligently navigate to a portion of a video a user desires to view. The context at which a user selects intelligent navigation can be analyzed to determine where to initiate automatic replay or skip ahead. The context for intelligent navigation can be based on scene or shot segmentation data, closed captioning, aggregate video navigation data from a community of users of shared demographic traits and/or interest, and/or other metadata. In the case of automatic replay, playback of a portion of a video can include enhancements for that portion, such as providing closed captioning, display at a decreased frame rate (“slow motion”), zooming in/out on a portion of the frames of a video segment, among other enhancements.
-
公开(公告)号:US09805270B2
公开(公告)日:2017-10-31
申请号:US15255978
申请日:2016-09-02
Applicant: Amazon Technologies, Inc.
Inventor: Adam Carlson , Douglas Ryan Gray , Ashutosh Vishwas Kulkarni , Colin Jon Taylor
CPC classification number: G06K9/00765 , G06K9/00147 , G06K9/00744 , G06K9/46 , G06K9/469 , G06K9/6212 , G06K9/6224 , G06T2207/10016
Abstract: A video segmentation system can be utilized to automate segmentation of digital video content. Features corresponding to visual, audio, and/or textual content of the video can be extracted from frames of the video. The extracted features of adjacent frames are compared according to a similarity measure to determine boundaries of a first set of shots or video segments distinguished by abrupt transitions. The first set of shots is analyzed according to certain heuristics to recognize a second set of shots distinguished by gradual transitions. Key frames can be extracted from the first and second set of shots, and the key frames can be used by the video segmentation system to group the first and second set of shots by scene. Additional processing can be performed to associate metadata, such as names of actors or titles of songs, with the detected scenes.
-
公开(公告)号:US10664140B2
公开(公告)日:2020-05-26
申请号:US15452201
申请日:2017-03-07
Applicant: Amazon Technologies, Inc.
Inventor: Charles Benjamin Franklin Waggoner , Colin Jon Taylor , Jeffrey P. Bezos , Douglas Ryan Gray
IPC: G06F3/0484 , G06F3/0488 , G06F3/01 , H04N5/262 , H04N21/4728 , H04N21/4402 , H04N21/472 , G06F3/16
Abstract: A user can select an object represented in video content in order to set a magnification level with respect to that object. A portion of the video frames containing a representation of the object is selected to maintain a presentation size of the representation corresponding to the magnification level. The selection provides for a “smart zoom” feature enabling an object of interest, such as a face of an actor, to be used in selecting an appropriate portion of each frame to magnify, such that the magnification results in a portion of the frame being selected that includes the one or more objects of interest to the user. Pre-generated tracking data can be provided for some objects, which can enable a user to select an object and then have predetermined portion selections and magnifications applied that can provide for a smoother user experience than for dynamically-determined data.
-
公开(公告)号:US20170177197A1
公开(公告)日:2017-06-22
申请号:US15452201
申请日:2017-03-07
Applicant: Amazon Technologies, Inc.
Inventor: Charles Benjamin Franklin Waggoner , Colin Jon Taylor , Jeffrey P. Bezos , Douglas Ryan Gray
IPC: G06F3/0484 , H04N5/262 , G06F3/16 , G06F3/0488 , G06F3/01
CPC classification number: G06F3/04842 , G06F3/013 , G06F3/017 , G06F3/04845 , G06F3/0488 , G06F3/04883 , G06F3/167 , G06F2203/0381 , G06F2203/04806 , H04N5/2628 , H04N21/440263 , H04N21/47205 , H04N21/4728
Abstract: A user can select an object represented in video content in order to set a magnification level with respect to that object. A portion of the video frames containing a representation of the object is selected to maintain a presentation size of the representation corresponding to the magnification level. The selection provides for a “smart zoom” feature enabling an object of interest, such as a face of an actor, to be used in selecting an appropriate portion of each frame to magnify, such that the magnification results in a portion of the frame being selected that includes the one or more objects of interest to the user. Pre-generated tracking data can be provided for some objects, which can enable a user to select an object and then have predetermined portion selections and magnifications applied that can provide for a smoother user experience than for dynamically-determined data.
-
-
-
-
-
-
-
-
-