-
公开(公告)号:US11625928B1
公开(公告)日:2023-04-11
申请号:US17009311
申请日:2020-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Tamojit Chatterjee , Mayank Sharma , Muhammad Raffay Hamid , Sandeep Joshi
Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.
-
公开(公告)号:US11900700B2
公开(公告)日:2024-02-13
申请号:US18175044
申请日:2023-02-27
Applicant: Amazon Technologies, Inc.
Inventor: Tamojit Chatterjee , Mayank Sharma , Muhammad Raffay Hamid , Sandeep Joshi
CPC classification number: G06V20/635 , G06F40/169 , G06N7/01 , G06V20/40 , G11B27/10 , G06V20/44
Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.
-
公开(公告)号:US10945041B1
公开(公告)日:2021-03-09
申请号:US16890940
申请日:2020-06-02
Applicant: Amazon Technologies, Inc.
Inventor: Tamojit Chatterjee , Mayank Sharma , Muhammad Raffay Hamid , Sandeep Joshi
IPC: H04N21/47 , G10L15/00 , G10L15/26 , H04N21/488
Abstract: Devices, systems, and methods are provided for language-agnostic subtitle drift detection and localization. A method may include extracting audio from video, dividing the audio into overlapping blocks, and determining the probabilities of overlapping portions of the blocks, the probabilities indicating a presence of voice data represented by the audio in the blocks. The method may generate machine blocks using overlapping portions of blocks where voice data is present, and may map the machine blocks to corresponding blocks indicating that subtitles are available for the video. For mapped blocks, the method may include determining features such as when subtitles are available without voice audio, when voice audio is available without subtitles, and when voice audio and subtitles both are available. Using the features, the method may include determining the probability that the video includes subtitle drift, and the method may include analyzing the video to localize where the subtitle drift occurs.
-
公开(公告)号:US20230282006A1
公开(公告)日:2023-09-07
申请号:US18175044
申请日:2023-02-27
Applicant: Amazon Technologies, Inc.
Inventor: Tamojit Chatterjee , Mayank Sharma , Muhammad Raffay Hamid , Sandeep Joshi
IPC: G06V20/62 , G11B27/10 , G06F40/169 , G06V20/40 , G06N7/01
CPC classification number: G06V20/635 , G11B27/10 , G06F40/169 , G06V20/40 , G06N7/01 , G06V20/44
Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.
-
-
-