Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Sandeep Joshi"

1.

发明授权
Data retention management for partitioned datasets 有权

公开(公告)号：US12072868B1

公开(公告)日：2024-08-27

申请号：US17224987

申请日：2021-04-07

Applicant: Amazon Technologies, Inc.

Inventor： Daniel Opincariu , Sandeep Joshi

IPC: G06F16/23 , G06F16/22

CPC classification number: G06F16/2379 , G06F16/2228

Abstract: Systems and methods are disclosed to implement a data storage system that manages data retention for partitioned datasets. A received data retention policy specifies to selectively delete data from a dataset based on a set of data retention attributes. If the data retention attributes are part of the dataset's partition key, a first type of data deletion job is configured to selectively delete entire partitions of the dataset. Otherwise, the system will generate a retention attribute index for the dataset, which will be used by a second type of data deletion job to selectively delete individual records within the partitions. In embodiments, the retention attribute index is implemented as Bloom filters that track retention attribute values in each partition. Advantageously, the disclosed system is able to automatically configure deletion jobs for any dataset schema that avoids full scans of the dataset partitions.

2.

发明授权
Language agnostic drift correction 有权

公开(公告)号：US11625928B1

公开(公告)日：2023-04-11

申请号：US17009311

申请日：2020-09-01

Applicant: Amazon Technologies, Inc.

Inventor： Tamojit Chatterjee , Mayank Sharma , Muhammad Raffay Hamid , Sandeep Joshi

IPC: G06F17/00 , G06V20/62 , G11B27/10 , G06N7/00 , G06F40/169 , G06V20/40

Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.

3.

发明授权
Language agnostic drift correction 有权

公开(公告)号：US11900700B2

公开(公告)日：2024-02-13

申请号：US18175044

申请日：2023-02-27

Applicant: Amazon Technologies, Inc.

Inventor： Tamojit Chatterjee , Mayank Sharma , Muhammad Raffay Hamid , Sandeep Joshi

IPC: G06F17/00 , G06V20/62 , G11B27/10 , G06F40/169 , G06V20/40 , G06N7/01

CPC classification number: G06V20/635 , G06F40/169 , G06N7/01 , G06V20/40 , G11B27/10 , G06V20/44

Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.

4.

发明授权
Language agnostic automated voice activity detection 有权

公开(公告)号：US11869537B1

公开(公告)日：2024-01-09

申请号：US17523777

申请日：2021-11-10

Applicant: Amazon Technologies, Inc.

Inventor： Mayank Sharma , Sandeep Joshi , Muhammad Raffay Hamid

IPC: G10L25/84 , G10L15/06 , G10L15/22 , G10L15/16 , G10L25/18

CPC classification number: G10L25/84 , G10L15/063 , G10L15/16 , G10L15/22 , G10L25/18

Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for language agnostic automated voice activity detection. Example methods may include determining an audio file associated with video content, generating audio segments using the audio file, the audio segments including a first segment and a second segment, and determining that the first segment includes first voice activity. Methods may include determining that the second segment comprises second voice activity, determining that voice activity is present between a first timestamp associated with the first segment and a second timestamp associated with the second segment, and generating text data representing the voice activity that is present between the first timestamp and the second timestamp.

5.

发明授权
Language-agnostic subtitle drift detection and localization 有权

公开(公告)号：US10945041B1

公开(公告)日：2021-03-09

申请号：US16890940

申请日：2020-06-02

Applicant: Amazon Technologies, Inc.

Inventor： Tamojit Chatterjee , Mayank Sharma , Muhammad Raffay Hamid , Sandeep Joshi

IPC: H04N21/47 , G10L15/00 , G10L15/26 , H04N21/488

Abstract: Devices, systems, and methods are provided for language-agnostic subtitle drift detection and localization. A method may include extracting audio from video, dividing the audio into overlapping blocks, and determining the probabilities of overlapping portions of the blocks, the probabilities indicating a presence of voice data represented by the audio in the blocks. The method may generate machine blocks using overlapping portions of blocks where voice data is present, and may map the machine blocks to corresponding blocks indicating that subtitles are available for the video. For mapped blocks, the method may include determining features such as when subtitles are available without voice audio, when voice audio is available without subtitles, and when voice audio and subtitles both are available. Using the features, the method may include determining the probability that the video includes subtitle drift, and the method may include analyzing the video to localize where the subtitle drift occurs.

6.

发明公开
LANGUAGE AGNOSTIC DRIFT CORRECTION 审中-公开

公开(公告)号：US20230282006A1

公开(公告)日：2023-09-07

申请号：US18175044

申请日：2023-02-27

Applicant: Amazon Technologies, Inc.

Inventor： Tamojit Chatterjee , Mayank Sharma , Muhammad Raffay Hamid , Sandeep Joshi

IPC: G06V20/62 , G11B27/10 , G06F40/169 , G06V20/40 , G06N7/01

CPC classification number: G06V20/635 , G11B27/10 , G06F40/169 , G06V20/40 , G06N7/01 , G06V20/44

Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.

7.

发明授权
Language agnostic automated voice activity detection 有权

公开(公告)号：US11205445B1

公开(公告)日：2021-12-21

申请号：US16436351

申请日：2019-06-10

Applicant: Amazon Technologies, Inc.

Inventor： Mayank Sharma , Sandeep Joshi , Muhammad Raffay Hamid

IPC: G10L25/84 , G10L15/22 , G10L25/18 , G10L15/06 , G10L15/16

Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for language agnostic automated voice activity detection. Example methods may include determining an audio file associated with video content, generating a number of audio segments using the audio file, the plurality of audio segments including a first segment and a second segment, where the first segment and the second segment are consecutive segments. Example methods may include determining, using a Gated Recurrent Unit neural network, that the first segment includes first voice activity, determining, using the Gated Recurrent Unit neural network, that the second segment includes second voice activity, and determining that voice activity is present between a first timestamp associated with the first segment and a second timestamp associated with the second segment.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification