-
公开(公告)号:US12249344B1
公开(公告)日:2025-03-11
申请号:US17853773
申请日:2022-06-29
Applicant: Amazon Technologies, Inc.
Inventor: Christopher Evans , Sumit Garg , Ameya Agaskar , Mohammad Edris Qarghah , Zhengping Jin
IPC: G10L25/51 , G10L15/08 , G10L15/22 , G10L19/018
Abstract: Described herein is a system for encoding audio watermarks with frequency extensions to enable enhanced watermark detection. An extended audio watermark may include an existing audio watermark and a duplicate audio watermark, enabling backwards compatibility with existing watermark detection while also enabling enhanced watermark detection with increased accuracy. For example, embedding the extended audio watermark enables (i) limited devices to perform watermark detection to detect the existing audio watermark, and (ii) improved devices to perform enhanced watermark detection to detect the extended audio watermark. As the extended audio watermark includes redundancy in the form of duplicate audio watermark(s), an accuracy of performing enhanced watermark detection is increased relative to detecting the existing audio watermark alone.
-
公开(公告)号:US12205601B1
公开(公告)日:2025-01-21
申请号:US17853183
申请日:2022-06-29
Applicant: Amazon Technologies, Inc.
Inventor: David McGuire , Ahmed Abdelal , Sai Kiran Venkata Subramanya Rupanagudi , Sumit Garg , Terrence Yu , Nathaniel White , Siddharth Agrawal , Pavas Kant , Yuxuan Hao , Nagaraj Mahajan , Ameya Agaskar , Aaron Challenner
IPC: G10L19/018 , G06F21/62 , G06V20/40 , G11B27/34 , H04R3/00
Abstract: A system configured to perform content recognition using fingerprinting to recognize known media content. A device determines fingerprints based on decoded content data to be sent using a media interface component to an output component. Metadata related to the content/device/fingerprint may also be created. The fingerprints and metadata are sent by the device to a supporting system for orchestration and matching of the fingerprints to known media content.
-
公开(公告)号:US12136428B1
公开(公告)日:2024-11-05
申请号:US17490271
申请日:2021-09-30
Applicant: Amazon Technologies, Inc.
Inventor: Ameya Agaskar , Sumit Garg
IPC: G10L19/018 , G10L15/22
Abstract: Described herein is a system for embedding audio watermarks. To improve performance without a user perceiving the audio watermark, a system embeds audio watermarks in audio data using scaling factors that are calculated based on a spectral masking level for each frame of the audio data. The scaling factors may vary over time and correspond to an amplitude of the audio watermark across a series of watermark frames. The system processes the audio data to determine a spectral mask, which represents an amount of energy perceived in a first frequency range that is caused by energy represented in neighboring frequency ranges. By selecting scaling factor values that keep an amplitude of the audio watermark below the threshold indicated by the spectral mask, the system may embed the audio watermark in the first audio data without the audio watermark being audible to the user.
-
-