Patent search cpc:"G10L19/018" Page 1

1.

Granted invention patent
Use of doppler shift as a basis to determine area of focus (In force)

Publication number: US12114042B2

Publication date: 2024-10-08

Application number: US18148746

Filing date: 2022-12-30

Applicant: The Nielsen Company (US), LLC

Inventors: John Thomas LiVoti, Stanley Wellington Woodruff

IPC classification: H04N21/442, G01S15/58, G10L19/018, H04N21/439

CPC classification: H04N21/44218, G01S15/586, G10L19/018, H04N21/4394

Abstract: A method and system for use of Doppler shift as a basis to detect user focus, such as to detect that a user was attracted to audio media and/or to an associated object. A portable processing device carried by the user receives audio media emitted from an audio source at a fixed location, the audio media having periodic watermarking encoded at a baseline frequency. The portable processing device detects a change in frequency of the periodic watermarking over time, such as the frequency progressing from at least being higher than the baseline frequency to being the baseline frequency for at least a predefined threshold period of time. Based on the detected change in frequency of the periodic watermarking over time, the portable device then provides a report indicating that the user was attracted to the audio media and/or to an object (e.g., a commercial object) collocated with the audio source.
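
Illustrative sketch (not the patented implementation): the detection rule in this abstract can be approximated in a few lines of Python. The speed of sound (343 m/s), the function names, the tolerance parameter, and the rule that a drop below baseline cancels a pending approach are all assumptions made for illustration.

```python
from typing import Optional, Sequence, Tuple

SPEED_OF_SOUND_M_S = 343.0  # assumed value for air at roughly 20 degrees C


def observed_frequency(baseline_hz: float, approach_speed_m_s: float) -> float:
    """Doppler-shifted frequency heard by a listener moving toward a fixed source."""
    return baseline_hz * (SPEED_OF_SOUND_M_S + approach_speed_m_s) / SPEED_OF_SOUND_M_S


def detect_focus(
    samples: Sequence[Tuple[float, float]],  # (time_s, measured_hz), sorted by time
    baseline_hz: float,
    tolerance_hz: float,
    dwell_s: float,
) -> bool:
    """True if the frequency was above baseline and then held at baseline for dwell_s."""
    seen_higher = False
    dwell_start: Optional[float] = None
    for time_s, freq_hz in samples:
        if abs(freq_hz - baseline_hz) <= tolerance_hz:
            # At baseline: the listener is stationary relative to the source.
            if seen_higher:
                if dwell_start is None:
                    dwell_start = time_s
                if time_s - dwell_start >= dwell_s:
                    return True
        else:
            # Moving: restart the dwell timer. Above baseline means approaching;
            # below baseline means receding, which cancels the approach (assumption).
            dwell_start = None
            seen_higher = freq_hz > baseline_hz
    return False
```

For example, a listener walking at 1.5 m/s toward a 1000 Hz watermark would hear roughly 1004.4 Hz under these assumptions, so `tolerance_hz` needs to be well below 4 Hz for walking speeds to register.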

2.

Granted invention patent
Verifying the provenance of a digital object using watermarking and embeddings (In force)

Publication number: US12094474B1

Publication date: 2024-09-17

Application number: US18510537

Filing date: 2023-11-15

Applicant: DeepMind Technologies Limited

Inventors: Sven Adrian Gowal, Christopher Gamble, Florian Nils Stimberg, Sylvestre-Alvise Guglielmo Rebuffi, Sree Meghana Thotakuri, Jamie Hayes, Ian Goodfellow, Rudy Bunel, Miklós Zsigmond Horváth, David Stutz, Olivia Anne Wiles

IPC classification: G10L19/018, G06F21/16, G10L21/0232

CPC classification: G10L19/018, G06F21/16, G10L21/0232

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for verifying the provenance of a digital object generated by a neural network, such as an image or audio object. Also described are methods, systems, and apparatus, including computer programs, for training a watermarking neural network and a watermark decoding neural network. The described techniques make efficient use of computing resources and are robust to attack.

3.

Published invention application
AUDIO WATERMARKING FOR PEOPLE MONITORING (Under examination, published)

Publication number: US20240290334A1

Publication date: 2024-08-29

Application number: US18428732

Filing date: 2024-01-31

Applicant: The Nielsen Company (US), LLC

Inventors: Alexander Topchy, Padmanabhan Soundararajan, Venugopal Srinivasan

IPC classification: G10L19/018, G06F21/10, H04H60/45, H04H60/51

CPC classification: G10L19/018, H04H60/45, G06F21/1063, H04H60/51, H04H2201/50

Abstract: Disclosed example people monitoring methods include detecting a first watermark in a first audio signal obtained from an acoustic sensor, the first watermark identifying media presented by a monitored media device, determining whether a second watermark, different from the first watermark, is embedded in the first audio signal obtained from the acoustic sensor, the second watermark identifying at least one of a mobile device or a user of the mobile device, classifying the second watermark as a media watermark or a people monitoring watermark based on a characteristic of the second watermark, and when the second watermark is determined to be embedded in the first audio signal, reporting at least one of the second watermark or information decoded from the second watermark to identify at least one of the mobile device or the user of the mobile device as being exposed to the media presented by the monitored media device.

4.

Published invention application
Synchronizing Audio Streams in Cloud-Based Gaming Environment (Under examination, published)

Publication number: US20240278118A1

Publication date: 2024-08-22

Application number: US18171926

Filing date: 2023-02-21

Applicant: Microsoft Technology Licensing, LLC

Inventors: Krishna Kant Chintalapudi, Pouya HAMADANIAN, Doug GALLATIN, Thomas POUGET-ABADIE

IPC classification: A63F13/358, A63F13/215, A63F13/355, A63F13/54, G10L19/018, G10L19/16

CPC classification: A63F13/358, A63F13/215, A63F13/355, A63F13/54, G10L19/018, G10L19/167

Abstract: A data processing system implements an acoustic delay detection technique for detecting and correcting inter-stream latency between two audio streams in a cloud-based computing environment. A first audio stream of game audio is sent to a controller or headset associated with the cloud-based computing environment, and a second audio stream of game audio is sent to a display device associated with the cloud-based computing environment. An acoustic marker that is inaudible to human users is added to the second audio stream. A microphone associated with the controller or headset records audio content output by a speaker of the display device. The recording includes the acoustic marker. The gaming platform correlates this recording with the acoustic marker to determine a difference between the time that the controller played the audio and the time that the display device played the audio in order to determine and compensate for an inter-stream latency.
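
Illustrative sketch (not the patented implementation): the correlation step in this abstract can be pictured as a sliding dot product between the microphone recording and the known marker. The function names are hypothetical, and the sketch assumes the recording starts at the instant the controller plays the corresponding audio, so the best-matching lag is itself the inter-stream latency.

```python
from typing import Sequence


def estimate_delay_samples(recording: Sequence[float], marker: Sequence[float]) -> int:
    """Return the lag, in samples, at which the marker best matches the recording."""
    if not marker or len(marker) > len(recording):
        raise ValueError("marker must be non-empty and no longer than the recording")
    best_lag = 0
    best_score = float("-inf")
    # Slide the marker across the recording and keep the highest correlation.
    for lag in range(len(recording) - len(marker) + 1):
        score = sum(recording[lag + i] * m for i, m in enumerate(marker))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def samples_to_ms(lag_samples: int, sample_rate_hz: int) -> float:
    """Convert a lag in samples to milliseconds."""
    return lag_samples * 1000.0 / sample_rate_hz
```

A brute-force loop like this costs time proportional to the product of the two lengths; a production system would more likely use an FFT-based correlation, which the abstract does not specify either way.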

5.

Published invention application
MEDIA SEARCH FILTERING MECHANISM FOR SEARCH ENGINE (Under examination, published)

Publication number: US20240273138A1

Publication date: 2024-08-15

Application number: US18644954

Filing date: 2024-04-24

Applicant: Comcast Cable Communications, LLC

Inventors: Rui MIN, Hongcheng WANG

IPC classification: G06F16/683, G06F16/9535, G06Q30/0202, G10L15/08, G10L15/22, G10L15/30, G10L19/018, G10L25/54, G10L25/69, H04L9/40

CPC classification: G06F16/683, G06F16/9535, G10L15/08, G10L15/22, G10L15/30, G10L19/018, G10L25/54, G10L25/69, G06Q30/0202, G10L2015/085, H04L63/1458

Abstract: Methods and systems for more efficient analyses of and response to voice commands and queries are provided. The system may be configured to receive one or more audio files corresponding to a voice query and determine, for each of the audio files, whether the audio file is a first type of audio file capable of being processed based on a characteristic of the audio file or a second type of audio file that cannot be so processed and may require further processing in order to recognize the voice query associated with the audio file. The system may process each of the first type of audio files and respond to the associated voice queries. The system may also determine a priority for each of the second type of audio files for further processing of the second type of audio files.

6.

Published invention application
SYSTEMS AND METHODS EMPLOYING SCENE EMBEDDED MARKERS FOR VERIFYING MEDIA (Under examination, published)

Publication number: US20240235847A1

Publication date: 2024-07-11

Application number: US18290677

Filing date: 2022-07-22

Applicant: John Elijah Jacobson

Inventor: John Elijah Jacobson

IPC classification: H04L9/32, G06T1/00, G06V20/40, G10L19/018, G10L25/57

CPC classification: H04L9/3247, G06T1/0085, G06V20/44, G10L19/018, G10L25/57, G06V2201/10

Abstract: Display badges employing scene embedded digital watermarks for authenticating media data typically comprising: an audio detection component detecting at least a portion of ambient audio data of an actual event; a computing device operably connected to a recording component; the computing device converting at least a portion of the detected ambient audio data into a digital representation of the at least a portion of the ambient audio data; a display presenting a succession of images comprising the digital representation; where the display badges are designed such that the digital representation is sufficiently visible that it may be extracted by a computer upon replay of audio and video of some or all of the actual event, and the replay audio may be verified as authentic by comparing the digital representation with the audio associated with the replay. Methods for encoding and authenticating media data are also disclosed.

7.

Published invention application
WATERMARKING FOR SPEECH IN CONVERSATIONAL AI AND COLLABORATIVE SYNTHETIC CONTENT GENERATION SYSTEMS AND APPLICATIONS (Under examination, published)

Publication number: US20240221763A1

Publication date: 2024-07-04

Application number: US18148226

Filing date: 2022-12-29

Applicant: Nvidia Corporation

Inventor: Boris Ginsburg

IPC classification: G10L19/018, G10L13/04

CPC classification: G10L19/018, G10L13/04

Abstract: Approaches presented herein provide for insertion of watermarks into synthesized content, such as audio content that may include synthesized speech to appear to be spoken by a digital avatar in a 3D virtual environment. A Text-to-Speech (TTS) generator, such as a trained neural network, can be used to produce synthetic speech audio, which can have an audio watermark inserted therein. This watermark can be detected by a process of a collaborative content generation platform, for example, and an indication can be provided that the content contains synthesized speech. The presence of the audio watermark will generally not be detectable by the human ear during presentation. To make it difficult to remove or modify the watermark, the watermark can be generated using a key or other unique piece of data known only to authorized entities.
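
Illustrative sketch (not the patented implementation): the abstract does not say how the key is used, so the Python below substitutes a generic keyed spread-spectrum scheme purely to show what "generated using a key" could mean. The HMAC-SHA256 chip generator, the embedding strength, and all function names are assumptions.

```python
import hashlib
import hmac
from typing import List, Sequence


def keyed_chips(key: bytes, n: int) -> List[int]:
    """Derive n pseudorandom +1/-1 chips from the secret key (HMAC-SHA256 in counter mode)."""
    chips: List[int] = []
    counter = 0
    while len(chips) < n:
        block = hmac.new(key, counter.to_bytes(8, "big"), hashlib.sha256).digest()
        for byte in block:
            for bit in range(8):
                chips.append(1 if (byte >> bit) & 1 else -1)
        counter += 1
    return chips[:n]


def embed(audio: Sequence[float], key: bytes, strength: float = 0.01) -> List[float]:
    """Add a low-amplitude keyed chip sequence to the audio samples."""
    chips = keyed_chips(key, len(audio))
    return [sample + strength * chip for sample, chip in zip(audio, chips)]


def detect_score(audio: Sequence[float], key: bytes) -> float:
    """Mean correlation with the keyed chips: near `strength` if marked, near 0 otherwise."""
    chips = keyed_chips(key, len(audio))
    return sum(sample * chip for sample, chip in zip(audio, chips)) / len(audio)
```

A detector would compare `detect_score` against a threshold between 0 and the embedding strength. Without the key, an attacker cannot regenerate the chip sequence, which is the property the abstract attributes to keyed watermarks; this toy version makes no attempt at perceptual shaping or robustness to compression.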

8.

Granted invention patent
Error detection and correction for audio cache (In force)

Publication number: US12026196B2

Publication date: 2024-07-02

Application number: US16839306

Filing date: 2020-04-03

Applicant: Comcast Cable Communications, LLC

Inventors: Rui Min, Stefan Deichmann, Hongcheng Wang

IPC classification: G06F7/00, G06F16/2455, G06F16/632, G10L19/018

CPC classification: G06F16/634, G06F16/24552, G10L19/018

Abstract: An audio file associated with a user voice query may be received at a user device. The audio file may be compared to a plurality of references, such as cache entries, corresponding to a plurality of other voice queries. Based on a determination that the voice query corresponds to one of the references, an operation associated with the voice query may be executed. An indication may be received that the operation was not an intended operation associated with the voice query. Based on receiving this indication, the incorrectly identified operation and its associated reference (e.g., voice query) may be disabled for the user or the device. However, the cache entry may remain enabled for one or more of a plurality of other devices.
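
Illustrative sketch (not the patented implementation): the per-device disabling described in this abstract maps naturally onto a cache entry that carries a set of devices for which it is switched off. The class names are hypothetical, and an exact-match string `fingerprint` stands in for whatever audio comparison the patent actually uses.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set


@dataclass
class CacheEntry:
    operation: str
    # Devices that reported this entry as producing the wrong operation.
    disabled_for: Set[str] = field(default_factory=set)


class VoiceQueryCache:
    def __init__(self) -> None:
        self._entries: Dict[str, CacheEntry] = {}

    def put(self, fingerprint: str, operation: str) -> None:
        """Store the operation associated with a voice-query fingerprint."""
        self._entries[fingerprint] = CacheEntry(operation)

    def lookup(self, fingerprint: str, device_id: str) -> Optional[str]:
        """Return the cached operation unless the entry is disabled for this device."""
        entry = self._entries.get(fingerprint)
        if entry is None or device_id in entry.disabled_for:
            return None
        return entry.operation

    def report_wrong(self, fingerprint: str, device_id: str) -> None:
        """Disable the entry for one device; other devices keep using it."""
        entry = self._entries.get(fingerprint)
        if entry is not None:
            entry.disabled_for.add(device_id)
```

Keeping the disabled set on the entry, rather than deleting the entry, is what lets one device's correction leave the cache intact for everyone else.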

9.

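Published invention application
MEDIA RATINGS WATERMARK ENCODING (Under examination, published)

Publication number: US20240212694A1

Publication date: 2024-06-27

Application number: US18508207

Filing date: 2023-11-13

Applicant: iBiquity Digital Corporation

Inventor: Jeffrey DETWEILER

IPC classification: G10L19/018, G06F16/60

CPC classification: G10L19/018, G06F16/60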

Abstract: A method operable in a receiver comprises the steps of: receiving an audio signal from a broadcaster; obtaining digital information comprising an identifier associated with the broadcaster; retrieving an audio watermark associated with the identifier; encoding the audio watermark in the audio signal; and outputting an acoustic signal based on the encoded audio signal so that the audio watermark can be sensed by a listening device without being audible by a human listener.

10.

Published invention application
UTILIZING INAUDIBLE ULTRASONIC FREQUENCIES TO EMBED ADDITIONAL AUDIO ASSET CHANNELS WITHIN EXISTING AUDIO CHANNELS (Under examination, published)

Publication number: US20240161759A1

Publication date: 2024-05-16

Application number: US18056132

Filing date: 2022-11-16

Applicant: Sony Interactive Entertainment Inc.

Inventors: Christopher M. Pontiga, Celeste Bean, Arthur Kwun

IPC classification: G10L19/018, G06F30/20, G08B6/00

CPC classification: G10L19/018, G06F30/20, G08B6/00

Abstract: Computer game transmitters can shift frequencies of game assets such as audio or haptic assets to the ultrasonic range and then mix the assets with game audio, transmitting the mix. Receivers then separate the audio from the audible band to play the audio and downmix the assets for play thereof. The frequencies of the assets can be companded prior to transmission.
