-
公开(公告)号:US12114042B2
公开(公告)日:2024-10-08
申请号:US18148746
申请日:2022-12-30
IPC分类号: H04N21/442 , G01S15/58 , G10L19/018 , H04N21/439
CPC分类号: H04N21/44218 , G01S15/586 , G10L19/018 , H04N21/4394
摘要: A method system for use of Doppler shift as a basis to detect user focus, such as to detect that a user was attracted to audio media and/or to an associated object. A portable processing device carried by the user receives audio media emitted from an audio source at a fixed location, the audio media having periodic watermarking encoded at a baseline frequency. The portable processing device detects a change in frequency of the periodic watermarking over time, such as the frequency progressing from at least being higher than the baseline frequency to being the baseline frequency for at least a predefined threshold period of time. Based on the detected change in frequency of the periodic watermarking over time, the portable device then provides a report indicating that the user was attracted to the audio media and/or to an object (e.g., a commercial object) collocated with the audio source.
-
公开(公告)号:US12094474B1
公开(公告)日:2024-09-17
申请号:US18510537
申请日:2023-11-15
发明人: Sven Adrian Gowal , Christopher Gamble , Florian Nils Stimberg , Sylvestre-Alvise Guglielmo Rebuffi , Sree Meghana Thotakuri , Jamie Hayes , Ian Goodfellow , Rudy Bunel , Miklós Zsigmond Horváth , David Stutz , Olivia Anne Wiles
IPC分类号: G10L19/018 , G06F21/16 , G10L21/0232
CPC分类号: G10L19/018 , G06F21/16 , G10L21/0232
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for verifying the provenance of a digital object generated by a neural network, such as an image or audio object. Also methods, systems, and apparatus, including computer programs, for training a watermarking neural network and a watermark decoding neural network. The described techniques make efficient use of computing resources and are robust to attack.
-
公开(公告)号:US20240290334A1
公开(公告)日:2024-08-29
申请号:US18428732
申请日:2024-01-31
IPC分类号: G10L19/018 , G06F21/10 , H04H60/45 , H04H60/51
CPC分类号: G10L19/018 , H04H60/45 , G06F21/1063 , H04H60/51 , H04H2201/50
摘要: Disclosed example people monitoring methods include detecting a first watermark in a first audio signal obtained from an acoustic sensor, the first watermark identifying media presented by a monitored media device, determining whether a second watermark, different from the first watermark, is embedded in the first audio signal obtained from the acoustic sensor, the second watermark identifying at least one of a mobile device or a user of the mobile device, classifying the second watermark as a media watermark or a people monitoring watermark based on a characteristic of the second watermark, and when the second watermark is determined to be embedded in the first audio signal, reporting at least one of the second watermark or information decoded from the second watermark to identify at least one of the mobile device or the user of the mobile device as being exposed to the media presented by the monitored media device.
-
公开(公告)号:US20240278118A1
公开(公告)日:2024-08-22
申请号:US18171926
申请日:2023-02-21
IPC分类号: A63F13/358 , A63F13/215 , A63F13/355 , A63F13/54 , G10L19/018 , G10L19/16
CPC分类号: A63F13/358 , A63F13/215 , A63F13/355 , A63F13/54 , G10L19/018 , G10L19/167
摘要: A data processing system implements an acoustic delay detection technique for detecting and correcting inter-stream latency between two audio streams in a cloud-based computing environment. A first audio stream of game audio is sent to a controller or headset associated with the cloud-based computing environment, and a second audio steam of game audio is send to a display device associated with the cloud-based computing environment. An acoustic marker that is inaudible to human users is added to the second audio stream. A microphone associated with the controller or headset records audio content output by a speaker of the display device. The recording includes the acoustic marker. The gaming platform correlates this recording with the acoustic marker to determine a difference between the time that the controller played the audio and the time that the display device played the audio in order to determine and compensate for an inter-stream latency.
-
公开(公告)号:US20240273138A1
公开(公告)日:2024-08-15
申请号:US18644954
申请日:2024-04-24
发明人: Rui MIN , Hongcheng WANG
IPC分类号: G06F16/683 , G06F16/9535 , G06Q30/0202 , G10L15/08 , G10L15/22 , G10L15/30 , G10L19/018 , G10L25/54 , G10L25/69 , H04L9/40
CPC分类号: G06F16/683 , G06F16/9535 , G10L15/08 , G10L15/22 , G10L15/30 , G10L19/018 , G10L25/54 , G10L25/69 , G06Q30/0202 , G10L2015/085 , H04L63/1458
摘要: Methods and systems for more efficient analyses of and response to voice commands and queries are provided. The system may be configured to receive one or more of audio files corresponding to a voice query and determine, for each of the audio files, whether the audio file is a first type of audio file capable of being processed based on a characteristic of the audio file or a second type of audio file that cannot, and may require further processing in order to recognize the voice query associated with the audio file. The system may process each of the first type of audio files and respond to the associated voice queries. The system may also determine a priority for each of the second type of audio files for further processing of the second type of audio files.
-
公开(公告)号:US20240235847A1
公开(公告)日:2024-07-11
申请号:US18290677
申请日:2022-07-22
申请人: John Elijah Jacobson
发明人: John Elijah Jacobson
IPC分类号: H04L9/32 , G06T1/00 , G06V20/40 , G10L19/018 , G10L25/57
CPC分类号: H04L9/3247 , G06T1/0085 , G06V20/44 , G10L19/018 , G10L25/57 , G06V2201/10
摘要: Display badges employing scene embedded digital watermarks for authenticating media data typically comprising: an audio detection component detecting at least a portion of ambient audio data of an actual event; a computing device operably connected to a recording component; the computing device converting at least a portion of the detected ambient audio data into a digital representation of the at least a portion of the ambient audio data; a display presenting a succession of images comprising the digital representation; where the display badges are designed such that the digital representation is sufficiently visible that it may be extracted by a computer upon replay of audio and video of some or all of the actual event, and the replay audio may be verified as authentic by comparing the digital representation with the audio associated with the replay. Methods for encoding and authenticating media data are also disclosed.
-
7.
公开(公告)号:US20240221763A1
公开(公告)日:2024-07-04
申请号:US18148226
申请日:2022-12-29
申请人: Nvidia Corporation
发明人: Boris Ginsburg
IPC分类号: G10L19/018 , G10L13/04
CPC分类号: G10L19/018 , G10L13/04
摘要: Approaches presented herein provide for insertion of watermarks into synthesized content, such as audio content that may include synthesized speech to appear to be spoken by a digital avatar in a 3D virtual environment. A Text-to-Speech (TTS) generator, such as a trained neural network, can be used to produce synthetic speech audio, which can have an audio watermark inserted therein. This watermark can be detected by a process of a collaborative content generation platform, for example, and an indication can be provided that the content contains synthesized speech. The presence of the audio watermark will generally not be detectable by the human ear during presentation. To make it difficult to remove or modify the watermark, the watermark can be generated using a key or other unique piece of data known only to authorized entities.
-
公开(公告)号:US12026196B2
公开(公告)日:2024-07-02
申请号:US16839306
申请日:2020-04-03
发明人: Rui Min , Stefan Deichmann , Hongcheng Wang
IPC分类号: G06F7/00 , G06F16/2455 , G06F16/632 , G10L19/018
CPC分类号: G06F16/634 , G06F16/24552 , G10L19/018
摘要: An audio file associated with a user voice query may be received at a user device. The audio file may be compared to a plurality of references, such as cache entries, corresponding to a plurality of other voice queries. Based on a determination that the voice query corresponds to one of the references, an operation associated with the voice query may be executed. An indication may be received that the operation was not an intended operation associated with the voice query. Based on receiving this indication, the incorrectly identified operation, associated reference, e.g., voice query, may be disabled for the user or the device. However, the cache entry may remain enabled for one or more of a plurality of other devices.
-
公开(公告)号:US20240212694A1
公开(公告)日:2024-06-27
申请号:US18508207
申请日:2023-11-13
发明人: Jeffrey DETWEILER
IPC分类号: G10L19/018 , G06F16/60
CPC分类号: G10L19/018 , G06F16/60
摘要: A method operable in a receiver comprises the steps of: receiving an audio signal from a broadcaster; obtaining digital information comprising an identifier associated with the broadcaster; retrieving an audio watermark associated with the identifier; encoding the audio watermark in the audio signal; and outputting an acoustic signal based on the encoded audio signal so that the audio watermark can be sensed by a listening device without being audible by a human listener.
-
10.
公开(公告)号:US20240161759A1
公开(公告)日:2024-05-16
申请号:US18056132
申请日:2022-11-16
IPC分类号: G10L19/018 , G06F30/20 , G08B6/00
CPC分类号: G10L19/018 , G06F30/20 , G08B6/00
摘要: Computer game transmitters can shift frequencies of game assets such as audio or haptic assets to the ultrasonic range and then mix the assets with game audio, transmitting the mix. Receivers then separate the audio from the audible band to play the audio and downmix the assets for play thereof. The frequencies of the assets can be companded prior to transmission.
-
-
-
-
-
-
-
-
-