-
公开(公告)号:US20250068673A1
公开(公告)日:2025-02-27
申请号:US18813647
申请日:2024-08-23
Applicant: SRI International
Inventor: Mitchell Leigh McLaren , Aaron Dennis Lawson
IPC: G06F16/683 , G06F16/61 , G10L15/00
Abstract: A computing system is configured to obtain a plurality of media files that each includes speech of one or more speakers. The computing system is further configured to process the plurality of media files to generate indexed data, wherein the indexed data includes a corresponding embedding for each speaker of the one or more speakers identified in the media file and a corresponding one or more keywords identified in the speech in the media file. The computing system is further configured to receive an indication at least one of a selection of a particular speaker from the one or more speakers or a selection of a particular keyword from a plurality of keywords. The computing system is further configured to generate one or more correlations based on the indexed data. The computing system is further configured to output an alert regarding the one or more correlations.
-
公开(公告)号:US12237069B2
公开(公告)日:2025-02-25
申请号:US17750749
申请日:2022-05-23
Applicant: X-SYSTEM LIMITED
Inventor: Nigel Osborne , Robert Ashcroft , Paul Robertson , Peter Kingsley
Abstract: The present invention relates to a method and system for analysing audio (eg. music) tracks. A predictive model of the neuro-physiological functioning and response to sounds by one or more of the human lower cortical, limbic and subcortical regions in the brain is described. Sounds are analysed so that appropriate sounds can be selected and played to a listener in order to stimulate and/or manipulate neuro-physiological arousal in that listener. The method and system are particularly applicable to applications harnessing a biofeedback resource.
-
公开(公告)号:US12230236B2
公开(公告)日:2025-02-18
申请号:US18389438
申请日:2023-11-14
Applicant: NIKE, Inc.
Inventor: Justin Fraga , Levi J. Patton
IPC: G06F3/16 , G06F16/683 , G10H1/00 , G10H1/46 , H03G3/20
Abstract: An adaptive music playback system is disclosed. The system includes a composition system that receives information corresponding to user activity levels. The composition system determines target musical criteria corresponding to the user activity levels and modifies the composition of a song in response to changes in user activity.
-
公开(公告)号:US12222980B2
公开(公告)日:2025-02-11
申请号:US18362482
申请日:2023-07-31
Applicant: YAHOO ASSETS LLC
Inventor: Malcolm Slaney , Kilian Weinberger
IPC: G06F16/58 , G06F16/583 , G06F16/587 , G06F16/68 , G06F16/683 , G06F16/687 , G06F16/78 , G06F16/783 , G06F16/787
Abstract: A method of generating congruous metadata is provided. The method includes receiving a similarity measure between at least two multimedia objects. Each multimedia object has associated metadata. If the at least two multimedia objects are similar based on the similarity measure and a similarity threshold, the associated metadata of each of the multimedia objects are compared. Then, based on the comparison of the associated metadata of each of the at least two multimedia objects, the method further includes generating congruous metadata. Metadata may be tags, for example.
-
公开(公告)号:US20250021599A1
公开(公告)日:2025-01-16
申请号:US18904278
申请日:2024-10-02
Applicant: GRACENOTE, INC.
Inventor: Markus Kurt Cremer , Todd Hodges
IPC: G06F16/683 , G06F16/632 , G06F16/65 , G06F16/68 , G06F18/22
Abstract: Methods, apparatus, systems and articles of manufacture are disclosed to identify media based on historical data. An example method includes: comparing (a) a pitch shifted fingerprint, (b) a time shifted fingerprint, or (c) a resampled fingerprint to a reference fingerprint; in response to a match between any of (a) the pitch shifted fingerprint, (b) the time shifted fingerprint, or (c) the resampled fingerprint and the reference fingerprint, generating indications of (a) a pitch shift value, (b) a time shift value, or (c) a resample ratio that caused the match; in response to collecting broadcast media for a threshold period of time, processing the one or more indications; and in response to a request for a recommendation for information associated with a query, transmitting the recommendation including one or more frequencies of occurrence of (a) the pitch shift value, (b) the time shift value, or (c) the resample ratio.
-
公开(公告)号:US12198664B2
公开(公告)日:2025-01-14
申请号:US17446757
申请日:2021-09-02
Applicant: Snap Inc.
Inventor: Itamar Berger , Gal Dudovitch , Gal Sasson , Ma'ayan Shuvi , Matan Zohar
Abstract: Methods and systems are disclosed for performing operations comprising: receiving a monocular image that includes a depiction of a person wearing an article of clothing; generating a segmentation of the article of clothing worn by the person in the monocular image; obtaining one or more audio-track related augmented reality elements; and applying the one or more audio-track related augmented reality elements to the article of clothing worn by the person based on the segmentation of the article of clothing worn by the person.
-
公开(公告)号:US20240420683A1
公开(公告)日:2024-12-19
申请号:US18816342
申请日:2024-08-27
Applicant: Walmart Apollo, LLC
Inventor: Praneeth Gubbala , Xuan Zhang , Bahula Bosetti , Priya Ashok Kumar Choudhary , Dong T. Nguyen , Shivraj V. Kodak , William Craig Robinson, Jr.
IPC: G10L15/06 , G06F16/683 , G10L15/02 , G10L15/08
Abstract: Some embodiments provide retail product ordering systems comprising: a user computing device comprising an application executed by a device control circuit to: receive an audible utterance; controls a product identifier application interface to: apply a tokenizer model and obtain a set of individual search words; apply a series of featurizer models to the search words to generate features; and apply a classifier and extractor model based on the features and generate multiple requested product entities each comprising a respective sub-set of the position labeled product terms; wherein the device control circuit is further configured to access a purchase history database, confirm an accuracy of each of requested product entities relative to a purchase history, generate a listing of determined product identifiers corresponding to the confirmed set of the multiple requested product entities, and control a display system of the user computing device to render the listing of determined product identifiers.
-
公开(公告)号:US12167107B2
公开(公告)日:2024-12-10
申请号:US17350651
申请日:2021-06-17
Applicant: Musixmatch S.P.A.
Inventor: Marco Paglia , Paolo Spazzini , Pierpaolo Di Panfilo , Nicolae-Daniel Dima , Emanuele Cantalini , Christian Zanin
IPC: H04N21/472 , G06F16/68 , G06F16/683 , H04N21/4725 , H04N21/81 , H04N21/845 , H04N21/8547
Abstract: In one embodiment, a computer-implemented method for editing navigation of a content item is disclosed. The method may include presenting, via a user interface at a client computing device, time-synchronized text pertaining to the content item; receiving an input of a tag for the time-synchronized text of the content item; storing the tag associated with the time-synchronized text of the content item; and responsive to receiving a request to play the content item: playing the content item via a media player presented in the user interface, and concurrently presenting the time-synchronized text and the tag as a graphical user element in the user interface.
-
公开(公告)号:US12130857B2
公开(公告)日:2024-10-29
申请号:US17754002
申请日:2020-09-18
Applicant: VERNSTHER
Inventor: Jennifer Dahan
CPC classification number: G06F16/41 , G06F16/40 , G06F16/685 , G06F16/7867 , G06F21/16 , G10L15/22 , G10L15/26
Abstract: Method for editorializing digital audiovisual or audio recording content of an oral presentation given by a speaker using a presentation support enriched with tags and recorded in the form of a digital audiovisual file. This method comprises written transcription of the oral presentation with indication of a time code for each word, comparative automatic analysis of this written transcription and of the tagged presentation support, transposition of the time codes from the written transcription to the tagged presentation support, identification of the tags and of the time codes of the presentation support, and marking of the digital audiovisual file with the tags and time codes, so as to generate an enriched digital audiovisual file.
-
公开(公告)号:US12124768B2
公开(公告)日:2024-10-22
申请号:US18355000
申请日:2023-07-19
Applicant: Sonos, Inc.
Inventor: Robert Reimann , David Taylor , Abhishek Kumar
IPC: G06F3/16 , G06F3/0484 , G06F16/60 , G06F16/635 , G06F16/683
CPC classification number: G06F3/165 , G06F3/0484 , G06F16/635 , G06F16/60 , G06F16/683
Abstract: A first playback device comprising an amplifier and a speaker is configured to (i) display, via a touchscreen, respective indications of the first playback device and a second playback device, (ii) receive, via the touchscreen, at least one first input indicating a command to play back media content in a synchrony group with the second playback device, (iii) play back the media content in the synchrony group with the second playback device such that the amplifier drives the speaker by amplifying one or more audio signals that correspond to the media content, (iv) while playing back the media content in the synchrony group with the second payback device, determine, via a voice recognition system, at least one second input indicating a command to modify playback of the media content, and (v) based on the at least one second input, cause the playback of the media content to be modified.
-
-
-
-
-
-
-
-
-