-
Publication No.: US12130857B2
Publication Date: 2024-10-29
Application No.: US17754002
Filing Date: 2020-09-18
Applicant: VERNSTHER
Inventor: Jennifer Dahan
CPC Classification: G06F16/41, G06F16/40, G06F16/685, G06F16/7867, G06F21/16, G10L15/22, G10L15/26
Abstract: Method for editorializing digital audiovisual or audio recording content of an oral presentation given by a speaker using a presentation support enriched with tags and recorded in the form of a digital audiovisual file. This method comprises written transcription of the oral presentation with indication of a time code for each word, comparative automatic analysis of this written transcription and of the tagged presentation support, transposition of the time codes from the written transcription to the tagged presentation support, identification of the tags and of the time codes of the presentation support, and marking of the digital audiovisual file with the tags and time codes, so as to generate an enriched digital audiovisual file.
-
Publication No.: US12124768B2
Publication Date: 2024-10-22
Application No.: US18355000
Filing Date: 2023-07-19
Applicant: Sonos, Inc.
Inventors: Robert Reimann, David Taylor, Abhishek Kumar
IPC Classification: G06F3/16, G06F3/0484, G06F16/60, G06F16/635, G06F16/683
CPC Classification: G06F3/165, G06F3/0484, G06F16/635, G06F16/60, G06F16/683
Abstract: A first playback device comprising an amplifier and a speaker is configured to (i) display, via a touchscreen, respective indications of the first playback device and a second playback device, (ii) receive, via the touchscreen, at least one first input indicating a command to play back media content in a synchrony group with the second playback device, (iii) play back the media content in the synchrony group with the second playback device such that the amplifier drives the speaker by amplifying one or more audio signals that correspond to the media content, (iv) while playing back the media content in the synchrony group with the second playback device, determine, via a voice recognition system, at least one second input indicating a command to modify playback of the media content, and (v) based on the at least one second input, cause the playback of the media content to be modified.
-
Publication No.: US12124506B2
Publication Date: 2024-10-22
Application No.: US18299126
Filing Date: 2023-04-12
Inventors: James H. Pratt, Gregory Edwards
IPC Classification: G06F16/683, B64C39/02, G05D1/00, G06F16/61, H04B7/185, B64U101/00
CPC Classification: G06F16/683, B64C39/024, G05D1/005, G05D1/0088, G05D1/0094, G06F16/61, H04B7/18506, B64U2101/00, B64U2201/10
Abstract: A monitored space is monitored including the production of a first audio signal from received acoustic energy. The first audio signal is then processed against a whitelist of acoustic profiles and, based on lack of substantial correspondence with any of the acoustic profiles, a drone is navigated toward an apparent position of an apparent source. While in-flight, additional acoustic energy is received and a second audio signal is produced from the additional acoustic energy. The second audio signal is processed against the whitelist and, based on lack of substantial correspondence with any of the acoustic profiles of the whitelist, an investigate mode of the drone is initiated. The investigate mode includes notifying a remote monitor and supplying the remote monitor with an audiovisual feed. Responsive to a characterization by the remote monitor, an entry of the whitelist may be updated, added or replaced.
-
Publication No.: US20240330364A1
Publication Date: 2024-10-03
Application No.: US18550261
Filing Date: 2022-02-28
Inventors: Hae Na Kang, Ji Hoon Chung, Yu Jin Kim
IPC Classification: G06F16/635, G06F16/65, G06F16/683
CPC Classification: G06F16/635, G06F16/65, G06F16/685
Abstract: A method for recommending music content is disclosed. The method for recommending music content of the present invention includes the steps of: receiving, by a recommendation apparatus, a plurality of pieces of lyrics selection information from a user terminal, wherein the lyrics selection information is information regarding some lyrics selected from lyrics of music content; generating, by the recommendation apparatus, characteristic information by analyzing at least some of the plurality of pieces of lyrics selection information; retrieving, on the basis of the characteristic information by the recommendation apparatus, at least one of pieces of recommended music content from a database; and recommending, by the recommendation apparatus, the recommended music content to the user terminal.
-
Publication No.: US12105752B2
Publication Date: 2024-10-01
Application No.: US17705100
Filing Date: 2022-03-25
Applicant: Yamaha Corporation
Inventor: Dan Sasai
IPC Classification: G06F16/683, G06F16/65, G06F16/68, G10H1/00
CPC Classification: G06F16/683, G06F16/65, G06F16/686, G10H1/0008
Abstract: An audio analysis device comprises an electronic controller including at least one processor. The electronic controller is configured to execute a plurality of modules including a signal acquisition module configured to acquire an audio signal representing performance sounds of a musical piece, a signal analysis module configured to calculate, for each of a plurality of music categories, a feature value that includes a degree of certainty that the musical piece belongs to the music category, by analyzing the audio signal, and a music selection module configured to select one or more candidate musical pieces whose feature value is similar to the feature value calculated for the musical piece from among a plurality of candidate musical pieces.
-
Publication No.: US20240305623A1
Publication Date: 2024-09-12
Application No.: US18667281
Filing Date: 2024-05-17
Inventor: Saurabh Mavani
IPC Classification: H04L9/40, G06F9/451, G06F16/683, G10L17/00
CPC Classification: H04L63/0823, G06F9/453, G06F16/683, G10L17/00, H04L63/0861
Abstract: Aspects of the disclosure relate to voice biometric authentication in a virtual assistant. In some embodiments, a computing platform may receive, from a user device, an audio file comprising a voice command to access information related to a user account. The computing platform may retrieve one or more voice biometric signatures from a voice biometric database associated with the user account, and apply a voice biometric matching algorithm to compare the voice command of the audio file to the one or more voice biometric signatures to determine if a match exists between the voice command and one of the one or more voice biometric signatures. In response to determining that a match exists, the computing platform may retrieve information associated with the user account, and then send, via the communication interface, the information associated with the user account to the user device.
-
Publication No.: US12080296B2
Publication Date: 2024-09-03
Application No.: US17203652
Filing Date: 2021-03-16
Inventors: John C. Mese, Arnold S. Weksler, Mark Patrick Delaney, Nathan J. Peterson, Russell Speight VanBlon
IPC Classification: G10L15/22, G06F16/683, G10L15/26, G10L25/60, H04L12/18
CPC Classification: G10L15/26, G06F16/685, G10L15/22, G10L25/60, H04L12/1831, G10L2015/225
Abstract: Apparatuses, methods, and program products are disclosed for performing a transcription action. One apparatus includes at least one processor and a memory that stores code executable by the at least one processor. The code is executable by the processor to monitor, by use of the at least one processor, a quality of audio information. The code is executable by the processor to determine whether the quality of the audio information is below a predetermined threshold. The code is executable by the processor to, in response to determining that the quality of the audio information is below the predetermined threshold, perform a transcription action corresponding to the audio information.
-
Publication No.: US12079277B2
Publication Date: 2024-09-03
Application No.: US17114230
Filing Date: 2020-12-07
Applicant: Gracenote, Inc.
IPC Classification: G06F16/901, G06F16/65, G06F16/683, G06F18/2115
CPC Classification: G06F16/9014, G06F16/65, G06F16/683, G06F18/2115, G06F2218/16
Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to improve media identification. An example apparatus includes a hash handler to generate a first set of reference matches by performing hash functions on a subset of media data associated with media to generate hashed media data based on a first bucket size, a candidate determiner to identify a second set of reference matches that include ones of the first set, the second set including ones having first quantities of hits that did not satisfy a threshold, determine second quantities of hits for ones of the second set by matching ones to the hash tables based on a second bucket size, and identify one or more candidate matches based on at least one of (1) ones of the first set or (2) ones of the second set, and a report generator to generate a report including a media identification.
-
Publication No.: US12079270B2
Publication Date: 2024-09-03
Application No.: US16708011
Filing Date: 2019-12-09
IPC Classification: G06F16/00, G06F7/00, G06F16/61, G06F16/638, G06F16/683
CPC Classification: G06F16/638, G06F16/61, G06F16/683
Abstract: A system comprising a client computer, a data store comprising a content management repository, a server computer coupled to the client computer by a network, the server computer comprising code for: receiving audio data; converting the audio data to text; extracting a specified string from the text as an extracted string; determining an extracted string attribute for the extracted string; storing a media file containing the audio data as a content object; configuring the content object to be searchable by the extracted string; receiving a search query from the client application; searching a plurality of managed objects based on the search query; and based on determining that the extracted string matches a search string, returning an indication of the first media file, the extracted string and the extracted string attribute in a search result.
-
Publication No.: US20240281459A1
Publication Date: 2024-08-22
Application No.: US18649360
Filing Date: 2024-04-29
Applicant: Gracenote, Inc.
IPC Classification: G06F16/383, G06F16/335, G06F16/35, G06F16/683, G06F16/783, G06F30/27, G06F40/00, G10L15/00
CPC Classification: G06F16/383, G06F16/335, G06F16/35, G06F16/685, G06F16/7844, G06F30/27, G06F40/00, G10L15/00
Abstract: A method and system for computer-based generation of podcast metadata, to facilitate operations such as searching for and recommending podcasts based on the generated metadata. In an example method, a computing system obtains a text representation of a podcast episode and obtains person data defining a list of person names such as celebrity names. The computing system then correlates the person data with the text representation, to find a match between a listed person name and a text string in the text representation. Further, the computing system predicts a named-entity span in the text representation and determines that the predicted named-entity span matches a location of the text string in the text representation of the podcast episode, and based on this determination, the computing system generates and outputs metadata that associates the person name with the podcast episode.