-
1.
Publication No.: US20240363138A1
Publication date: 2024-10-31
Application No.: US18768497
Filing date: 2024-07-10
Applicant: Gracenote, Inc.
Inventor: Xiaochen Liu , Joseph P. Renner , Joshua E. Morris , Todd J. Hodges , Robert Coover , Zafar Rafii
IPC: G10L25/90 , G06F16/632 , G06F16/683 , G06N3/08 , G06N20/00 , G10L19/022
CPC classification number: G10L25/90 , G06F16/634 , G06F16/685 , G06N3/08 , G06N20/00 , G10L19/022
Abstract: A cover song identification method implemented by a computing system comprises receiving, by a computing system and from a user device, harmonic pitch class profile (HPCP) information that specifies one or more HPCP features associated with target audio content. A major chord profile feature and a minor chord profile feature associated with the target audio content are derived from the HPCP features. Machine learning logic of the computing system determines, based on the major chord profile feature and the minor chord profile feature, a relatedness between the target audio content and each of a plurality of audio content items specified in records of a database. Each audio content item is associated with cover song information. Cover song information associated with an audio content item having a highest relatedness to the target audio content is communicated to the user device.
-
2.
Publication No.: US12061647B2
Publication date: 2024-08-13
Application No.: US17442463
Filing date: 2021-07-15
Applicant: BEIJING ZITIAO NETWORK TECHNOLOGY CO., LTD.
Inventor: Niwen Zheng , Jia Qu
IPC: G06F16/683 , H04N21/431
CPC classification number: G06F16/685 , H04N21/4316
Abstract: Provided are a method and an apparatus for lyric video display, an electronic device, and a computer-readable medium. The method includes: acquiring multimedia data to be displayed, the multimedia data including audio data and lyrics; determining a target time point, and acquiring a target lyric fragment corresponding to the target time point in the lyrics; and displaying the target lyric fragment in combination with a preset background, and playing a part of the audio data corresponding to the target lyric fragment.
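The lookup step named in this abstract (acquiring the lyric fragment that corresponds to a target time point) could be sketched as a binary search over fragment start times. This is only an illustration: the `LyricFragment` type, the `fragment_at` helper, and the sample timings are assumptions introduced here, not details taken from the patent.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class LyricFragment:
    """Hypothetical record: one lyric line and the audio interval it covers."""
    start: float  # seconds from the beginning of the audio
    end: float    # exclusive end of the interval, in seconds
    text: str


def fragment_at(fragments, t):
    """Return the fragment whose [start, end) interval contains t, or None.

    Assumes fragments are sorted by start time and do not overlap.
    """
    starts = [f.start for f in fragments]
    # Index of the last fragment that starts at or before t.
    i = bisect_right(starts, t) - 1
    if i >= 0 and fragments[i].start <= t < fragments[i].end:
        return fragments[i]
    return None  # t falls before the first line or in an instrumental gap


# Made-up sample data for illustration.
lyrics = [
    LyricFragment(0.0, 4.5, "first line"),
    LyricFragment(4.5, 9.0, "second line"),
    LyricFragment(12.0, 15.0, "third line"),
]
```

The matched fragment's `start` and `end` would then bound the part of the audio to play alongside the displayed text.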
-
3.
Publication No.: US20240259479A1
Publication date: 2024-08-01
Application No.: US18627521
Filing date: 2024-04-05
Applicant: Intrado Life & Safety, Inc.
Inventor: Mario Manzanillo
IPC: H04L67/562 , G06F9/54 , G06F16/11 , G06F16/68 , G06F16/683 , G10L15/02 , G10L15/26 , H04M3/42 , H04M3/51
CPC classification number: H04L67/562 , G06F9/547 , G06F16/116 , G06F16/685 , G06F16/686 , G10L15/02 , G10L15/26 , H04M3/42221 , H04M3/5116
Abstract: An example operation may include one or more of receiving an audio file from a public safety answering point (PSAP), the audio file comprising a recording of a telephone call, converting the audio file into a text file that comprises a transcript of the telephone call, identifying a keyword within the text file that is associated with a topic, and transmitting a portion of the text file of the telephone call to one or more subscribers that have registered with the topic.
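The keyword-to-subscriber step in this abstract could look roughly like the sketch below, which takes an already transcribed call, searches it for topic keywords, and collects the surrounding portion of text for each subscriber registered to that topic. The function name `route_transcript`, the dictionary layouts, and the fixed context window are assumptions made for illustration; speech-to-text conversion is out of scope here.

```python
import re


def route_transcript(transcript, topic_keywords, subscriptions, context=40):
    """Return {subscriber: [snippet, ...]} for every topic found in the text.

    topic_keywords: {topic: [keyword, ...]}      (hypothetical layout)
    subscriptions:  {topic: [subscriber, ...]}   (hypothetical layout)
    context:        characters kept on each side of the matched keyword
    """
    deliveries = {}
    for topic, keywords in topic_keywords.items():
        for keyword in keywords:
            # Whole-word, case-insensitive match of the keyword.
            pattern = r"\b" + re.escape(keyword) + r"\b"
            match = re.search(pattern, transcript, re.IGNORECASE)
            if match is None:
                continue
            begin = max(0, match.start() - context)
            snippet = transcript[begin:match.end() + context]
            for subscriber in subscriptions.get(topic, []):
                deliveries.setdefault(subscriber, []).append(snippet)
            break  # one snippet per topic is enough for this sketch
    return deliveries
```

Only the portion around the first matching keyword is forwarded per topic; a real system would presumably choose the portion boundaries more carefully.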
-
4.
Publication No.: US12027171B2
Publication date: 2024-07-02
Application No.: US17402991
Filing date: 2021-08-16
Applicant: 105 Publishing LLC
Inventor: Jason Lloyd Raynor , Patricia Louise Jones
IPC: G10L15/26 , G06F16/635 , G06F16/68 , G06F16/683 , G06V30/32 , G10L21/0216 , G06V30/10
CPC classification number: G10L15/26 , G06F16/635 , G06F16/685 , G06F16/686 , G06V30/32 , G10L21/0216 , G06V30/10
Abstract: As an example, a server may receive, from a computing device, a submission created by an author. The submission includes book data associated with a book and author data associated with the author. The author data includes incarceration data indicating whether the author was incarcerated. The server may determine, based on the author data and the book data, that the submission is publishable. The server may create, based on the book data, a printable book, an e-book, and an audio book and make one or more of the printable book, the e-book, and the audio book available for acquisition.
-
5.
Publication No.: US12026197B2
Publication date: 2024-07-02
Application No.: US18138234
Filing date: 2023-04-24
Applicant: Apple Inc.
Inventor: David Chance Graham , Cyrus Daniel Irani , Aimee Piercy , Thomas Alsina
IPC: G10L15/22 , G06F3/16 , G06F9/451 , G06F16/332 , G06F16/432 , G06F16/435 , G06F16/632 , G06F16/635 , G06F16/683 , G10L15/18 , G10L15/30
CPC classification number: G06F16/634 , G06F3/167 , G06F9/453 , G06F16/3329 , G06F16/433 , G06F16/435 , G06F16/635 , G06F16/685 , G10L15/1815 , G10L15/22 , G10L15/30 , G10L2015/223
Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors and memory, receiving a first natural-language speech input indicative of a request for media, where the first natural-language speech input comprises a first search parameter; providing, by a digital assistant, a first media item identified based on the first search parameter. The method further includes, while providing the first media item, receiving a second natural-language speech input and determining whether the second input corresponds to a user intent of refining the request for media. The method further includes, in accordance with a determination that the second speech input corresponds to a user intent of refining the request for media: identifying, based on the first parameter and the second speech input, a second media item and providing the second media item.
-
6.
Publication No.: US12008310B2
Publication date: 2024-06-11
Application No.: US17846355
Filing date: 2022-06-22
Applicant: Microsoft Technology Licensing, LLC
Inventor: Donald E. Owen , Mehmet Mert Öz , Garret N. Erskine
IPC: G16H40/20 , A61B5/00 , G06F3/16 , G06F16/635 , G06F16/683 , G06F16/904 , G06F18/25 , G06F21/62 , G06F40/174 , G06F40/30 , G06F40/40 , G06K19/077 , G06N3/006 , G06T7/00 , G06V20/10 , G06V20/52 , G06V40/10 , G06V40/16 , G06V40/20 , G10L15/08 , G10L17/00 , G10L21/0232 , G11B27/10 , G16B50/00 , G16H10/20 , G16H10/60 , G16H15/00 , G16H30/00 , G16H30/20 , G16H30/40 , G16H40/60 , G16H40/63 , G16H50/20 , G16H50/30 , G16H80/00 , G16Y20/00 , H04L51/02 , H04L51/222 , H04N7/18 , H04R1/32 , H04R3/00 , H04R3/12 , G10L15/18 , G10L15/22 , G10L15/26 , G10L21/0208 , H04R1/40 , H04R3/02
CPC classification number: G06F40/174 , A61B5/7405 , G06F3/16 , G06F16/637 , G06F16/685 , G06F16/904 , G06F18/25 , G06F21/6245 , G06F40/30 , G06F40/40 , G06K19/07762 , G06N3/006 , G06T7/00 , G06V20/10 , G06V20/52 , G06V40/103 , G06V40/16 , G06V40/172 , G06V40/23 , G10L15/08 , G10L17/00 , G10L21/0232 , G11B27/10 , G16B50/00 , G16H10/20 , G16H10/60 , G16H15/00 , G16H30/00 , G16H30/20 , G16H30/40 , G16H40/20 , G16H40/60 , G16H40/63 , G16H50/20 , G16H50/30 , G16H80/00 , G16Y20/00 , H04L51/02 , H04L51/222 , H04N7/183 , H04R1/326 , H04R3/005 , H04R3/12 , G06T2207/10024 , G06T2207/10044 , G06T2207/10048 , G06T2207/10116 , G06T2207/10132 , G10L15/1815 , G10L15/22 , G10L15/26 , G10L2021/02082 , H04N7/181 , H04R1/406 , H04R3/02 , H04R2420/07 , H04S2400/15
Abstract: A method, computer program product, and computing system for compartmentalizing a virtual assistant is executed on a computing device and includes obtaining encounter information via a compartmentalized virtual assistant during a patient encounter, wherein the compartmentalized virtual assistant includes a core functionality module. One or more additional functionalities are added to the compartmentalized virtual assistant on an as-needed basis.
-
7.
Publication No.: US12001472B2
Publication date: 2024-06-04
Application No.: US18194260
Filing date: 2023-03-31
Applicant: Gracenote, Inc.
Inventor: Aneesh Vartakavi , Casper Lützhøft Christensen
IPC: G06F16/00 , G06F16/335 , G06F16/35 , G06F16/383 , G06F16/683 , G06F16/783 , G06F30/27 , G06F40/00 , G10L15/00
CPC classification number: G06F16/383 , G06F16/335 , G06F16/35 , G06F16/685 , G06F16/7844 , G06F30/27 , G06F40/00 , G10L15/00
Abstract: A method and system for computer-based generation of podcast metadata, to facilitate operations such as searching for and recommending podcasts based on the generated metadata. In an example method, a computing system obtains a text representation of a podcast episode and obtains person data defining a list of person names such as celebrity names. The computing system then correlates the person data with the text representation, to find a match between a listed person name and a text string in the text representation. Further, the computing system predicts a named-entity span in the text representation and determines that the predicted named-entity span matches a location of the text string in the text representation of the podcast episode, and based on this determination, the computing system generates and outputs metadata that associates the person name with the podcast episode.
-
8.
Publication No.: US20240177695A1
Publication date: 2024-05-30
Application No.: US18242054
Filing date: 2023-09-05
Applicant: Tree Goat Media, Inc.
Inventor: Michael Kakoyiannis , Sherry Mills , Christoforos Lambrou , Vladimir Canic , Srdjan Jovanovic
IPC: G10H1/00 , G06F16/68 , G06F16/683
CPC classification number: G10H1/0008 , G06F16/685 , G06F16/686 , G10H2220/106
Abstract: A system for platform-independent visualization of audio content, in particular audio tracks utilizing a central computer system in communication with user devices via a computer network. The central system utilizes various algorithms to identify spoken content from audio tracks and selects visual assets associated with the identified content. Thereafter, a visualized audio track is available for users to listen and view. Audio tracks, for example Podcasts, may be segmented into topical audio segments based upon themes or topics, with segments from disparate podcasts combined into a single listening experience, based upon certain criteria, e.g., topics, themes, keywords, and the like.
-
9.
Publication No.: US20240171620A1
Publication date: 2024-05-23
Application No.: US18538847
Filing date: 2023-12-13
Applicant: Rovi Guides, Inc.
Inventor: Vikram Makam Gupta , Madhusudhan Seetharam
IPC: H04L65/401 , G06F16/683 , G06F40/35 , G06Q10/105 , G06Q10/109 , G06V40/16 , G06V40/20 , H04L12/18 , H04L65/1069 , H04L65/1093 , H04M3/56
CPC classification number: H04L65/4015 , G06F16/685 , G06F40/35 , G06Q10/105 , G06Q10/109 , G06V40/174 , G06V40/20 , H04L12/1822 , H04L12/1831 , H04L65/1069 , H04L65/1093 , H04M3/568 , H04M2201/40
Abstract: Systems and methods for creating and managing a breakout conference for a primary conference are disclosed. The system monitors communications between participants of a primary conference to determine if a) participants have a disagreement that needs to be resolved or b) if a topic from the meeting agenda requires additional time for discussion. Participant language, including negations and repetitive word usage, job profiles, body language, overlapping voice signals, among other factors, are monitored to determine if a disagreement exists. If a disagreement exists or additional time is required, the system automatically creates a virtual breakout session, determines the topic that created the disagreement, determines participants associated with the disagreed topic, and moves them to the breakout session. The system also provides meeting tools such that participants in the primary conference may communicate and alert participants in the breakout session, and vice versa, without leaving their respective sessions.
-
10.
Publication No.: US20240134908A1
Publication date: 2024-04-25
Application No.: US18326261
Filing date: 2023-05-30
Applicant: QUALCOMM Incorporated
Inventor: Rehana MAHFUZ , Yinyi GUO , Erik VISSER
IPC: G06F16/683 , G06F16/632 , G06F16/638 , G06F16/68
CPC classification number: G06F16/685 , G06F16/632 , G06F16/638 , G06F16/686
Abstract: A device includes one or more processors configured to generate one or more query caption embeddings based on a query. The processor(s) are further configured to select one or more caption embeddings from among a set of embeddings associated with a set of media files of a file repository. Each caption embedding represents a corresponding sound caption, and each sound caption includes a natural-language text description of a sound. The caption embedding(s) are selected based on a similarity metric indicative of similarity between the caption embedding(s) and the query caption embedding(s). The processor(s) are further configured to generate search results identifying one or more first media files of the set of media files. Each of the first media file(s) is associated with at least one of the caption embedding(s).
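The selection step in this abstract (choosing caption embeddings by a similarity metric and returning the associated media files) might be sketched as follows. The abstract does not name the metric, so cosine similarity is an assumption here, as are the `search` function, the `(media_file, embedding)` index layout, and the hand-written toy vectors that stand in for the output of a real embedding model.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query_embedding, caption_index, top_k=2):
    """Return up to top_k distinct media files, ordered by best-matching caption.

    caption_index: list of (media_file, caption_embedding) pairs; one media
    file may appear several times if it has several sound captions.
    """
    ranked = sorted(
        caption_index,
        key=lambda item: cosine(query_embedding, item[1]),
        reverse=True,
    )
    results = []
    for media_file, _ in ranked:
        if media_file not in results:
            results.append(media_file)
        if len(results) == top_k:
            break
    return results


# Toy index: three media files with made-up 3-dimensional caption embeddings.
index = [
    ("dog.wav", [1.0, 0.0, 0.0]),
    ("rain.wav", [0.0, 1.0, 0.0]),
    ("street.wav", [0.7, 0.7, 0.1]),
]
```

With the toy index above, a query embedding close to the first axis, such as `[0.9, 0.1, 0.0]`, ranks `dog.wav` ahead of `street.wav` and `rain.wav`.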